Extracting and mining protein-protein interaction network from biomedical literature

Xiaohua Hu, Illhoi Yoo, Il Yeol Song, Min Song, Jianchao Han, Mark Lechner

Research output: Chapter in Book/Report/Conference proceedingConference contribution

15 Citations (Scopus)

Abstract

In this paper we present a biomedical literature data mining system SPIE-DM (Scalable and Portable Information Extraction and Data Mining) to extract and mine the protein-protein interaction network from biomedical literature such as MedLine. SPIE-DM consists of two phases: in Phase 1, we develop a Scalable and Portable IE method (SPIE) to extract the protein-protein interaction from the biomedical literature. These extracted protein-protein interactions form a scale-free network graph. In Phase 2, we apply a novel clustering method SFCluster to mine the protein-protein interaction network. The clusters in the network graph represent some potential protein complexes, which are very important for biologist to study the protein functionality. The clustering algorithm considers the characteristics of the scale-free network graphs and is based on the local density of the vertex and its neighborhood functions that can be used to find more meaningful clusters at different density levels. The experiments of SPIE-DM on around 1600 chromatin proteins indicate that our system is very promising for extracting and mining from biomedical literature databases.

Original languageEnglish
Title of host publicationProceedings of the 2004 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB'04
Pages244-251
Number of pages8
Publication statusPublished - 2004
EventProceedings of the 2004 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB'04 - La Jolla, CA, United States
Duration: 2004 Oct 72004 Oct 8

Publication series

NameProceedings of the 2004 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB'04

Other

OtherProceedings of the 2004 IEEE Symposium on Computational Intelligence in Bioinformatics and Computational Biology, CIBCB'04
Country/TerritoryUnited States
CityLa Jolla, CA
Period04/10/704/10/8

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Fingerprint

Dive into the research topics of 'Extracting and mining protein-protein interaction network from biomedical literature'. Together they form a unique fingerprint.

Cite this