CGM: A biomedical text categorization approach using concept graph mining

Said Bleik, Min Song, Aaron Smalter, Jun Huan, Gerald Lushington

Research output: Chapter in Book/Report/Conference proceedingConference contribution

6 Citations (Scopus)

Abstract

Text Categorization is used to organize and manage biomedical text databases that are growing at an exponential rate. Feature representations for documents are a crucial factor for the performance of text categorization. Most of the successful existing techniques use a vector representation based on key entities extracted from the text. In this paper we investigate a new direction where we represent a document as a graph. In this representation we identify high level concepts and build a rich graph structure that contains additional concepts and relationships. We then use graph kernel techniques to perform text categorization. The results show a significant improvement in accuracy when compared to categorization based on only the extracted concepts.

Original languageEnglish
Title of host publicationProceedings - 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009
Pages38-43
Number of pages6
DOIs
Publication statusPublished - 2009
Event2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009 - Washington, DC, United States
Duration: 2009 Nov 12009 Nov 4

Publication series

NameProceedings - 2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009

Other

Other2009 IEEE International Conference on Bioinformatics and Biomedicine Workshops, BIBMW 2009
Country/TerritoryUnited States
CityWashington, DC
Period09/11/109/11/4

All Science Journal Classification (ASJC) codes

  • Biomedical Engineering
  • Health Informatics
  • Health Information Management

Fingerprint

Dive into the research topics of 'CGM: A biomedical text categorization approach using concept graph mining'. Together they form a unique fingerprint.

Cite this