Abstract
Cocitation and co-word methods have long been used to detect and track emerging topics in scientific literature, but both have weaknesses. Recently, while many researchers have adopted generative probabilistic models for topic detection and tracking, few have compared generative probabilistic models with traditional cocitation and co-word methods in terms of their overall performance. In this article, we compare the performance of hierarchical Dirichlet process (HDP), a promising generative probabilistic model, with that of the 2 traditional topic detecting and tracking methods - cocitation analysis and co-word analysis. We visualize and explore the relationships between topics identified by the 3 methods in hierarchical edge bundling graphs and time flow graphs. Our result shows that HDP is more sensitive and reliable than the other 2 methods in both detecting and tracking emerging topics. Furthermore, we demonstrate the important topics and topic evolution trends in the literature of terrorism research with the HDP method.
Original language | English |
---|---|
Pages (from-to) | 2084-2097 |
Number of pages | 14 |
Journal | Journal of the Association for Information Science and Technology |
Volume | 65 |
Issue number | 10 |
DOIs | |
Publication status | Published - 2014 Oct 1 |
Bibliographical note
Publisher Copyright:© 2014 ASIS&T.
All Science Journal Classification (ASJC) codes
- Information Systems
- Computer Networks and Communications
- Information Systems and Management
- Library and Information Sciences