Exploring scientific trajectories of a large-scale dataset using topic-integrated path extraction

Erin H.J. Kim, Yoo Kyung Jeong, Yong Hwan Kim, Min Song

Research output: Contribution to journalArticlepeer-review

7 Citations (Scopus)


Main path analysis (MPA) is the most widely accepted approach to tracing knowledge transfer in a research field. In this study, we extracted multiple longest paths from the multidisciplinary academic field's citation network and integrating topic modeling to the extracted paths. We consider three main aspects of trajectory analysis when analyzing the represented documents through the extracted paths: emergence, authority, and topic dynamics. For path extraction, we adopt the longest path algorithm that consists of the following three steps: 1) topological sort, 2) edge relaxation, and 3) multiple path extraction. For topic integration into multiple paths, we employ latent Dirichlet allocation (LDA) by utilizing the topic-document matrix that LDA derives to select an article's topic from the citation network, where each article is labeled with the topic that is assigned with the highest topical probability for that article. We conduct a series of experiments to examine the results on a dataset from the field of healthcare informatics that PubMed provides.

Original languageEnglish
Article number101242
JournalJournal of Informetrics
Issue number1
Publication statusPublished - 2022 Feb

Bibliographical note

Publisher Copyright:
© 2021 Elsevier Ltd

All Science Journal Classification (ASJC) codes

  • Computer Science Applications
  • Library and Information Sciences


Dive into the research topics of 'Exploring scientific trajectories of a large-scale dataset using topic-integrated path extraction'. Together they form a unique fingerprint.

Cite this