TY - GEN
T1 - Mining undiscovered public knowledge from complementary and non-interactive biomedical literature through semantic pruning
AU - Hu, Xiaohua
AU - Yoo, Illhoi
AU - Song, Min
AU - Zhang, Yanqing
AU - Song, Il Yeol
PY - 2005
Y1 - 2005
N2 - Two complementary and non-interactive literature sets of articles, when they are considered together, can reveal useful information of scientific interest not apparent in either of the two document sets. Swanson called the existence of such knowledge, undiscovered public knowledge (UDPK). This paper proposes a semantic-based mining model for UDPK. Our method replaces manual ad-hoc pruning with using semantic knowledge from the biomedical ontologies. Using the semantic types and semantic relationships of the biomedical concepts, our prototype system can identify the relevant concepts collected from Medline and generate the novel hypothesis between these concepts. The system successfully replicates Swanson's two famous discoveries: Raynaud disease/fish oils and migraine/magnesium. Compared with previous approaches, our methods generate much fewer but more relevant novel hypotheses, and require much less human intervention in the discovery procedure.
AB - Two complementary and non-interactive literature sets of articles, when they are considered together, can reveal useful information of scientific interest not apparent in either of the two document sets. Swanson called the existence of such knowledge, undiscovered public knowledge (UDPK). This paper proposes a semantic-based mining model for UDPK. Our method replaces manual ad-hoc pruning with using semantic knowledge from the biomedical ontologies. Using the semantic types and semantic relationships of the biomedical concepts, our prototype system can identify the relevant concepts collected from Medline and generate the novel hypothesis between these concepts. The system successfully replicates Swanson's two famous discoveries: Raynaud disease/fish oils and migraine/magnesium. Compared with previous approaches, our methods generate much fewer but more relevant novel hypotheses, and require much less human intervention in the discovery procedure.
UR - http://www.scopus.com/inward/record.url?scp=33745777620&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=33745777620&partnerID=8YFLogxK
U2 - 10.1145/1099554.1099611
DO - 10.1145/1099554.1099611
M3 - Conference contribution
AN - SCOPUS:33745777620
SN - 1595931406
SN - 9781595931405
T3 - International Conference on Information and Knowledge Management, Proceedings
SP - 249
EP - 250
BT - CIKM'05 - Proceedings of the 14th ACM International Conference on Information and Knowledge Management
PB - Association for Computing Machinery
T2 - CIKM'05 - Proceedings of the 14th ACM International Conference on Information and Knowledge Management
Y2 - 31 October 2005 through 5 November 2005
ER -