A methodology to measure the semantic similarity between words based on the formal concept analysis

Yewon Jeong, Yiyeon Yoon, Dongkyu Jeon, Youngsang Cho, Wooju Kim

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Web users often find it difficult to locate the information they want on the internet, despite the abundance of useful information, because finding it takes considerable time and effort. Query expansion is considered one way to address this problem. It is the process of reformulating a query to improve retrieval performance in information retrieval operations. Synonym identification is one of several query expansion techniques. This paper therefore proposes a method to measure the semantic similarity between two words using keyword-based web documents. Formal concept analysis and our proposed expansion algorithm are used to estimate the similarity between two words. To evaluate the performance of our method, we conducted two experiments. The results show that the average similarity between synonym pairs is much higher than that between random pairs. Our method also performs well in comparison with other methods. The suggested method thus contributes to finding synonyms among a large number of candidate words.
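
As a rough illustration of the ingredients named in the abstract, the sketch below builds a toy formal context from keyword-tagged documents, enumerates its formal concepts, and scores two words by the overlap of the documents they occur in. It is not the paper's expansion algorithm, which the abstract does not describe. The document data, the function names, and the choice of Jaccard overlap as the similarity score are all assumptions made for illustration.

```python
from itertools import combinations

# Hypothetical toy formal context: each document (object) maps to the
# keywords (attributes) it is tagged with. The data is invented.
CONTEXT = {
    "doc1": {"car", "automobile", "engine"},
    "doc2": {"car", "automobile", "road"},
    "doc3": {"car", "engine"},
    "doc4": {"banana", "fruit"},
}


def extent(keywords, context=CONTEXT):
    """Return the documents that contain every keyword in `keywords`."""
    return frozenset(d for d, kws in context.items() if keywords <= kws)


def intent(docs, context=CONTEXT):
    """Return the keywords shared by every document in `docs`."""
    sets = [context[d] for d in docs]
    if not sets:
        # By convention, the empty set of objects shares all attributes.
        return frozenset(set().union(*context.values()))
    return frozenset(set.intersection(*sets))


def formal_concepts(context=CONTEXT):
    """Enumerate all (extent, intent) pairs by closing every keyword subset.

    Brute force: exponential in the number of keywords, so only suitable
    for a toy context like the one above.
    """
    attributes = sorted(set().union(*context.values()))
    concepts = set()
    for size in range(len(attributes) + 1):
        for combo in combinations(attributes, size):
            ext = extent(set(combo), context)
            concepts.add((ext, intent(ext, context)))
    return concepts


def jaccard_similarity(word_a, word_b, context=CONTEXT):
    """Assumed similarity score: Jaccard overlap of the two words' extents."""
    docs_a = extent({word_a}, context)
    docs_b = extent({word_b}, context)
    union = docs_a | docs_b
    if not union:
        return 0.0
    return len(docs_a & docs_b) / len(union)


if __name__ == "__main__":
    print(len(formal_concepts()))
    print(jaccard_similarity("car", "automobile"))
    print(jaccard_similarity("car", "banana"))
```

In this toy context the synonym-like pair ("car", "automobile") shares most of its documents, while ("car", "banana") shares none, which mirrors the abstract's comparison of synonym pairs against random pairs in spirit only.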

Original language: English
Title of host publication: WEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies
Publisher: SciTePress
Pages: 313-321
Number of pages: 9
ISBN (Print): 9789897580246
Publication status: Published - 2014
Event: 10th International Conference on Web Information Systems and Technologies, WEBIST 2014 - Barcelona, Spain
Duration: 2014 Apr 3 → 2014 Apr 5

Publication series

Name: WEBIST 2014 - Proceedings of the 10th International Conference on Web Information Systems and Technologies
Volume: 2

Other

Other: 10th International Conference on Web Information Systems and Technologies, WEBIST 2014
Country/Territory: Spain
City: Barcelona
Period: 14/4/3 → 14/4/5

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
