SocialSearch: Enhancing entity search with social network matching

Gae Won You, Seung Won Hwang, Zaiqing Nie, Ji Rong Wen

Research output: Chapter in Book/Report/Conference proceedingConference contribution

14 Citations (Scopus)


This paper introduces the problem of matching people names to their corresponding social network identities such as their Twitter accounts. Existing tools for this purpose build upon naive textual matching and inevitably suffer low precision, due to false positives (e.g., fake impersonator accounts) and false negatives (e.g., accounts using nicknames). To overcome these limitations, we leverage "relational" evidences extracted from the Web corpus. In particular, as such an example, weadopt Web document co-occurrences, which can be interpreted as an "implicit" counterpart of Twitter follower relationships. Using both textual and relational features, we learn a ranking function aggregating these features for the accurate ordering of candidate matches. Another key contribution of this paper is to formulate confidence scoring as a separate problem from relevance ranking. A baseline approach is to use the relevance of the top match itself as the confidence score. In contrast, we train a separate classifier, using not only the top relevance score but also various statistical features extracted from the relevance scores of all candidates, and empirically validate to outperform the baseline approach. We evaluate our proposed system using real-life internetscale entity-relationship and social network graphs.

Original languageEnglish
Title of host publicationAdvances in Database Technology - EDBT 2011
Subtitle of host publication14th International Conference on Extending Database Technology, Proceedings
PublisherAssociation for Computing Machinery
Number of pages6
ISBN (Print)9781450305280
Publication statusPublished - 2011
Event14th International Conference on Extending Database Technology: Advances in Database Technology, EDBT 2011 - Uppsala, Sweden
Duration: 2011 Mar 222011 Mar 24

Publication series

NameACM International Conference Proceeding Series


Other14th International Conference on Extending Database Technology: Advances in Database Technology, EDBT 2011

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
  • Computer Vision and Pattern Recognition
  • Computer Networks and Communications


Dive into the research topics of 'SocialSearch: Enhancing entity search with social network matching'. Together they form a unique fingerprint.

Cite this