TY - GEN
T1 - Instant code clone search
AU - Lee, Mu Woong
AU - Roh, Jong Won
AU - Hwang, Seung Won
AU - Kim, Sunghun
PY - 2010
Y1 - 2010
N2 - In this paper, we propose a scalable instant code clone search engine for large-scale software repositories. While there are commercial code search engines available, they treat software as text and often fail to find semantically related code. Meanwhile, existing tools for semantic code clone searches take a "post-mortem" approach involving the detection of clones "after" the code development is completed, and hence, fail to return the results instantly. In clear contrast, we combine the strength of these two lines of existing research, by supporting instant code clone detection. To achieve this goal, we propose scalable indexing structures on vector abstractions of code. Our proposed algorithms allow developers to detect clones of a given code segment among the 1.7 million code segments from 492 open source projects in sub-second response times, without compromising the accuracy obtained by a state-of-the-art tool.
AB - In this paper, we propose a scalable instant code clone search engine for large-scale software repositories. While there are commercial code search engines available, they treat software as text and often fail to find semantically related code. Meanwhile, existing tools for semantic code clone searches take a "post-mortem" approach involving the detection of clones "after" the code development is completed, and hence, fail to return the results instantly. In clear contrast, we combine the strength of these two lines of existing research, by supporting instant code clone detection. To achieve this goal, we propose scalable indexing structures on vector abstractions of code. Our proposed algorithms allow developers to detect clones of a given code segment among the 1.7 million code segments from 492 open source projects in sub-second response times, without compromising the accuracy obtained by a state-of-the-art tool.
UR - http://www.scopus.com/inward/record.url?scp=78751519375&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=78751519375&partnerID=8YFLogxK
U2 - 10.1145/1882291.1882317
DO - 10.1145/1882291.1882317
M3 - Conference contribution
AN - SCOPUS:78751519375
SN - 9781605587912
T3 - Proceedings of the ACM SIGSOFT Symposium on the Foundations of Software Engineering
SP - 167
EP - 176
BT - Proceedings of the 18th ACM SIGSOFT International Symposium on Foundations of Software Engineering, FSE-18
T2 - 18th ACM SIGSOFT International Symposium on the Foundations of Software Engineering, FSE-18
Y2 - 7 November 2010 through 11 November 2010
ER -