CLCDSA: Cross Language Code Clone Detection using Syntactical Features and API Documentation
Code clones are already proven as harmful for maintenance and evaluation of software systems such as development and modification of source code, propagation of potential bugs throughout software system and so on. Now a days, a single software system is developing in various programming languages for greater adaptability. In these systems a single functionality is replicated among their varieties; often implemented in different programming languages (cross language clones(CLC)). Consequences of these clones are more severe as they are hard to detect and hard to track their modifications. However, while there are a great many studies for finding clones in the same programming language, there is a marked lack of studies in detecting CLCs. To fill this gap, in this paper, we are going to propose CLCDSA, a CLC detector which can detect CLCs despite any prerequisites or intermediate states. This model analyses different syntactical features of source codes across different programming languages to detect CLC. To support large scale clone data, this model comprises an action filter based on cross language API calls similarity to discard non-potential clones before proceed to the main model. The design methodologies of CLCDSA is two-folded: a. it can detect CLC On the Fly by calculating features’ value similarity; b. it poses a deep neural network based feature vector learning model to learn the features and detect CLC. An early evaluation of this model observed an average precision, recall and F-measure score of 0.55, 0.86 and 0.64 respectively for the On the Fly phase and 0.61, 0.93 and 0.71 for the neural net phase which indicate that CLCDSA has outperformed all the available models in detecting cross language clones.
Thu 14 NovDisplayed time zone: Tijuana, Baja California change
16:00 - 17:40 | Untangling and MergingResearch Papers at Cortez 2&3 Chair(s): Iftekhar Ahmed University of California at Irvine, USA | ||
16:00 20mTalk | The Impact of Structure on Software Merging: Semistructured versus Structured Merge Research Papers Guilherme Cavalcanti Federal University of Pernambuco, Brazil, Paulo Borba Federal University of Pernambuco, Brazil, Georg Seibt University of Passau, Sven Apel Saarland University Pre-print | ||
16:20 20mTalk | Semistructured Merge in JavaScript Systems Research Papers Alberto Trindade Tavares Federal University of Pernambuco, Paulo Borba Federal University of Pernambuco, Brazil, Guilherme Cavalcanti Federal University of Pernambuco, Brazil, Sergio Soares Federal University of Pernambuco Pre-print | ||
16:40 20mTalk | CLCDSA: Cross Language Code Clone Detection using Syntactical Features and API Documentation Research Papers Kawser Nafi University of Saskatchewan, Tonny Shekha Kar University of Saskatchewan, Canada, Banani Roy University of Saskatchewan, Chanchal K. Roy University of Saskatchewan, Kevin Schneider University of Saskatchewan | ||
17:00 20mTalk | B2SFinder: Detecting Open-Source Software Reuse in COTS Software Research Papers Muyue Feng Institute of Information Engineering, Chinese Academy of Sciences, Zimu Yuan Institute of Information Engineering, Chinese Academy of Sciences, Feng Li Institute of Computing Technology at Chinese Academy of Sciences, China, Gu Ban Institute of Information Engineering, Chinese Academy of Sciences, Yang Xiao Institute of Information Engineering, Chinese Academy of Sciences & School of Cyber Security, University of Chinese Academy of Sciences, Shiyang Wang Institute of Information Engineering, Chinese Academy of Sciences, Qian Tang Institute of Information Engineering, Chinese Academy of Sciences, He Su Institute of Information Engineering, Chinese Academy of Sciences, Chendong Yu University of Chinese Academy of Sciences, Jiahuan Xu Institute of Information Engineering, Chinese Academy of Sciences, Aihua Piao Institute of Information Engineering, Chinese Academy of Sciences, Jingling Xue UNSW Sydney, Wei Huo Institute of Information Engineering, Chinese Academy of Sciences | ||
17:20 20mTalk | CoRA: Decomposing and Describing Tangled Code Changes for Reviewer Research Papers Min Wang Peking University, Zeqi Lin Microsoft Research, China, Yanzhen Zou Peking University, Bing Xie Peking University |