Defect prediction models focus on identifying defect-prone code elements, for example, to allow practitioners to allocate testing resources on specific subsystems and to provide assistance during code reviews. While the research community has been highly active in proposing metrics and methods to predict defects on long-term periods (i.e., at release time), a recent trend is represented by the so-called short-term defect prediction (i.e., at commit-level). Indeed, this strategy represents an effective alternative in terms of effort required to inspect files likely affected by defects. Nevertheless, the granularity considered by such models might be still too coarse. Indeed, existing commit-level models highlight an entire commit as defective even in cases where only specific files actually contain defects.
In this paper, we first investigate to what extent commits are partially defective; then, we propose a novel fine-grained just-in-time defect prediction model to predict the specific files, contained in a commit, that are defective. Finally, we evaluate our model in terms of (i) performance and (ii) the extent to which it decreases the effort required to diagnose a defect. Our study highlights that: (1) defective commits are frequently composed of a mixture of defective and non-defective files, (2) our fine-grained model can accurately predict defective files with an AUC-ROC up to 82% and (3) our model would allow practitioners to save inspection efforts with respect to standard just-in-time techniques.
Wed 13 NovDisplayed time zone: Tijuana, Baja California change
16:00 - 17:40 | PredictionResearch Papers / Journal First Presentations at Cortez 1 Chair(s): Xin Xia Monash University | ||
16:00 20mTalk | Predicting Licenses for Changed Source Code Research Papers Xiaoyu Liu Department of Computer Science and Engineering, Southern Methodist University, Liguo Huang Dept. of Computer Science, Southern Methodist University, Dallas, TX, 75205, Jidong Ge State Key Laboratory for Novel Software and Technology, Nanjing University, Vincent Ng Human Language Technology Research Institute, University of Texas at Dallas, Richardson, TX 75083-0688 | ||
16:20 20mTalk | Empirical evaluation of the impact of class overlap on software defect prediction Research Papers Lina Gong China University of Mining and Technology, Shujuan Jiang China University of Mining and Technology, Rongcun Wang China University of Mining and Technology, Li Jiang China University of Mining and Technology | ||
16:40 20mTalk | Combining Program Analysis and Statistical Language Model for Code Statement Completion Research Papers Son Nguyen The University of Texas at Dallas, Tien N. Nguyen University of Texas at Dallas, Yi Li New Jersey Institute of Technology, USA, Shaohua Wang New Jersey Institute of Technology, USA | ||
17:00 20mTalk | Balancing the trade-off between accuracy and interpretability in software defect prediction Journal First Presentations Toshiki Mori Corporate Software Engineering & Technology Center, Toshiba Corporation, Naoshi Uchihira School of Knowledge Science, Japan Advanced Institute of Science and Technology (JAIST) Link to publication File Attached | ||
17:20 20mTalk | Fine-grained just-in-time defect prediction Journal First Presentations Luca Pascarella Delft University of Technology, Fabio Palomba Department of Informatics, University of Zurich, Alberto Bacchelli University of Zurich Link to publication |