Balancing the trade-off between accuracy and interpretability in software defect prediction (ASE 2019 - Journal First Presentations)

Who

Toshiki Mori, Naoshi Uchihira

Track

ASE 2019 Journal First Presentations

Time Zone

The program is currently displayed in (GMT-08:00) Tijuana, Baja California.

Use conference time zone: (GMT-08:00) Tijuana, Baja CaliforniaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Wed 13 Nov 2019 17:00 - 17:20 at Cortez 1 - Prediction Chair(s): Xin Xia

Abstract

Context: Classification techniques of supervised machine learning have been successfully applied to various domains of practice. When building a predictive model, there are two important criteria: predictive accuracy and interpretability, which generally have a trade-off relationship. In particular, interpretability should be accorded greater emphasis in the domains where the incorporation of expert knowledge into a predictive model is required. Objective: The aim of this research is to propose a new classification model, called superposed naive Bayes (SNB), which transforms a naive Bayes ensemble into a simple naive Bayes model by linear approximation. Method: In order to evaluate the predictive accuracy and interpretability of the proposed method, we conducted a comparative study using well-known classification techniques such as rule-based learners, decision trees, regression models, support vector machines, neural networks, Bayesian learners, and ensemble learners, over 13 real-world public datasets. Results: A trade-off analysis between the accuracy and interpretability of different classification techniques was performed with a scatter plot comparing relative ranks of accuracy with those of interpretability. The experiment results show that the proposed method (SNB) can produce a balanced output that satisfies both accuracy and interpretability criteria. Conclusions: SNB offers a comprehensible predictive model based on a simple and transparent model structure, which can provide an effective way for balancing the trade-off between accuracy and interpretability.

Link to Publication

https://link.springer.com/article/10.1007/s10664-018-9638-1

File attachments

Balancing the Trade-off between Accuracy and Interpretability in Software Defect Prediction (ASE2019_20191112a.pdf)	927KiB

Toshiki Mori

Corporate Software Engineering & Technology Center, Toshiba Corporation

Japan

Naoshi Uchihira

School of Knowledge Science, Japan Advanced Institute of Science and Technology (JAIST)