Learning from Examples to Find Fully Qualified Names of API Elements in Code Snippets (ASE 2019 - Research Papers)

Blogs (1) >>

Sun 10 - Fri 15 November 2019 San Diego, California, United States

Who

C M Khaled Saifullah, Muhammad Asaduzzaman, Chanchal K. Roy

Track

ASE 2019 Research Papers

Time Zone

The program is currently displayed in (GMT-08:00) Tijuana, Baja California.

Use conference time zone: (GMT-08:00) Tijuana, Baja CaliforniaSelect other time zone

The GMT offsets shown reflect the offsets at the moment of the conference.

Time Band

By setting a time band, the program will dim events that are outside this time window. This is useful for (virtual) conferences with a continuous program (with repeated sessions).
The time band will also limit the events that are included in the personal iCalendar subscription service.

Display full programSpecify a time band

Save

When

Tue 12 Nov 2019 16:40 - 17:00 at Cortez 2&3 - Code and Artifact Analysis Chair(s): Sarah Nadi

Abstract

Developers often reuse code snippets from online forums, such as Stack Overflow, GitHub Gists to learn API usages of software frameworks or libraries. Those code snippets often have ambiguous undeclared external references. This makes it difficult to learn and use those APIs correctly. Reusing those code snippets to solve development tasks also requires resolving external references of those APIs. However, manually resolving fully qualified names (FQN) of API elements is a non-trivial task. In this paper, we propose a novel context-sensitive technique, COSTER, to resolve FQNs of API elements in those code snippets. The technique collects locally specific source code elements as well as globally related tokens as the context of FQNs, calculate association score, and build an occurrence likelihood dictionary. While inferring an API element, it collects the code context and ranks candidate FQNs from the dictionary by considering the association score of the tokens in the context, similarity between the context, and similarity between the API element. Evaluation with code examples collected from GitHub and Stack Overflow posts shows that our proposed technique improves precision and recall by 3-18% compared to existing state-of-the-art techniques. The proposed technique significantly reduces the training time compared to the StatType, a state-of-the-art technique, without sacrificing accuracy. Extensive analyses on results establish the facts of the robustness of the proposed technique.

Link to Preprint

https://drive.google.com/file/d/1IeIqYZDcZVDTTXXGwGJ1k6a0q5dbNkQt/view?usp=sharing

C M Khaled Saifullah

Department of Computer Science, University of Saskatchewan

Canada

Muhammad Asaduzzaman

Postdoctoral Research Fellow, Software Analysis and Intelligence Lab, Queen's University, Canada

Chanchal K. Roy

University of Saskatchewan