IFIP ICCIDS 2020

Event - 04380

20-22 FEBRUARY 2020

SSN COLLEGE OF ENGINEERING, TAMIL NADU, INDIA

DEPARTMENT OF COMPUTER SCIENCE & ENGINEERING

MACHINE LEARNING RESEARCH GROUP

Venkatesh Vinayakarao

Chennai Mathematical Institute

Title: Code Variants Retrieval

Abstract:

Code variants represent alternative implementations of a code snippet, where each alternative provides the same functionality, but has different properties that make some of them better suited to the overall project requirements. Developers routinely need to analyze existing code, find better reuse alternatives, and look to develop high-quality code that meets some desired properties. However, searching for such code variants over the web has several challenges. Existing text-retrieval models do not work well on source code. Expressing natural language queries on source code is an open problem. Many query terms in natural language have multiple surface forms in source code.

We address this problem by perceiving source code as a collection of entities. In this talk, we present new techniques to search for code variants. The ability to perform semantic search over source code snippets assisted by developer knowledge in the form of discussion forum data opens up a new way to solve several important problems.