RQ2: What is the complementarity of different representations?

In this Research Question we investigate the complementarity of the four representations. In particular, we compute:

  • Complementarity Metrics

  • Correlation

Complementarity Metrics

The following tables show the complementarity metrics (i.e., Intersection, Difference and Exclusive TP candidates) among the different code representations models.

Please switch the sheet (at the bottom) to visualize the different metrics.

Methods

Methods

Classes

Classes

Correlation

The following dataset contains all the distances computed by each representation for each and every pair of code artifacts (i.e., method or class) in the Projects dataset. Each row of the csv file (corresponding to a pair of code artifacts) contains four fields:

  • Identifier: distance of the pair computed using the identifier-based model;

  • AST: distance of the pair computed using the AST-based model;

  • CFG: distance of the pair computed using the CFG-based model;

  • Bytecode: distance of the pair computed using the Bytecode-based model;

Correlation analysis has been performed on the whole dataset.