The detailed statistics of this RQ are listed as follows. The table shows the performance of each evaluated ablation method. If a method succeeded within 10 tries on a test case, it was considered successful.
In summary, each type of mutation contributes differently to SpecGen. The comparative mutation contributes the most to the final performance while the predicative and arithmetic are less important. However, when combining them all, SpecGen achieves the best performance.