Effectiveness: The results focus on the specific attack, architecture, and quantity level are as follows.
(1) Criteria based on neuron weight (CLP and ANP). Across three datasets, CLP exhibits significant declines from narrow to large levels, whereas ANP remains more stable.
(2) General localization (SLICER and deepmufl). Overall, their effectiveness increases as the defect level increases. Specifically, SLICER performs well on the models (especially the VGG series) injected by the SRA backdoor attack across three datasets, occasionally even surpassing ANP/CLP. This can be attributed to the SRA injection method, which isolates the infected neurons from clean predictions.
(3) Activation-based criteria (NC and FP). In general, NC demonstrates effectiveness in narrow-level injected sub-networks but fails to identify infected neurons at larger levels, possibly due to the fewer infected neurons leading to greater activation differences. Conversely, FP consistently exhibits weaker performance.