Deep Learning (DL) has achieved significant success in socially critical decision-making applications but often exhibits unfair behaviors, raising social concerns. Among these unfair behaviors, individual discrimination, where a model yields different predictions for instance pairs whose profiles are identical except for sensitive attributes such as gender, race, and age, is especially harmful to society. Existing methods have made significant efforts to test for individual discrimination before deployment. However, their efficiency and effectiveness remain limited, particularly when evaluating relatively fair models. Moreover, while existing methods typically involve two distinct phases, a global phase and a local phase, it remains unclear which of the two warrants more intensive, and potentially costly, search methods such as gradient-based approaches during fairness testing.
To address these issues, we aim to understand the significance of each phase and to improve overall fairness testing performance. The critical role of the global phase in enhancing fairness testing motivates us to propose Genetic-Random Fairness Testing (GRFT), an effective and efficient method. In the global phase, GRFT uses a genetic algorithm to guide the search toward more global discriminatory instances. In the local phase, it applies a lightweight random search to explore the neighbors of these instances, avoiding time-consuming computations. Building on the fitness score, we further propose a straightforward yet effective repair approach. For a thorough evaluation, we conduct extensive experiments involving six testing methods, five datasets, 261 models (5 naively trained, 64 repaired, and 192 quantized for on-device deployment), and sixteen combinations of sensitive attributes, demonstrating the superior performance of GRFT and our repair method.
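For intuition, the sketch below illustrates the two-phase idea described above; it is a minimal illustration, not the paper's actual implementation. It assumes a scikit-learn-style tabular binary classifier with `predict`/`predict_proba`, numeric feature vectors with per-feature bounds `lo`/`hi`, and a single sensitive attribute at index `sens_idx`; the fitness definition (the probability gap across sensitive-attribute values) and all function names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(seed=42)

def is_discriminatory(model, x, sens_idx, sens_values):
    """An instance is individually discriminatory if changing only its
    sensitive attribute changes the model's predicted label."""
    labels = set()
    for v in sens_values:
        x2 = x.copy()
        x2[sens_idx] = v
        labels.add(int(model.predict(x2.reshape(1, -1))[0]))
    return len(labels) > 1

def fitness(model, x, sens_idx, sens_values):
    """Illustrative fitness: the largest gap in positive-class probability
    across sensitive-attribute values, so instances whose outputs react
    most to the sensitive attribute score highest."""
    probs = []
    for v in sens_values:
        x2 = x.copy()
        x2[sens_idx] = v
        probs.append(float(model.predict_proba(x2.reshape(1, -1))[0, 1]))
    return max(probs) - min(probs)

def global_phase(model, seeds, sens_idx, sens_values, lo, hi,
                 generations=50, mut_rate=0.1):
    """Global phase: a genetic algorithm evolves the population toward
    high-fitness regions, collecting discriminatory instances on the way."""
    pop, found = seeds.astype(float).copy(), []
    for _ in range(generations):
        found.extend(x for x in pop
                     if is_discriminatory(model, x, sens_idx, sens_values))
        scores = np.array([fitness(model, x, sens_idx, sens_values)
                           for x in pop]) + 1e-8
        # Fitness-proportionate parent selection.
        idx = rng.choice(len(pop), size=len(pop), p=scores / scores.sum())
        parents = pop[idx]
        # One-point crossover between consecutive parents.
        children = parents.copy()
        for i in range(0, len(children) - 1, 2):
            cut = int(rng.integers(1, children.shape[1]))
            children[i, cut:] = parents[i + 1, cut:]
            children[i + 1, cut:] = parents[i, cut:]
        # Uniform mutation within the per-feature bounds [lo, hi].
        mask = rng.random(children.shape) < mut_rate
        children[mask] = rng.uniform(lo, hi, size=children.shape)[mask]
        pop = children
    return found

def local_phase(model, global_found, sens_idx, sens_values, lo, hi,
                steps=100, step=1.0):
    """Local phase: a lightweight random search perturbs one non-sensitive
    feature at a time around each global discriminatory instance, with no
    gradient computation."""
    local = []
    for x in global_found:
        for _ in range(steps):
            j = int(rng.integers(len(x)))
            if j == sens_idx:
                continue
            x2 = x.copy()
            x2[j] = np.clip(x2[j] + rng.choice([-step, step]), lo[j], hi[j])
            if is_discriminatory(model, x2, sens_idx, sens_values):
                local.append(x2)
    return local
```

The division of labor mirrors the text: the costlier guided search is concentrated in the global phase, while the local phase stays cheap by relying on random perturbations instead of gradients.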
Due to the page limit, additional details on the experimental results are provided here: