Active Learning for Animal Re-Identification with Ambiguity-Aware Sampling
Depanshu Sani, Mehar Khurana and Saket Anand
(Accepted at AAAI 2026)
Depanshu Sani, Mehar Khurana and Saket Anand
(Accepted at AAAI 2026)
Animal re-identification (Re-ID) has recently gained substantial attention in the AI research community due to its high impact on biodiversity monitoring and unique research challenges arising from environmental factors. The subtle distinguishing patterns like stripes or spots, handling new species and the inherent open-set nature make the problem even harder. To address these complexities, foundation models trained on labeled, large-scale and multi-species animal Re-ID datasets have recently been introduced to enable zeroshot Re-ID. However, our benchmarking reveals significant gaps in their zero-shot Re-ID performance for both known and unknown species. While this highlights the need for collecting labeled data in new domains, exhaustive annotation for Re-ID is laborious and requires domain expertise. Our analyses also show that existing unsupervised (USL) and active learning (AL) Re-ID methods underperform for animal Re-ID. To address these limitations, we introduce a novel AL Re-ID framework that leverages complementary clustering methods to uncover and target structurally ambiguous regions in the embedding space for mining pairs of samples that are both informative and broadly representative of the visual space. Oracle feedback on these pairs, in the form of must-link and cannot-link constraints, facilitates a simple annotation interface, which naturally integrates with existing USL methods through our proposed constrained clustering refinement algorithm. Through extensive experiments, we demonstrate that, by utilizing only 0.1% of all possible annotations, our approach consistently outperforms existing foundational, USL and AL baselines for animal Re-ID. Specifically, we report an average improvement of 10.49%, 11.19% and 3.99% (mAP) on 13 wildlife datasets over foundational, USL and AL methods, respectively, while attaining state-of-the-art performance on each dataset. Furthermore, we also show an improvement of 11.09%, 8.2% and 2.06% (AUC ROC) for unknown individuals in an open-world setting. For completeness, we also present a comparative analysis on 2 publicly available person Re-ID datasets, showing average gains of 7.96% and 2.86% (mAP) over existing USL and state-of-theart AL Re-ID methods.
This work was supported by ANRF (SERB), Govt. of India, under grant no. CRG/2020/006049. The authors acknowledge the compute infrastructure support from the Infosys Center for Artificial Intelligence at IIIT-Delhi. The authors would like to thank Anjaneya Sharma from IIIT-Delhi for his support in running additional analysis experiments. Lastly, the authors are grateful to the team at Tiger Cell, Wildlife Institute of India, Dehradun, India, for the collaboration that led to this work.
@misc{sani2025activelearninganimalreidentification,
title={Active Learning for Animal Re-Identification with Ambiguity-Aware Sampling},
author={Depanshu Sani and Mehar Khurana and Saket Anand},
year={2025},
eprint={2511.06658},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2511.06658},
}
For any queries, reach out to us at depanshus@iiitd.ac.in