[3] X. Ren, Y. Sun, C. Yi, K. Zhang, J. Guo, J. Du, H. Yang*. What's Missing in Autonomous Scientistic Discovery? A Systematization of Systems, Benchmarks, and Verification. [pdf]
[2] R. Bhatnagar, Y. Sun, C. A. Zhang, Y. Wen*, H. Yang*. HALT: Hallucination Assessment via Latent Testing. [pdf]
[1] K. V. Bodla, R. Bhatnagar, Haizhao Yang. Manifold-based Sampling for In-Context Hallucination Detection in Large Language Models. [pdf]