"Ontological Closure and Structural Limits on Systemic AI", Maneth Perera, Poojak Patel
"Leveraging Class Similarity for Enhanced Conformal Prediction", Ariel Fargion, Lahav Dabah, Tom Tirer
"When Uncertainty Isn't Enough: An Empirical Study of Self-Correction in Code Generation", Pranav Rakasi, Tinuade Adeleke, Arya Senthilkumar Palanivel, Maanas Lalwani, Arnav Srivastava, Sean Wu, Ruizhe Li
"Rethinking Uncertainty Evaluation in Large Language Models", Krish Matta, Atharv Naphade, Andy Zou
"Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?", Jean Rojas, Viraj Vilas Sawant, Nathan Allen, Nomgondalai Amgalanbaatar, Yannis Zongo, Kevin Zhu, Maheep Chaudhary
"Epistemic Gatekeeper: Training-Free One-Shot Gating for Open-World Industrial Anomaly Detection", Geonwoo Kim, Hyeonjun Kim, Chaein Oh, Suk-Ju Kang
"Open-World Sequential Belief Revision with Dynamic Functional Experts", David Scott Lewis
"What Intermediate Layers Know: Detecting Jailbreaks from Entropy Dynamics", Sofiia Nikolenko, Michele Papucci, Mina Rezaei, Shireen Kudukkil Manchingal
"Why Limit the Residual Stream to Layers and Not Tokens? Persistent Memory for Continuous Latent Reasoning", Mujtaba Farhan, Kevin Zhu, Maheep Chaudhary
"MAND: Modality-Aware Novelty Detection for Open-World Egocentric Activity Recognition", Hyejeong Im, Wonseon Lim, Dae-Won Kim
"Playing Devil's Advocate: Off-the-Shelf Persona Vectors Rival Targeted Steering for Sycophancy", Ishaan Kelkar, Nebras Alam, Vikram Kakaria, Kevin Zhu, Maheep Chaudhary
"Space-sampled Value Decay: Forgetting Mechanisms for Non-stationary Deep Reinforcement Learning", Felix Störck, Fabian Hinder, Barbara Hammer
"Task-Aware Calibration: Provably Optimal Decoding in LLMs", Tim Tomov, Dominik Fuchsgruber, Rajeev Verma, Stephan Günnemann
"Stable Miscalibration in Large Language Models: A Practical View of High-Confidence Errors", Akira Okutomi
"When Prediction Is Not Enough: Identifying Hidden Structure under Partial Observation", Jeongjun Lee, Laura Mascarell, Lancelot Da Costa, Ukyo T. Tazawa, Sundong Kim
"Pipeline-Aware Split Conformal Prediction: A Single-Quantile Reduction for Joint Coverage Under Compositional Uncertainty", Varun Kotte
"What Shapes Emergent Misalignment?", Yuchen Zhang, Anietta Weckauff, Diego Garcia-Olano, Maksym Andriushchenko
"Authority, Truth, and Citation Bias: A Large-Scale Multi-Domain Benchmark for Studying Epistemic Susceptibility in Large Language Models", Aryan Khurana, Aravind Ramana RN, Dhruv Kumar
"Conformal Steering of LLMs via Posterior Sampling", Nicolas Emmenegger, Theo X. Olausson, Armando Solar-Lezama, Chara Podimata
"Confidence Without Warrant: A Formal Theory of Epistemic Blind Spots in Learning Systems", Siddharth Karuturi, Kaustubh S. Bukkapatnam, Soham Batra, Mithil Shah, Tanush Ajay Shastry, Akshath Sharma, Laksh Patel, Aarav Lala, Neel N Shanbhag, Andrew Bae
"From Critic to Confidence: PPO for Language-Based Quantitative Prediction with Confidence Estimation", Mehak Preet Dhaliwal, Rasta Tadayon, Andong Hua, Haewon Jeong, Yao Qin
"How do Consonance Constructions Shape Conformal Credal Sets?", Chun Ho Chong, Kaizheng Wang, Michele Caprio, Siu Lun Chau
"The Mechanistic Invariance Test: Genomic Language Models Fail to Learn Positional Regulatory Logic", Bryan Cheng, Jasper Zhang
"When Symbolic Rules Cannot Find Their Own Triggers: An Epistemic Diagnosis of Predicate Grounding Collapse in Neuro-Symbolic Fraud Detection", Farjana Yesmin, Nusrat Shirmin
"Activation Probing for Uncertainty-Aware Tool Routing in Agentic LLMs", Bastien Zimmermann, Louis Hernandez, Clément Pierquin, Matthieu Boussard
"What Uncertainties Do We Need for Dynamical Systems?", Yusuf Sale, Christopher Bülte, Felix Czaja, Joshua Stiller, Eyke Hüllermeier
"Characterizing the Consistency of the Emergent Misalignment Persona", Anietta Weckauff, Yuchen Zhang, Maksym Andriushchenko
"Sharp Thresholds and Non-Gaussian Limits of Compositional Deep GPs", Mark Kozdoba, Shie Mannor
"Quantifying Subliminal Behavioral Transfer Ratios in Language Model Distillation", Uwe König, Hamza Kazmi, Ruizhe Li, Maheep Chaudhary
"Validation Windows: Epistemic Uncertainty Produced by the Solver in a Literature-Derived Bayesian Network", Deborah Vakas Duong, Igor Yi
"In-Context Learning for Latent Space Bayesian Optimization", Tuan A. Vu, Harri Lähdesmäki, Julien Martinelli
"Hybrid Adaptive Prediction Sets for Conformal Prediction", Soundouss Messoudi, Abdelhak Imoussaten
"When To Trust a Temporal Prior: Order-Aware Test-Time Adaptation with Likelihood-Ratio Abstention", Young Kyung Kim, Oded Schlesinger, Qiangqiang Wu, J. Matias Di Martino, Guillermo Sapiro
"LLM Doesn't Know What It Doesn't Know: Detecting Epistemic Blind Spots via Cross-Model Attribution Divergence on Clinical Tabular Data", Akshat Dasula, Prasanna Desikan, Jaideep Srivastava
"Stacking-Based Weighting for Large Language Bayes in M-Open Settings", Anant Bhide, Akarsh Gupta, Edmond Cunningham, Justin Domke
"Indirect Query Bayesian Optimization with Integrated Feedback", Mengyan Zhang, Shahine Bouabid, Cheng Soon Ong, Seth Flaxman, Dino Sejdinovic
"Decomposing Epistemic Uncertainty for Causal Decision Making", Md Musfiqur Rahman, Ziwei Jiang, Hilaf Hasson, Murat Kocaoglu
"Mechanistic Signatures of Recursive Reasoning in a Minimal Shared-Update Transformer Stack", Yi Liu, Ran Zhu, Jialiang An, Yunhan Jiang, Meng Lu
"Refusal Behavior Beyond English: A Multilingual Study of Representation Directions", Adith N Reganti, Malhar Udmale, Hardik Sharma, Soham Wasmatkar, Vaibhav Shukla, Bagesh Kumar
"Calibrated Surrogate Losses for Adversarial Classification With a Reject Option", Boris Ndjia Njike, Andriy Balinskyy, Waleed Mustafa, Puyu Wang, Antoine Ledent, Sophie Fellenz, Marius Kloft
"Green-Edge-Cloud: Routing without Routers via Confidence Calibration", Gao Yunzhe, Zijian Feng, Kezhi Mao
"Calibration of Structured Ignorance Certificates for Epistemic Diagnosis of Unknown Unknowns in Reasoning Models", Subramanyam Sahoo
"Robust Learning to Rank from Incomplete Rankings under Positional Censoring", Cristiano Migali, Gianmarco Genalti, Alberto Maria Metelli, Marco Mussi
"Learnability-Aware Replay for Mathematical Reasoning: Epistemic Uncertainty as a Guide for Experience Selection", Om Shastri
"Beyond a Single Signal: SPECTRE-G2 – A Unified Multi-Expert Anomaly Detector for Unknown Unknowns", Rahul D Ray
"Decoupled Conformal Optimisation: Efficient Prediction Sets via Independent Tuning and Calibration", Fanyi Wu, Lihua Niu, Michele Caprio
"Probabilistic Chain-of-Thought: Sequential Bayesian Inference over Latent Reasoning Correctness", Suriya Dev Saravanakumar, Ezra Matiwos Wesenie, Kishore Nuthalapati, Laksh Patel
"Modeling Incomparability: A New Rationality Paradigm for Preference-Based Reinforcement Learning", Simone Drago, Marco Mussi, Leonardo Bianconi, Alberto Maria Metelli
"I-CALM: Incentivizing Confidence-Aware Abstention for LLM Hallucination Mitigation", Haotian Zong, Binze Li, Yufei Long, Sinyin Chang, Jialong Wu, Gillian K Hadfield
"Split Conformal Prediction with Label-Shift-Adjusted Bayesian Scores", Hyeonsu Lee, Erkhembayar Jadamba, Juyeon Kim, Seungjin Choi, Hyunjin Shin
"ReflectiChain: Epistemic Grounding in LLM-Driven World Models for Supply Chain Resilience", Jia Luo
"ReasonBENCH: Benchmarking the (In)Stability of LLM Reasoning", Nearchos Potamitis, Lars Henning Klein, Akhil Arora
"Distributional Energy-Based Models for Uncertainty-Aware Structured LLM Reasoning", Shireen Kudukkil Manchingal, Abhey Kalia, Fernanda Gonçalves Abrantes, Shebin Rawther
"Same Target, Different Basins: Hard vs. Soft Labels for Annotator Distributions", Mirerfan Gheibi, Gashin Ghazizadeh
"Same Action, Different Justification: Path-Based Authorization for Irreversible Agent Actions", Jungsoo Baek
"Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training", Vin Bhaskara, Haicheng Wang
"Robust Gradual Domain Adaptation with Fuzzy Labeling", Siyi Shen, Benjamin Quost, Sebastien Destercke
"MissFlow: Flow Matching for Uncertainty-Quantified Imputation of Missing Tabular Data", Yonghyun Kwon, Sunil Hwang, Joonyoung Jeong
"A Tale of Two Uncertainties: Global–Local Attribution for Conformal Prediction", Sangyeon Cho, Minyoung Cho, Jungsoo Kim, Sujeong Oh, Sanghack Lee
"Stochastic Sampling is Epistemically Shallow: The Dimensionality Gap Between Temperature Variation and Model Diversity in LLMs", Izhar Ali
"Exposure Bias as Epistemic Underidentification in Recursive Forecasting", Riku Green, Zahraa S. Abdallah, Telmo M Silva Filho
"When Thinking Hurts: Epistemic Signals in the Reasoning Chains of Visual Language Models", Mayank Singal
"ConfidenceBench: Evaluating Confidence Calibration in Large Language Models", Matthew Ffrench-Constant, Daniel Yang, Sanyam Kapoor, Xinmeng Huang
"Conformal Bayes under Label Shift: Post-Hoc Calibration vs. In-Training Adaptation", Seungjin Choi
"HalluHard: A Hard Multi-Turn Hallucination Benchmark", Dongyang Fan, Sébastien Delsad, Nicolas Flammarion, Maksym Andriushchenko
"How Effective are Vision Language Models at Adapting to Novel Artifacts?", Lance Ying, Séb Arnold, Fei Sha, Sjoerd van Steenkiste
"Mitigating Reward Hacking in RLHF via Advantage Sign Robustness", Shinnosuke Ono, Johannes Ackermann, Soichiro Nishimori, Takashi Ishida, Masashi Sugiyama
"Language-Anchored Prototype Learning for Unseen Compositions", Anna-Alina Bondarets, Taras Rumezhak, Volodymyr Karpiv
"Don't Guess What You Don't Know: Spectral Effective Rank as a Diagnostic for Conformal Safety Under Subsampling", Rahul D Ray
"On the Epistemic Uncertainty of Overparametrized Neural Networks", David Rügamer
"Geometry-Aware Uncertainty Quantification via Conformal Prediction on Manifolds", Marzieh Amiri Shahbazi, Ali Baheri
"The Posterior-Prior Epistemic Gap: When World Model Imagination Fails to Carry Task-Relevant Signals", Jaerock Kwon, Elahe Delavari, Donghyun Kim, Haewoon Nam
"Constrained Bayesian Experimental Design via Online Planning", Yujia Guo, Daolang Huang, Xinyu Zhang, Sammie Katt, Samuel Kaski, Ayush Bharti
"Mind the Gap: Recovering Quality in Interrupted LLM Generation Through Continuation Strategy Selection", Tarun Narayanan, Ajay Krishnan
"Diffusion as Prior Construction: A Bayesian View of Communication in Multi-Agent POMDPs", Muhyun Byun, Eunae Lee, Jahun Oh, Seok Joo Doo
"Safety Targeted Embedding Exploit via Refinement: LLM Safety as an Epistemic Coverage Problem", Joshua Adrian Cahyono
"Set-based v.s. Distribution-based Representations of Epistemic Uncertainty: A Comparative Study", Kaizheng Wang, Yunjia Wang, Fabio Cuzzolin, David Moens, Hans Hallez, Siu Lun Chau
"Do Humans and Language Models Converge? A Dynamical Systems View of Human-AI Feedback and Co-evolution", Xuening Wu, Yanlan Kang, Qianya Xu, Kexuan Xie, Jiaqi Mi, Honggang Wang, Yubin Liu, Zeping Chen
"What Does Disagreement Mean? A Semantic Identifiability Frontier for Pairwise Preference Learning", Manoj Saravanan, Rohit Kumar Salla, Shrikar Reddy Kota
"The Shape of Reasoning: Topological Analysis of Reasoning Traces in Large Language Models", Xue Wen Tan, Galen Lee, Nathaniel Tan, Stanley Kok
"MMD-Balls as Credal Sets: A PAC-Bayesian Framework for Epistemic Uncertainty in Test-Time Adaptation", Ahanaf Hasan Ariq