Preprints

(α-β order) denotes alphabetical ordering, * denotes equal contribution.

Goedel-Prover-V2: Scaling Formal Theorem Proving with Scaffolded Data Synthesis and Self-Correction [arXiv]


LLM Economist: Large Population Models and Mechanism Design in Multi-Agent Generative Simulacra [arXiv]


Learning World Models for Interactive Video Generation [arXiv]


Ineq-Comp: Benchmarking Human-Intuitive Compositional Reasoning in Automated Theorem Proving on Inequalities [arXiv]


Frontier LLMs Still Struggle with Simple Reasoning Tasks [arXiv]


Principled Out-of-Distribution Generalization via Simplicity [arXiv]


Is Elo Rating Reliable? A Study Under Model Misspecification [arXiv]


Rethinking Mixture-of-Agents: Is Mixing Different Large Language Models Beneficial? [arXiv]


Generative Diffusion Modeling: A Practical Handbook [arXiv]


DOLLAR: Few-Step Video Generation via Distillation and Latent Reward Optimization [arXiv]


On Limitation of Transformer for Learning HMMs [arXiv]


Learning a Universal Human Prior for Dexterous Manipulation from Human Preference. [arXiv]


Thinking Fast and Slow: Data-Driven Adaptive DeFi Borrow-Lending Protocol. [arXiv]