8:00 - 8:05 Opening Remarks
8:05 - 8:35 Invited Talk
8:35 - 8:45 Contributed Talk: On Evaluating Methods vs. Evaluating Models
8:45 - 8:55 Contributed Talk: Detecting Training Data of Large Language Models via Expectation Maximization
8:55 - 9:05 Break
9:05 - 10:05 Poster Session 1 (submission ids: 1-87)
10:05 - 10:35 Invited Talk
10:35 - 11:05 Invited Talk
11:05 - 11:15 Contributed Talk: LLMs Show Surface-Form Brittleness Under Paraphrase Stress Tests
11:15 - 12:15 Poster Session 2 (submission ids: 88-169)
12:15 - 2:00 Lunch Break
2:00 - 2:30 Invited Talk
2:30 - 3:30 Poster Session 3 (submission ids:170-251)
3:30 - 3:40 Contributed Talk: Physics Supernova: AI Agent Matches Elite Gold Medalists at IPhO 2025
3:40 - 3:50 Contributed Talk: The Measure of All Measures: Quantifying LLM Benchmark Quality
3:50 - 4:00 Break
4:00 - 4:50 Panel
4:50 - 5:00 Closing Remarks