Location: Room Peridot 202 in Singapore Expo (second floor)
Software development is changing with collaborative AI agents that orchestrate full programming workflows. Unlike simple code completion, these systems must coordinate with each other and with human developers across planning, implementation, testing, debugging, and documentation. This workshop brings together researchers and practitioners to design, evaluate, and deploy AI teammates for real development.
We're proud to announce our speakers below.
Shengyu Fu
Partner Applied Science Manager, Microsoft CoreAI
From IntelliCode to Github Copilot: Human-Centered Coding Agents at Scale
(In-person talk)
Abhik Roychoudhury
Provost's Chair Professor
National University of Singapore
Agentic AI for Software:
Lessons in Trust
(In-person talk)
Baptiste Rozière
AI Scientist @ Mistral
(Leading Code Generation)
Code Assistants: from Code Completion to Coding Agents
(Remote talk)
Dalton Flanagan
Member of Technical Staff @ Anthropic
(Claude Code)
Claude Code: One Year Later
(Remote talk)
08:50 - 09:00 Opening Remarks by Behrooz Omidvar-Tehrani (AWS)
09:00 - 10:00 Invited Talk: From IntelliCode to GitHub Copilot: Human-Centered Coding Agents at Scale, in person talk by Shengyu Fu (Micrososft CoreAI)
10:00 - 10:15 Paper Presentation: From Vibe to Verifiable Spec-Driven Development: A Demo of Intent and Realization Engineering, presented in-person by Dayi Lin (Centre for Software Excellence, Huawei Canada)
10:15 - 10:30 Paper Presentation: Multi-Agent Framework for Automated Cloud Security Assessment presented virtually by Omer Tripp (Amazon)
10:30 - 11:00 Coffee Break ☕️
11:00 - 11:45 Invited Talk: Claude Code: One Year Later, virtual talk by Dalton Flanagan (Anthropic)
11:45 - 12:00 Paper Presentation: Improving Grammar Constraints Generation Alignment by Sampling Highly Probable Playouts, presented in-person by Killian Susini (Université Paris Dauphine)
12:00 - 12:15 Paper Presentation: Agentic Reinforcement Learning for Real-World Code Repair, presented virtually by Alborz Geramifard (LinkedIn)
12:15 - 14:00 Lunch
14:00 - 15:00 Invited Talk: Agentic AI for Software: Lessons in Trust, in person talk by Abhik Roychoudhury (National University of Singapore)
15:00 - 15:45 Invited Talk: Code Assistants: from Code Completion to Coding Agents, virtual talk by Baptiste Rozière (Mistral)
15:45 - 16:30 Panel: Shengyu Fu (in-person), Abhik Roychoudhury (in-person), Omer Tripp (in-person), Dalton Flanagan (remote), Alborz Geramifard (remote), Baptiste Rozière (remote)
16:30 - 16:45 Paper Presentation: CAT: Coverage-aware Testing — Structured Test Suite Generation for Coding Agent Handoff
16:45 - 17:00 Paper Presentation: LLM-as-a-Judge for Scalable Test Coverage Evaluation: Accuracy, Operational Reliability, and Cost, presented in-person by Shila Chew (MasterCard)
17:00 - 17:05 Closing Remarks by Shweta Garg and Behrooz Omidvar-Tehrani (AWS)
We explore how AI agents collaborate in modern software engineering. We study single agent systems that work with humans, multi agent systems that coordinate among themselves, and hybrid approaches. We focus on interaction models, handoffs, workflow design, and on safeguards that preserve human agency while leveraging AI capability. We also ask how to make these systems reliable, auditable, and safe for production, and what verification and evaluation frameworks are needed.
Collaborative AI architectures for software development
Multi-agent coordination strategies
Human–AI workflow integration and handoff mechanisms
Preserving human expertise and agency in AI-assisted coding
Interaction design in IDEs, CLIs, and collaborative environments
Trust, reliability, and safety in collaborative coding agents
Verification, validation, and auditing frameworks
Evaluation methodologies and benchmarks for collaboration
User studies and empirical developer experience evaluations
Industrial deployments and case studies of AI coding agents
Applications in planning, implementation, debugging, and testing
Lessons from systems such as GitHub Copilot, Amazon Kiro, IBM Bob, Cursor AI, and Asimov
We welcome submissions from academia and industry. Submissions will be single-blind; author names and affiliations should be included in the manuscript.
Regular papers (up to six pages): Mature research with empirical results or theory.
Position papers (up to four pages): New perspectives, conceptual frameworks, emerging directions.
Extended abstracts (up to two pages): Early stage work, system demos, industry experiences.
Accepted papers will be presented as talks, spotlights, or posters. We expect at least one author to be in person for the presentation.
Submission site: https://cmt3.research.microsoft.com/CodeMates2026/
Submission due: October 29, 2025
Notification date: November 11, 2025
Proceedings date: January 27, 2026
Archival note: Although AAAI does not officially archive workshop proceedings, we will make all accepted papers publicly available and citable through the workshop website to ensure accessibility and permanence.
Any questions may be directed to the workshop orgnizers using the following email address: coding-agent-aaai26-organizers@googlegroups.com.
Senior Applied Scientist and Science Manager at AWS AI Labs
Senior Applied Scientist and Science Manager at AWS AI Labs
Post-doctoral Researcher
at Columbia University
Senior Applied Scientist
at AWS
Abhilasha Katariya (Amazon)
Ajay Yadav (Google)
Dalton Flanagan (Anthropic)
Feiyang Jin (Google)
Gabriel Ryan (Microsoft)
Ignacio Erazo (Amazon)
Jinyao Guo (Purdue University)
Keyur Muzumdar (Meta)
Pareesa Golnari (Microsoft)
Pramod Chunduri (AWS)
Ravishka Rathnasuriya (The University of Texas at Dallas)
Shahed Sorower (AWS)
Xiaoyu Liu (Microsoft)
Yuntong Zhang (National University of Singapore)
Zhou Xuan (Purdue University)
Penghui Li (Columbia University)
Mingwei Zheng (Purdue University)
The Microsoft CMT service was used for managing the peer-reviewing process for this conference. This service was provided for free by Microsoft and they bore all expenses, including costs for Azure cloud services as well as for software development and support.