AAAI 2026 Workshop

W8: Agentic AI Benchmarks and Applications for Enterprise Tasks

January 26, 2026 | Singapore EXPO

Update

Several invited talk slides have been made available. You can download them from the "Speakers" tab. (March 5, 2026)
Poster allocation has been uploaded to the Accepted Papers tab. (January 16, 2026)
The final version of the day's schedule has been uploaded to the Schedule tab. (January 5, 2026)

The primary goal of this workshop is to foster discussions and collaborations to build robust, efficient, and trustworthy Agentic AI technologies for complex and dynamic enterprise business operations. It aims to bridge the gap between cutting-edge Agentic AI research and the practical demands of enterprise deployment and rigorous evaluation.

The workshop will address the following specific issues:

Benchmarking and Evaluation: Addressing the urgent need for robust benchmarks, datasets, and metrics to reliably evaluate Agentic AI systems for enterprise-level performance, safety, and reliability, including the challenges of creating realistic and representative enterprise task environments.
Application of Agentic AI in Enterprise Settings: Exploring how Agentic AI can perform complex tasks such as understanding on-site operations, planning, observation, reflection, and system management within enterprise contexts.
Human-Agent Interaction in Enterprise Workflows: Development and deployment of intelligent assistants that augment human capabilities in business operations.
Multimodal Reasoning for Enterprise Tasks: Integrating multimodal LLMs to handle diverse physical data (e.g., visual, textual, auditory) for robust decision-making and task execution in enterprises.
Task Planning and Orchestration in Enterprise Environment: Strategies for integrating multiple agents and tools to achieve complex, multi-step enterprise goals.
Others: We are also seeking a wide range of content related to Agentic AI technology.

Page updated

Google Sites

Report abuse