AAAI 2026 Workshop
W8: Agentic AI Benchmarks and Applications for Enterprise Tasks
January 26, 2026 | Singapore EXPO
January 26, 2026 | Singapore EXPO
The primary goal of this workshop is to foster discussions and collaborations to build robust, efficient, and trustworthy Agentic AI technologies for complex and dynamic enterprise business operations. It aims to bridge the gap between cutting-edge Agentic AI research and the practical demands of enterprise deployment and rigorous evaluation.
The workshop will address the following specific issues:
Benchmarking and Evaluation: Addressing the urgent need for robust benchmarks, datasets, and metrics to reliably evaluate Agentic AI systems for enterprise-level performance, safety, and reliability, including the challenges of creating realistic and representative enterprise task environments.
Application of Agentic AI in Enterprise Settings: Exploring how Agentic AI can perform complex tasks such as understanding on-site operations, planning, observation, reflection, and system management within enterprise contexts.
Human-Agent Interaction in Enterprise Workflows: Development and deployment of intelligent assistants that augment human capabilities in business operations.
Multimodal Reasoning for Enterprise Tasks: Integrating multimodal LLMs to handle diverse physical data (e.g., visual, textual, auditory) for robust decision-making and task execution in enterprises.
Task Planning and Orchestration in Enterprise Environment: Strategies for integrating multiple agents and tools to achieve complex, multi-step enterprise goals.
Others: We are also seeking a wide range of content related to Agentic AI technology.