AISQ 2025: The 2nd International Workshop on Advanced Intelligent Software Applications
AISQ 2025: The 2nd International Workshop on Advanced Intelligent Software Applications
Co-located with ISSRE 2025, October 21st - 24th, 2025
High-Availability Machine Learning Systems: N-version Architecture and Rejuvenation
As the use of machine learning (ML) becomes more prevalent in many practical software systems, maintaining the long-term availability of ML systems has emerged as a new critical challenge. In ML system operations, ML models continuously confront the risk of performance degradation due to dataset shifts or security attacks. As the system performance and quality may deteriorate over time, implementing proper mitigation methods to ensure the high availability of ML systems is essential. This talk introduces recent approaches to enhance the reliability and availability of ML systems through N-version configuration, software rejuvenation, and ML model updates. While these methods are based on well-established techniques, several unique perspectives, specifically tailored for ML-based systems, are discussed. The talk will also highlight future challenges for high-availability ML systems, which may consist of multiple dependent ML and AI models.
Fumio Machida is an associate professor at the Laboratory for System Dependability in the Computer Science Department of the University of Tsukuba. He was a principal researcher at NEC Corporation until 2019 and was a visiting scholar in the Department of Electrical and Computer Engineering at Duke University in 2010. He was a recipient of the Young Scientists’ Prize of Japan in 2014. He was named a distinguished contributor of the IEEE Computer Society, class of 2023. He is the general chair of the 35th IEEE International Symposium on Software Reliability Engineering. His research interests include modeling and analysis of system dependability, software aging and rejuvenation, and reliability of machine learning systems. He is a senior member of the IEEE and a member of ACM.