Professor, Hong Kong University of Science and Technology
Prof. Kwok is a Professor in the Department of Computer Science and Engineering, Hong Kong University of Science and Technology. He serves or has served as an Associate Editor for the IEEE Transactions on Neural Networks and Learning Systems, Neurocomputing, Artificial Intelligence Journal, and the International Journal of Data Science and Analytics, and on the Editorial Board of Machine Learning. He also serves or has served as a Senior Area Chair of major machine learning / AI conferences including NeurIPS, ICML, ICLR, and IJCAI, and as an Area Chair of conferences including AAAI and ECML. Prof. Kwok will be the IJCAI-2025 Program Chair.
In many machine learning applications, only a limited number of training samples are available. A successful approach to alleviating this problem is meta-learning, which extracts meta-knowledge from similar historical tasks. Naturally, the larger the number of tasks to learn from, the more meta-knowledge can be extracted. However, popular meta-learning algorithms such as MAML learn only a globally-shared meta-model. This can be problematic when the task environment is complex and a single meta-model is not sufficient to capture the diversity of the meta-knowledge. Moreover, with a large number of tasks, accessing all task gradients in each training iteration may not be feasible, and sampling tasks in each iteration increases the variance of the stochastic gradient, resulting in slow convergence. In this talk, we propose to address these problems by structuring the task model parameters into multiple subspaces, so that each subspace represents one type of meta-knowledge. We also propose a scalable solver with theoretical optimality guarantees based on the improvement function, and incorporate variance reduction into meta-learning to achieve fast convergence. Experiments on various meta-learning tasks demonstrate the effectiveness of the proposed approach over state-of-the-art algorithms.
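To make the contrast drawn in the abstract concrete, the sketch below is a minimal PyTorch toy: where vanilla MAML keeps one globally-shared meta-initialization, this variant maintains several candidate initializations (standing in for the "subspaces" above) and lets each task adapt from whichever fits it best. The synthetic task distribution, the hard task-to-initialization assignment, and all names here are illustrative assumptions, not the speaker's actual algorithm.

```python
import torch

torch.manual_seed(0)

def forward(theta, x):
    # A tiny linear model: theta = [weight, bias].
    w, b = theta
    return x @ w + b

def adapt(theta, x, y, inner_lr=0.01):
    # One MAML inner-loop step on a task's support data, keeping the
    # graph (create_graph=True) so the outer loop can backpropagate
    # through the adaptation.
    loss = torch.nn.functional.mse_loss(forward(theta, x), y)
    grads = torch.autograd.grad(loss, theta, create_graph=True)
    return [p - inner_lr * g for p, g in zip(theta, grads)]

# K meta-initializations instead of MAML's single shared one; each plays
# the role of one "type" of meta-knowledge (illustrative assumption).
K = 3
metas = [[torch.randn(5, 1, requires_grad=True),
          torch.zeros(1, requires_grad=True)] for _ in range(K)]
outer_opt = torch.optim.SGD([p for m in metas for p in m], lr=0.001)

for step in range(100):
    # Sample a synthetic linear-regression task.
    w_true = torch.randn(5, 1)
    x_support, x_query = torch.randn(10, 5), torch.randn(10, 5)
    y_support, y_query = x_support @ w_true, x_query @ w_true

    # Adapt every meta-initialization, then update only the one whose
    # adapted model performs best on the query set (hard assignment,
    # chosen here for simplicity).
    query_losses = []
    for theta in metas:
        theta_prime = adapt(theta, x_support, y_support)
        query_losses.append(
            torch.nn.functional.mse_loss(forward(theta_prime, x_query), y_query))
    outer_loss = torch.stack(query_losses).min()

    outer_opt.zero_grad()
    outer_loss.backward()   # gradients flow only into the selected init
    outer_opt.step()
```

With K = 1 this collapses to plain one-step MAML; the point of the multi-initialization version is that tasks drawn from a heterogeneous environment need not share a single meta-model.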