An Empirical Study on the Resilience of Partial Merging to Model Clone Attacks
ICML 2026
Tiantong Wu, Yurong Hao, Wei Yang Bryan Lim
ICML 2026
Tiantong Wu, Yurong Hao, Wei Yang Bryan Lim
TL;DR: We study the privacy risks of partial model merging and show that private model components can still be substantially reconstructed under realistic clone-attack settings.
ICML 2026 [arXiv]
Guowei Guan, Yurong Hao, Jiaming Zhang, Tiantong Wu, Fuyao Zhang, Tianxiang Chen, Longtao Huang, Cyril Leung, Wei Yang Bryan Lim
TL;DR: We expose a new poisoning vulnerability in multimodal LLM recommender systems, showing that coordinated text–image perturbations can reliably promote target items while preserving normal recommendation utility.
ICML 2026
Gong Zhiren, Yikun Hou, Fan Wu, Che Wang, Fuyao Zhang, Tiantong Wu, Yurong Hao, Jiaming Zhang, Yiyang Duan, Tiantong Wang, Fei Huang, Chau Yuen, Wei Yang Bryan Lim
TL;DR: We propose an inference-time pruning framework that maps domain-aligned representation subspaces to sparse model pathways, enabling scenario-specific LLM subnetworks with improved efficiency.
ICML 2026 [GitHub arXiv Hugging Face]
Gong Zhiren, Tiantong Wu, Jiaming Zhang, Fuyao Zhang, CHE WANG, Yurong Hao, Yikun Hou, Foo Ping, Yilei Zhao, Fei Huang, Chau Yuen, Wei Yang Bryan Lim
TL;DR: We introduce XDomainBench, a benchmark for interactive interdisciplinary scientific reasoning, and show that LLMs exhibit systematic reasoning collapse as cross-domain composition becomes more complex.
CVPR 2026 (Oral)
Tianxiang Chen, Wenjie Hou, Feng Wang, Tiantong Wu, Zhiming Zheng, Shaoting Tang, Wei Yang Bryan Lim
TL;DR: We introduce FedAdamom, which adapts server momentum instead of the learning rate, to keep fast saddle-point escape while better selecting flat minima, improving both convergence and test accuracy.
TL;DR: A lightweight framework that integrates federated learning and targeted unlearning for LLMs, formulating them as a joint multi-objective optimization task to enable privacy-preserving training and compliance with GDPR's right to be forgotten.
TL;DR: A metric using limited unlabeled data to measure how well models can be merged by comparing merged weights vs ensemble outputs at layer/node levels. It guides merging and pruning and improves alignment with ensembling.