Conference
Zhe Xie, Zeyan Li, Xiao He, Longlong Xu, Xidao Wen, Tieying Zhang, Jianjun Chen, Rui Shi, Dan Pei. ChatTS: Aligning Time Series with LLMs via Synthetic Data for Enhanced Understanding and Reasoning. Proceedings of the VLDB Endowment 18, x (2025), PVLDB 2025. [ Arxiv ] [ PDF ] [ Bibtex ] [ Code ]
Xianghong Xu, Tieying Zhang, Xiao He, Haoyang Li, Rong Kang, Shuai Wang, Linhui Xu, Zhimin Liang, Shangyu Luo, Lei Zhang, Jianjun Chen. AdaNDV: Adaptive Number of Distinct Value Estimation via Learning to Select and Fuse Estimators. Proceedings of the VLDB Endowment 18, x (2025), PVLDB 2025. [ Arxiv ] [ PDF ] [ Bibtex ] [ Code ]
Xianghong Xu, Xiao He, Tieying Zhang, Lei Zhang, Rui Shi, Jianjun Chen. PLM4NDV: Minimizing Data Access for Number of Distinct Values Estimation with Pre-trained Language Models. Proceedings of the 2025 International Conference on Management of Data, SIGMOD 2025. [ Arxiv ] [ PDF ] [ Bibtex ] [ Code ]
Fengrui Liu, Xiao He, Tieying Zhang, Jianjun Chen, Yi Li, Lihua Yi, Haipeng Zhang, Gang Wu, Rui Shi. TickIt: Leveraging Large Language Models for Automated Ticket Escalation. FSE 2025. [ Arxiv ] [ PDF ] [ Bibtex ]
Changhua Pei, Zexin Wang, Fengrui Liu, Zeyan Li, Yang Liu, Xiao He, Rong Kang, Tieying Zhang, Jianjun Chen, Jianhui Li, Gaogang Xie, Dan Pei. Flow-of-Action: SOP Enhanced LLM-Based Multi-Agent System for Root Cause Analysis. Companion Proceedings of the ACM on Web Conference 2025, WWW 2025. [ Arxiv ] [ PDF ] [ Bibtex ]
Ye Li, Jian Tan, Bin Wu, Xiao He, Feifei Li. ShapleyIQ: Influence Quantification by Shapley Values for Performance Debugging of Microservices. Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS 2023. [ PDF ] [ Bibtex ] [ Code ]
Chunhui Shen, Qianyu Ouyang, Feibo Li, Zhipeng Liu, Longcheng Zhu, Yujie Zou, Qing Su, Tianhuan Yu, Yi Yi, Jianhong Hu, Cen Zheng, Bo Wen, Hanbang Zheng, Lunfan Xu, Sicheng Pan, Bin Wu, Xiao He, Ye Li, Jian Tan, Sheng Wang, Dan Pei, Wei Zhang, Feifei Li. Lindorm TSDB: A cloud-native time-series database for large-scale monitoring systems. Proceedings of the VLDB Endowment 16, 12 (2023), PVLDB 2023. [ PDF ] [ Bibtex ]
Xiao He, Ye Li, Jian Tan, Bin Wu, Feifei Li. OneShotSTL: One-Shot Seasonal-Trend Decomposition For Online Time Series Anomaly Detection And Forecasting. Proceedings of the VLDB Endowment 16, 06 (2023), PVLDB 2023. [ Arxiv ] [ PDF ] [ Bibtex ] [ Code ]
Xiao He, Jian Tan, Bin Wu, Feifei Li, Xinping Zhang, Gaozhong Liang, Jinfeng Xu. Active Sampling for Sparse Table by Bayesian Optimization with Adaptive Resolution. IEEE International Conference on Data Engineering, ICDE 2023. [ PDF ] [ Bibtex ]
Jingyi Yang, Peizhi Wu, Gao Cong, Tieying Zhang, Xiao He. SAM: Database Generation from Query Workloads with Supervised Autoregressive Models. Proceedings of the 2022 International Conference on Management of Data, SIGMOD 2022. [ PDF ] [ Bibtex ] [ Code ]
Ammar Shaker, Christoph Gärtner, Xiao He, Shujian Yu. Online meta-forest for regression data streams. International Joint Conference on Neural Networks, IJCNN 2020. [ PDF ] [ Bibtex ]
Luca Franceschi, Mathias Niepert, Massimiliano Pontil, Xiao He. Learning Discrete Structures for Graph Neural Networks. International Conference on Machine Learning, ICML 2019. [ Arxiv ] [ PDF ] [ Bibtex ] [ Code ]
Xiao He, Francesco Alesiani, Ammar Shaker. Efficient and Scalable Multi-task Regression on Massive Number of Tasks. The Thirty-Third AAAI Conference on Artificial Intelligence, AAAI 2019. [ Arxiv ] [ PDF ] [ Bibtex ]
Xiao He, Luis Moreira-Matias, Ammar Shaker. Robust Continuous Co-Clustering. 2019. [ Arxiv ] [ Bibtex ]
Xiao He, Thomas Gumbsch, Damian Roqueiro, Karsten Borgwardt. Kernel Conditional Clustering. IEEE International Conference on Data Mining, ICDM 2017. (One of the best papers invited for publication in Knowledge and Information Systems) [ PDF ] [ Bibtex ] [ Code ]
Xiao He, Limin Li, Damian Roqueiro, Karsten Borgwardt. Multi-view Spectral Clustering on Conflicting Views. Joint European Conference on Machine Learning and Knowledge Discovery in Databases, ECML PKDD 2017. [ PDF ] [ Bibtex ] [ Code ]
Sebastian Goebl, Xiao He, Claudia Plant, Christian Böhm. Finding the Optimal Subspace for Clustering. IEEE International Conference on Data Mining, ICDM 2014. [ PDF ] [ Bibtex ] [ Code ]
Xiao He, Jing Feng, Bettina Konte, Son T Mai, Claudia Plant. Relevant Overlapping Subspace Clusters on Categorical Data. Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2014. [ PDF ] [ Bibtex ]
Son T Mai, Xiao He, Nina Hubig, Claudia Plant, Christian Böhm. Active Density-based Clustering. IEEE International Conference on Data Mining, ICDM 2013. [ PDF ] [ Bibtex ]
Jing Feng, Xiao He, Nina Hubig, Christian Böhm, Claudia Plant. Compression-based Graph Mining Exploiting Structure Primitives. IEEE International Conference on Data Mining, ICDM 2013. [ PDF ] [ Bibtex ]
Son T Mai, Xiao He, Jing Feng, Christian Böhm. Efficient Anytime Density-based Clustering. Proceedings of the 2013 SIAM International Conference on Data Mining, SDM 2013. [ PDF ] [ Bibtex ]
Junming Shao, Xiao He, Qinli Yang, Claudia Plant, Christian Böhm. Robust Synchronization-based Graph Clustering. Pacific-Asia Conference on Knowledge Discovery and Data Mining, PAKDD 2013. [ PDF ] [ Bibtex ] [ Code ]
Jing Feng, Xiao He, Bettina Konte, Christian Böhm, Claudia Plant. Summarization-based Mining Bipartite Graphs. Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2012. [ PDF ] [ Bibtex ]
Christian Böhm, Jing Feng, Xiao He, Son T Mai, Claudia Plant, Junming Shao. A Novel Similarity Measure for Fiber Clustering Using Longest Common Subsequence. Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, Workshop on Data Mining for Medicine and Healthcare, DMMH 2011. [ PDF ] [ Bibtex ]
Xiao He, Jing Feng, Claudia Plant. Automatically Spotting Information-rich Nodes in Graphs. IEEE International Conference on Data Mining, Workshop on Data Mining in Networks, DAMNet 2011. [ PDF ] [ Bibtex ]
Journal
Pengcheng Zhang, Bin Yao, Chao Gao, Bin Wu, Xiao He, Feifei Li, Yuanfei Lu, Chaoqun Zhan, Feilong Tang. Learning-based query optimization for multi-probe approximate nearest neighbor search. The VLDB Journal, VLDBJ, 2022. [ PDF ] [ Bibtex ]
Xiao He, Thomas Gumbsch, Damian Roqueiro, Karsten Borgwardt. Kernel Conditional Clustering and Kernel Conditional Semi-supervised Learning. Knowledge and Information Systems, KAIS, 2019. [ PDF ] [ Bibtex ] [ Code ]
Xiao He, Lukas Folkman, Karsten Borgwardt. Kernelized Rank Learning for Personalized Drug Recommendation. Bioinformatics, 2018. (Best Research Poster Award at ISMB 2017 in Prague) [ PDF ] [ Bibtex ] [ Code ]
Limin Li, Xiao He, Karsten Borgwardt. Multi-target Drug Repositioning by Bipartite Block-wise Sparse Multi-task Learning. BMC Systems Biology, 2018. [ PDF ] [ Bibtex ] [ Code ]
Son T Mai, Xiao He, Jing Feng, Claudia Plant, Christian Böhm. Anytime Density-based Clustering of Complex Data. Knowledge and Information Systems, KAIS, 2015. [ PDF ] [ Bibtex ]
Junming Shao, Xiao He, Christian Böhm, Qinli Yang, Claudia Plant. Synchronization-inspired partitioning and hierarchical clustering. IEEE Transactions on Knowledge and Data Engineering, TKDE, 2013. [ PDF ] [ Bibtex ] [ Code ]
Patent
Ammar Shaker, Francesco Alesiani, Xiao He. Continual learning of artificial intelligence systems based on bi-level optimization. US Patent 11,544,558
Ammar Shaker, Christoph Gaertner, Xiao He. Method and system for adaptive online meta learning from data streams. US Patent 11,521,132
Xiao He, Luis Moreira-Matias. Method for automated scalable co-clustering. US Patent 10,817,543
Luca Franceschi, Xiao He, Mathias Niepert. Discrete learning structure. US Patent App. 16/558,175
Xiao He, Francesco Alesiani, Ammar Shaker. Method and system for scalable multi-task learning with convex clustering. US Patent 11,657,322