For LLM & AI Agent in Time Series and Education domains, check AI for Time Series, AI for Education
The full list of my publications (with arXiv papers) can be found at Google Scholar.
Algorithm & Benchmark & Data & Code:
[TPAMI'26] Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models, IEEE TPAMI, 2026. [arXiv]
[CVPR'26] AutoDebias: An Automated Framework for Detecting and Mitigating Backdoor Biases in Text-to-Image Models, CVPR 2026.
[ICLR'26] RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty, ICLR 2026. [arXiv]
[ICLR'26] DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models, ICLR 2026. [arXiv]
[ICLR'26] AudioTrust: Benchmarking The Multifaceted Trustworthiness of Audio Large Language Models, ICLR 2026. [arXiv]
[KDD'26] Evaluating RAG Robustness to Symbolic Perturbations, KDD 2026.
[AAAI'26] Assemble Your Crew: Automatic Multi-agent Communication Topology Design via Autoregressive Graph Generation, AAAI 2026. [arXiv] (AAAI Oral, Top 5%)
[AAAI'26] SafeSieve: From Heuristics to Experience in Progressive Pruning for LLM-based Multi-Agent Communication, AAAI 2026. [arXiv]
[NeurIPS'25] Improving Nonlinear RNN with Closed-loop Control, NeurIPS 2025. [arXiv] (NeurIPS Spotlight, Top 3.5%)
[NeurIPS'25] Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs, NeurIPS 2025.
[NeurIPS'25] SAEMark: Steering Personalized Multilingual LLM Watermarks with Sparse Autoencoders, NeurIPS 2025.
[ICLR'25] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? ICLR 2025. [arXiv] [code]
[NeurIPS'24] AutoSurvey: Large Language Models Can Automatically Write Surveys, NeurIPS 2024 [arXiv] [code]
[ACL'25] NetSafe: Exploring the Topological Safety of Multi-agent Networks, ACL 2025. [arXiv]
[EMNLP'25] DynamicNER: A Dynamic, Multilingual, and Fine-Grained Dataset for LLM-based Named Entity Recognition, EMNLP 2025. [arXiv]
[AAAI'25] UrbanVLP: Multi-Granularity Vision-Language Pretraining for Urban Socioeconomic Indicator Prediction, AAAI 2025 [paper]
[MM'25] The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework, ACM MM 2025. [arXiv]
[MM'25] Debiasing Multimodal Large Language Models via Penalization of Language Priors, ACM MM 2025. [arXiv] [code]
[EMNLP'24] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation, EMNLP 2024 (Demo Track) [arXiv]
[WWW'24] UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web, WWW ’24 [arXiv
Survey & Position & Perspective:
[SPM'25] Brain Foundation Models: A Survey on Advancements in Neural Signal Processing and Brain Discovery, IEEE Signal Processing Magazine, 2025. [arXiv]
[ACL'25] A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges, ACL 2025. [arXiv]
[FnTs'25] Recommender Systems Meet Large Language Model Agents: A Survey, Foundations and Trends® in Privacy and Security, 2025. [link] [Preprint]
[KDD'25] A Survey on Trustworthy LLM Agents: Threats and Countermeasures, KDD 2025. [arXiv]
KDD Workshop on AI Agent for Information Retrieval (KDD'25-Agent4IR)
WWW Workshop on AI Agent for Information Retrieval: Generating and Ranking (WWW'25-Agent4IR)
AAAI Workshop on AI Agent for Information Retrieval: Generating and Ranking (AAAI'25-Agent4IR)
CIKM Workshop on AI Agent for Information Retrieval (CIKM'24-Agent4IR)