For work on LLMs and AI agents in the time series and education domains, see AI for Education and AI for Time Series.
The full list of my publications (including arXiv papers) can be found on Google Scholar.
Algorithm & Benchmark & Data & Code:
[CVPR'26] AutoDebias: An Automated Framework for Detecting and Mitigating Backdoor Biases in Text-to-Image Models, CVPR 2026. [arXiv]
[ICLR'26] DiffuGuard: How Intrinsic Safety is Lost and Found in Diffusion Large Language Models, ICLR 2026. [arXiv]
[ICLR'26] AudioTrust: Benchmarking The Multifaceted Trustworthiness of Audio Large Language Models, ICLR 2026. [arXiv]
[NeurIPS'25] SAEMark: Steering Personalized Multilingual LLM Watermarks with Sparse Autoencoders, NeurIPS 2025. [arXiv]
[ACL'25] NetSafe: Exploring the Topological Safety of Multi-agent Networks, ACL 2025. [arXiv]
[MM'25] The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework, ACM MM 2025. [arXiv]
[MM'25] Debiasing Multimodal Large Language Models via Penalization of Language Priors, ACM MM 2025. [arXiv] [code]
AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management, arXiv 2025. [arXiv]
ChronosAudio: A Comprehensive Long-Audio Benchmark for Evaluating Audio-Large Language Models, arXiv 2026. [arXiv]
MCPShield: A Security Cognition Layer for Adaptive Trust Calibration in Model Context Protocol Agents, arXiv 2026. [arXiv]
Survey & Position & Perspective:
[KDD'25] A Survey on Trustworthy LLM Agents: Threats and Countermeasures, KDD 2025. [arXiv]
[SMC Magazine'25] R4 Trustworthy Human–Artificial Intelligence Symbiosis, IEEE SMC Magazine 2026. [paper]
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment, arXiv 2025. [arXiv]
LLM-based Agents Suffer from Hallucinations: A Survey of Taxonomy, Methods, and Directions, arXiv 2025. [arXiv]
Aligning Multimodal LLM with Human Preference: A Survey, arXiv 2025. [arXiv]
Evaluating LLMs in Finance Requires Explicit Bias Consideration, arXiv 2026. [arXiv]
Algorithm & Benchmark & Data & Code:
[ICML'26] See First, Reason Later: Mutual Information-Guided Reinforcement Learning for Vision-Language Models, ICML 2026.
[ACL'26] ARVEC: Adaptive Reasoning Model with Vision Understanding and Executable Code, ACL 2026. [arXiv]
[ACL'26] Scaling Law for Multimodal Large Language Model Supervised Fine-Tuning, ACL 2026.
[TPAMI'26] Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models, IEEE TPAMI, 2026. [arXiv]
[ICLR'26] RankLLM: Weighted Ranking of LLMs by Quantifying Question Difficulty, ICLR 2026. [arXiv]
[ICLR'25] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? ICLR 2025. [arXiv]
[NeurIPS'25] Improving Nonlinear RNN with Closed-loop Control, NeurIPS 2025. [arXiv] (NeurIPS Spotlight, Top 3.5%)
[NeurIPS'25] Can LLMs Correct Themselves? A Benchmark of Self-Correction in LLMs, NeurIPS 2025.
[NeurIPS'24] AutoSurvey: Large Language Models Can Automatically Write Surveys, NeurIPS 2024. [arXiv]
[AAAI'25] UrbanVLP: Multi-Granularity Vision-Language Pretraining for Urban Socioeconomic Indicator Prediction, AAAI 2025. [paper]
[EMNLP'25] DynamicNER: A Dynamic, Multilingual, and Fine-Grained Dataset for LLM-based Named Entity Recognition, EMNLP 2025. [arXiv]
[EMNLP'24] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation, EMNLP 2024. [arXiv]
[WWW'24] UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web, WWW 2024. [arXiv]
Survey & Position & Perspective:
[ACL'26] Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning, ACL 2026. [arXiv]
[SPM'25] Brain Foundation Models: A Survey on Advancements in Neural Signal Processing and Brain Discovery, IEEE Signal Processing Magazine, 2025. [arXiv]
[ACL'25] A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges, ACL 2025. [arXiv]
[FnTs'25] Recommender Systems Meet Large Language Model Agents: A Survey, Foundations and Trends® in Privacy and Security, 2025. [link] [Preprint]
A Survey on Post-training of Large Language Models, arXiv 2025. [arXiv]
Complex networks of AI agentic systems: topology, memory, and update dynamics, arXiv 2026. [arXiv]
Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models, arXiv 2026. [arXiv]
KDD Workshop on AI Agent for Information Retrieval (KDD'25-Agent4IR, KDD'26-Agent4IR)
WWW Workshop on AI Agent for Information Retrieval: Generating and Ranking (WWW'25-Agent4IR)
AAAI Workshop on AI Agent for Information Retrieval: Generating and Ranking (AAAI'25-Agent4IR)
CIKM Workshop on AI Agent for Information Retrieval (CIKM'24-Agent4IR)