(for LLM & AI Agent in Time Series and Education domains, check AI for Time Series, AI for Education)
Algorithm & Benchmark & Data & Code:
[ACL'25] NetSafe: Exploring the Topological Safety of Multi-agent Networks, ACL 2025. [arXiv]
[EMNLP'25] DynamicNER: A Dynamic, Multilingual, and Fine-Grained Dataset for LLM-based Named Entity Recognition, EMNLP 2025. [arXiv]
[AAAI'25] UrbanVLP: Multi-Granularity Vision-Language Pretraining for Urban Socioeconomic Indicator Prediction, AAAI 2025 [paper]
[MM'25] The Eye of Sherlock Holmes: Uncovering User Private Attribute Profiling via Vision-Language Model Agentic Framework, ACM MM 2025. [arXiv]
[ICLR'25] MME-RealWorld: Could Your Multimodal LLM Challenge High-Resolution Real-World Scenarios that are Difficult for Humans? ICLR 2025. [arXiv] [code]
[NeurIPS'24] AutoSurvey: Large Language Models Can Automatically Write Surveys, NeurIPS 2024 [arXiv] [code]
[MM'24] Debiasing Multimodal Large Language Models via Penalization of Language Priors, ACM MM 2025. [arXiv] [code]
[WWW'24] UrbanCLIP: Learning Text-enhanced Urban Region Profiling with Contrastive Language-Image Pretraining from the Web, WWW ’24 [arXiv]
[EMNLP'24] RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation, EMNLP 2024 (Demo Track) [arXiv] [code]
[arXiv'24] Beyond LLaVA-HD: Diving into High-Resolution Large Multimodal Models, arXiv 2024. [arXiv] [code]
[arXiv'25] LLM-Virus: Evolutionary Jailbreak Attack on Large Language Models, arXiv 2025. [arXiv]
[arXiv'25] AgentSafe: Safeguarding Large Language Model-based Multi-agent Systems via Hierarchical Data Management, arXiv 2025. [arXiv]
[arXiv'25] Automating Personalization: Prompt Optimization for Recommendation Reranking, arXiv 2025. [arXiv]
[arXiv'25] Comba: Improving Nonlinear RNNs with Closed-loop Control, arXiv 2025. [arXiv]
[arXiv'25] Assemble Your Crew: Automatic Multi-agent Communication Topology Design via Autoregressive Graph Generation, arXiv 2025. [arXiv]
Survey & Position & Perspective:
[SPM'25] Brain Foundation Models: A Survey on Advancements in Neural Signal Processing and Brain Discovery, IEEE Signal Processing Magazine, 2025. [arXiv]
[ACL'25] A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges, ACL 2025. [arXiv]
[FnTs'25] Recommender Systems Meet Large Language Model Agents: A Survey, Foundations and Trends® in Privacy and Security, 2025. [link] [Preprint]
[KDD'25] A Survey on Trustworthy LLM Agents: Threats and Countermeasures, KDD 2025. [arXiv]
[arXiv'25] A comprehensive survey in llm (-agent) full stack safety: Data, training and deployment, arXiv 2025. [arXiv]
[arXiv'25] Position: Multimodal Large Language Models Can Significantly Advance Scientific Reasoning, arXiv 2025. [arXiv]
[arXiv'25] Topological Structure Learning Should Be A Research Priority for LLM-Based Multi-Agent Systems, arXiv 2025. [arXiv]
[arXiv'25] Aligning Multimodal LLM with Human Preference: A Survey, arXiv 2025. [arXiv]
[arXiv'25] A Survey on Post-training of Large Language Models, arXiv 2025. [arXiv]
KDD Workshop on AI Agent for Information Retrieval (KDD'25-Agent4IR)
WWW Workshop on AI Agent for Information Retrieval: Generating and Ranking (WWW'25-Agent4IR)
AAAI Workshop on AI Agent for Information Retrieval: Generating and Ranking (AAAI'25-Agent4IR)
CIKM Workshop on AI Agent for Information Retrieval (CIKM'24-Agent4IR)