Publications

2026

[Preprint] Zhiyu An, Wan Du (2026). Representational Homomorphism Predicts and Improves Compositional Generalization in Tranformer Language Model.

[Paper] [Code]

[Preprint] Zhiyu An, Duaa Nakshbandi, Wan Du (2026). Differential Voting: Loss Functions for Axiomatically Diverse Aggregation of Heterogeneous Preferences.

[Paper] [Code]

[Preprint] Zhiyu An, Wan Du (2026). DIML: Differentiable Inverse Mechanism Learning from Behaviors of Multi-Agent Learning Trajectories.

[Paper] [Code]

[ICLR] Zhibo Hou, Zhiyu An, Wan Du (2026). Beyond Noisy-TVs: Noise-Robust Exploration Via Learning Progress Monitoring.

ICLR 2026. [Paper] [Code]

[AAAI] Zhiyu An, Wan Du (2026). MoralReason: Generalizable Moral Decision Alignment for LLM Agents Using Reasoning-Level Reinforcement Learning.

AAAI 2026. [Paper] [Project Website] [Dataset] [Code]

2025

[L4DC] Zhiyu An, Zhibo Hou, Wan Du (2025). Disentangling Uncertainties by Learning Compressed Data Representation.

7th Annual Learning for Dynamics & Control Conference [Paper] [Code]

[HICSS] Xianzhong Ding, Wanshi Hong, Zhiyu An, Bin Wang, Wan Du (2025). Deepot: Parking Lot Identification Using Low-Resolution Satellite Imagery

The 58th Hawaii International Conference on System Sciences 🏆 Best Paper Award [Paper]

[IEEE IoT-J] Xianzhong Ding*, Zhiyu An*, Arya Rathee, Wan Du (2025). A Safe and Data-efficient Model-based Reinforcement Learning System for HVAC Control

IEEE Internet of Things Journal (* denotes equal contribution) [Paper]

[SIGIR Workshop] Zhiyu An, Xianzhong Ding, Yen-Chun Fu, Cheng-Chung Chu, Yan Li, Wan Du (2025). Golden-Retriever: High-Fidelity Agentic Retrieval Augmented Generation for Industrial Knowledge Base.

SIGIR workshop on Robust Information Retrieval [Paper] Work done while interning at Western Digital.

2024

[ICLR] Zhiyu An, Xianzhong Ding, Wan Du (2024). Reward Bound for Behavioral Guarantee of Model-Based Planning Agents.

Tiny Paper Track, International Conference on Learning Representations (Invited to Present) [Paper] [Open Review]

[DAC] Zhiyu An, Xianzhong Ding, Wan Du (2024). Go Beyond Black-box Policies: Rethinking the Design of Learning Agent for Interpretable and Verifiable HVAC Control.

The 61st ACM/IEEE Design Automation Conference [Paper] [Code] [Slides] [Presentation Recording] [Poster]

[KDD] Yuning Chen, Kang Yang, Zhiyu An, Brady Holder, Luke Paloutzian, Khaled M. Bali, Wan Du (2024). MARLP: Time-series Forecasting Control for Agricultural Managed Aquifer Recharge

The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining [Paper]

2023

[BuildSys] Zhiyu An, Xianzhong Ding, Arya Rathee, Wan Du (2023). CLUE: Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.

ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation 🏆 Best Paper Award Runner-Up [Paper] [Code] [Slides]

[SenSys Poster] Zhiyu An, Xianzhong Ding, Wan Du (2023). Data Efficient HVAC Control using Gaussian Process-based Reinforcement Learning

The ACM Conference on Embedded Networked Sensor Systems [Abstract]

Page updated

Google Sites

Report abuse