[Preprint] Zhiyu An, Wan Du (2026). Representational Homomorphism Predicts and Improves Compositional Generalization in Transformer Language Model.
[Preprint] Zhiyu An, Duaa Nakshbandi, Wan Du (2026). Differential Voting: Loss Functions for Axiomatically Diverse Aggregation of Heterogeneous Preferences.
[Preprint] Zhiyu An, Wan Du (2026). DIML: Differentiable Inverse Mechanism Learning from Behaviors of Multi-Agent Learning Trajectories.
[ICLR] Zhibo Hou, Zhiyu An, Wan Du (2026). Beyond Noisy-TVs: Noise-Robust Exploration Via Learning Progress Monitoring.
[AAAI] Zhiyu An, Wan Du (2026). MoralReason: Generalizable Moral Decision Alignment for LLM Agents Using Reasoning-Level Reinforcement Learning.
AAAI 2026. [Paper] [Project Website] [Dataset] [Code]
[L4DC] Zhiyu An, Zhibo Hou, Wan Du (2025). Disentangling Uncertainties by Learning Compressed Data Representation.
7th Annual Learning for Dynamics & Control Conference [Paper] [Code]
[HICSS] Xianzhong Ding, Wanshi Hong, Zhiyu An, Bin Wang, Wan Du (2025). Deepot: Parking Lot Identification Using Low-Resolution Satellite Imagery.
The 58th Hawaii International Conference on System Sciences 🏆 Best Paper Award [Paper]
[IEEE IoT-J] Xianzhong Ding*, Zhiyu An*, Arya Rathee, Wan Du (2025). A Safe and Data-efficient Model-based Reinforcement Learning System for HVAC Control.
IEEE Internet of Things Journal (* denotes equal contribution) [Paper]
[SIGIR Workshop] Zhiyu An, Xianzhong Ding, Yen-Chun Fu, Cheng-Chung Chu, Yan Li, Wan Du (2025). Golden-Retriever: High-Fidelity Agentic Retrieval Augmented Generation for Industrial Knowledge Base.
SIGIR Workshop on Robust Information Retrieval [Paper] Work done while interning at Western Digital.
[ICLR] Zhiyu An, Xianzhong Ding, Wan Du (2024). Reward Bound for Behavioral Guarantee of Model-Based Planning Agents.
Tiny Paper Track, International Conference on Learning Representations (Invited to Present) [Paper] [OpenReview]
[DAC] Zhiyu An, Xianzhong Ding, Wan Du (2024). Go Beyond Black-box Policies: Rethinking the Design of Learning Agent for Interpretable and Verifiable HVAC Control.
The 61st ACM/IEEE Design Automation Conference [Paper] [Code] [Slides] [Presentation Recording] [Poster]
[KDD] Yuning Chen, Kang Yang, Zhiyu An, Brady Holder, Luke Paloutzian, Khaled M. Bali, Wan Du (2024). MARLP: Time-series Forecasting Control for Agricultural Managed Aquifer Recharge.
The 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining [Paper]
[BuildSys] Zhiyu An, Xianzhong Ding, Arya Rathee, Wan Du (2023). CLUE: Safe Model-Based RL HVAC Control Using Epistemic Uncertainty Estimation.
ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation 🏆 Best Paper Award Runner-Up [Paper] [Code] [Slides]
[SenSys Poster] Zhiyu An, Xianzhong Ding, Wan Du (2023). Data Efficient HVAC Control using Gaussian Process-based Reinforcement Learning.
The ACM Conference on Embedded Networked Sensor Systems [Abstract]