Postdoctoral researcher at Meta AI Research.
My research focuses on problems that are important to a general, reinforcement-learning-based intelligent system, but are not well understood yet.
I obtained my Ph.D. in Computing Science at the University of Alberta, supervised by Professor Richard S. Sutton. I earned my Bachelor's degree in Electrical and Computer Engineering (ECE) from Shanghai Jiao Tong University (SJTU), where I worked in Professor Kai Yu's speech group. After that, I earned my Master's degree, also in ECE, from the University of Michigan. I worked on the application of reinforcement learning in robotics when I was in Michigan, supervised by Professor Ben Kuipers.
Email: yiwan@meta.com
Publications
Loosely Consistent Emphatic Temporal-Difference Learning.
The Emphatic Approach to Average-Reward Policy Evaluation.
Planning with Expectation Models for Control.
Off-policy Maximum Entropy Reinforcement Learning: Soft Actor-Critic with Advantage Weighted Mixture Policy (SAC-AWMP).
Code
Services
Journal Reviewer: TMLR (2022, 2023)
Conference Reviewer: NeurIPS (2021, 2022, 2023), ICML (2022), ICLR (2020, 2021, 2022), CoLLAs (2022, 2023), AAAI (2023)
Workshop Reviewer: Decision Aware RL workshop in ICML (2022), RL4RealLife workshop in ICML (2021), optimization for machine learning workshop in NeurIPS (2022).
Organizer: Continuing (Non-Episodic) RL problems social at ICML (2021), Designing an RL system toward AGI social at ICML (2022).
Volunteer: ICML (2022) session moderator
Teaching
Reinforcement Learning II (COMPT 609) 2020, 2021, 2022 Teaching Assistant. Guest lecture: A Second Tutorial on Tabular TD(λ), Slides
Reinforcement Learning I (COMPT 366) 2018 Teaching Assistant
Skiing
Winter is long, skiing is fun.
Marmot Basin, Jasper, Canada
Photo from a video filmed by Shangtong Zhang
Blackcomb Glacier Ice Cave, Whistler, Canada