My research centers on understanding the effects of post-training methods, especially SFT and reinforcement learning, on large language models. I investigate issues such as forgetting and generalization, and explore how training dynamics influence model capabilities.
Outside of research, I enjoy movies, hiking, badminton, squash, tennis, and reading novels and mathematics books.