Inverse Reinforcement Learning Meets Large Language Model Alignment