Reinforcement Learning

Channel: #reinforcement-learning

Co-leads: 


Goal:


Logistics:

Occurrences: Bi-weekly, Saturdays 8:30 PM (GMT+7)

Recordings of Recent Sessions

C4AI - Reinforcement Learning Group (2024-05-18 07:35 GMT-7)

May 18, 2024

C4AI - Reinforcement Learning Group (2024-05-11 07:41 GMT-7)

May 11, 2024

C4AI - Reinforcement Learning Group (2024-04-13 07:36 GMT-7)

April 13, 2024

C4AI - Reinforcement Learning Group (2024-04-06 07:36 GMT-7)

April 6, 2024

C4AI - Reinforcement Learning Group (2024-03-30 05:53 GMT-7)

Thang and Raseem lead a discussion on "Policy Gradient Methods for Reinforcement Learning with Function Approximation" (https://proceedings.neurips.cc/paper/1999/file/464d828b85b0bed98e80ade0a5c43b0f-Paper.pdf)

C4AI - Reinforcement Learning Group (2024-02-24 05:48 GMT-8)

RL Foundations Study Group - "Lecture 10: RL in games"

C4AI - Reinforcement Learning Group (2024-02-17 05:38 GMT-8)

RL Foundations Study Group - "Lecture 9: Exploration and Exploitation" (part 2)

Akifumi Wachi presents "Safe RL" (Reinforcement Learning) (2024-02-13 00:06 GMT-8)

Akifumi Wachi presents "Safe RL" 

C4AI - Reinforcement Learning Group (2024-02-10 05:40 GMT-8)

RL Foundations Study Group - "Lecture 9: Exploration and Exploitation" (part 1)

C4AI - Reinforcement Learning Group (2023-11-25 05:40 GMT-8)

RL Foundations Study Group - "Lecture 7: Policy Gradient"

stc-iaig-prb (2023-10-08 12:36 GMT-7)

RL Foundations Study Group - "Lecture 6: Value Function Approximation"

Costa Huang - Cleanba: A Reproducible Distributed Deep Reinforcement Learning Platform (RL Group) (2023-10-02 11:31 GMT-7)

Costa Huang, Machine Learning Engineer at Hugging Face, presents "Cleanba: A Reproducible Distributed Deep Reinforcement Learning Platform"

stc-iaig-prb (2023-09-24 20:34 GMT+1)

RL Foundations Study Group - "Lecture 3: Model Free Control"

Max Schwarzer - Sample-Efficient RL Through Scaling (RL Group) (2023-09-15 13:01 GMT-7)

Max Schwarzer discusses "Sample-Efficient RL Through Scaling", presenting his work on two papers: "Sample-Efficient Reinforcement Learning by Breaking the Replay Ratio Barrier" and its successor, "Bigger, Better, Faster: Human-Level Atari with Human-Level Efficiency"

C4AI - Reinforcement Learning Group (2023-08-10 07:35 GMT-4)

RL Foundations Study Group - "Lecture 2: Dynamic Programming"

C4AI - Reinforcement Learning Group (2023-07-27 07:37 GMT-4)

RL Foundations Study Group - "Lecture 1: Introduction to RL and MDP"

RL Learning Resources

Advanced Survey RL