Summer 2025
| Date | Title | Presenter | Resources |
|---|---|---|---|
| May 23rd 2025 |
Agenda for Summer
planning |
||
| May 30th 2025 |
Intro to Pong and Gymnasium
presentation |
Lain |
📊 Slides 🔗 Code 🔗 Karpathy's write-up on Pong |
| June 6th 2025 |
Human-level control through deep reinforcement learning
presentation |
Nolan |
📊 Slides 📄 DQN Paper |
| June 13th 2025 |
Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations
presentation |
Lain |
📊 Slides 📄 Dexterous Manipulation Paper |
| June 20th 2025 |
Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review
presentation |
Will |
📊 Slides 📄 RL as Probabilistic Inference Paper |
| June 27th 2025 |
Implementing Reinforcement Learning from Human Feedback (RLHF): Best Practices and Challenges
presentation |
Lain |
📊 Slides 📄 Training language models to follow instructions with human feedback 📄 Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback |
| July 4th 2025 |
NO MEETING
break |
||
| July 11th 2025 |
Asynchronous Methods for Deep Reinforcement Learning
presentation |
Dr. Pingali |
📊 Slides 📄 Asynchronous Methods Paper |
| July 18th 2025 |
MuJoCo Playground
presentation |
Marie Elster |
📊 Slides 📄 MuJoCo Playground Paper |
| July 25th 2025 |
Mastering Diverse Domains through World Models (Hafner et al. 2023)
presentation |
Will |
📊 Slides 📄 Dreamer Paper |
| August 1st 2025 |
AlphaEvolve
presentation |
Lain |
📊 Slides 📄 AlphaEvolve Paper 📄 FunSearch Paper (2023) |