Reinforcement Learning Reading Group

When: Tuesdays, 10am

Where: ICICS Room 238 / Zoom (link in mail)

Our reading group consist in a person leading the paper presentation with the aim at fostering live discussions and critics on the ideas and methods proposed, with the goal to understand the why and the how a particular paper is interesting.

Use this spreadsheet to sign up as a presenter.

To subscribe to the mailing list for talk announcements and other updates, send a message to majordomo@cs.ubc.ca with the words subscribe rlrg-l in the body.

  • If the above command doesn't work, you should try again with an empty subject, and the following message body
    
                  subscribe rlrg-l YOUR-EMAIL-ADDRESS
                  end
                
  • If you are using gmail, try using the web version and select "plain text mode"
  • Try to remove the signature
  • If you want to unsubscribe, send an email to majordomo@cs.ubc.ca with the words unsubscribe rlrg-l YOUR-EMAIL-ADDRESS in the body.
  • For more info on how majordomo mailing system works, you can check this page.
  • If you are still having issues, contact Daniele (dreda@cs.ubc.ca) and he will register you manually.

  • Both slides and directly presenting from the paper in hand is fine.
  • These are some template points we try to go through during the reading group (you don't need to understand and know everything, it's a *group* process):
    • Why did you choose this paper?
    • What can you tell us about the authors?
    • What is the general context for the ideas in this paper?
    • What is the high-level idea of the paper, in a few sentences?
    • Overview of method/algorithm/theory
    • What was difficult to understand?
    • Results: most important figures and tables
    • Discussion: impact/assumptions/evaluation
    • Future work: what should be done next?
  • A (non-complete) list of topics covered in this reading group is available here.

Upcoming presentations

Date Presenter Paper or topic Link
Jan 17 Jenny Fine-Tuning Language Models from Human Preferences paper
Jan 24 Yuval Tassa (GUEST TALK) Predictive Sampling: Real-time Behaviour Synthesis with MuJoCo paper
Jan 31 Daniele Dreamer v3 - Mastering Diverse Domains through World Models website
Feb 7 Shengran Inner Monologue: Embodied Reasoning through Planning with Language Models website
Feb 14 Rui Discovering faster matrix multiplication algorithms with reinforcement learning paper
Feb 21 -- READING WEEK BREAK
Feb 28 Niloofar PADL: Language-Directed Physics-Based Character Control paper
Mar 7 - CANCELLED FOR FACULTY TALK
Mar 14 Aaron In-context Reinforcement Learning with Algorithm Distillation paper
Mar 21 Amit Bermano (GUEST TALK)
Mar 28 Nick Magnetic control of tokamak plasmas through deep reinforcement learning paper
Apr 4 Yuni ALAN : Autonomously Exploring Robotic Agents in the Real World website
Apr 11
Apr 18
Apr 25