OpenReview.net
  • Login

Arnaud Bergeron

Pronouns: he/him

MS student, DIRO, Mila - Quebec Artificial Intelligence Institute

  • Joined January 2025

Names

Arnaud Bergeron (Preferred)
  • Suggest Name

Emails

****@mila.quebec (Confirmed)
  • Suggest Email

Personal Links

Homepage
Google Scholar
DBLP
  • Suggest URL

Career & Education History

MS student
DIRO, Mila - Quebec Artificial Intelligence Institute (mila.quebec)
2024 – 2026
 
  • Suggest Position

Advisors, Relations & Conflicts

PhD Advisor
Nicolas Le Roux
2024 – 2026
 
  • Suggest Relation

Expertise

RLHF
2024 – 2026
 
  • Suggest Expertise

Publications

  • Tapered Off-Policy REINFORCE - Stable and efficient reinforcement learning for large language models

    Nicolas Le Roux, Marc G Bellemare, Jonathan Lebensold, Arnaud Bergeron, Joshua Greaves, Alexandre Fréchette, Carolyne Pelletier, Eric Thibodeau-Laufer, Sándor Tóth, Sam Work
    • NeurIPS 2025 poster
    • Readers: Everyone
  • Tapered Off-Policy REINFORCE: Stable and efficient reinforcement learning for LLMs

    Nicolas Le Roux, Marc G. Bellemare, Jonathan Lebensold, Arnaud Bergeron, Joshua Greaves, Alexandre Fréchette, Carolyne Pelletier, Eric Thibodeau-Laufer, Sándor Tóth, Sam Work
    • CoRR 2025
    • Readers: Everyone

Co-Authors

  • Alexandre Fréchette
  • Carolyne Pelletier
  • Eric Thibodeau-Laufer
  • Jonathan Lebensold
  • Joshua Greaves
  • Marc G Bellemare
  • Marc G. Bellemare
  • Nicolas Le Roux
  • Sam Work
  • Sándor Tóth
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Contact
  • Sponsors
  • Donate
  • FAQ
  • Terms of Use / Privacy Policy
  • News
  • About OpenReview
  • Hosting a Venue
  • All Venues
  • Sponsors
  • News
  • FAQ
  • Contact
  • Donate
  • Terms of Use
  • Privacy Policy

OpenReview is a long-term project to advance science through improved peer review with legal nonprofit status. We gratefully acknowledge the support of the OpenReview Sponsors. © 2025 OpenReview