Arnav Kumar Jain
PhD student, DIRO, Université de Montréal
Joined May 2021
Names
Arnav Kumar Jain (Preferred)
Emails
****@gmail.com (Confirmed)
****@mila.quebec (Confirmed)
****@umontreal.ca (Confirmed)
****@microsoft.com
****@cohere.com (Confirmed)
Career & Education History
PhD student
DIRO, Université de Montréal (umontreal.ca)
2022 – 2026
MS student
Indian Institute of Technology Kharagpur (iitkgp.ac.in)
2016 – 2018
Undergrad student
Indian Institute of Technology Kharagpur (iitkgp.ac.in)
2013 – 2016
Advisors, Relations & Conflicts
Coauthor
Avisek Lahiri
2016 – 2020
Expertise
Reasoning for Large Language Models
2025 – Present
Code Generation
2024 – Present
Reinforcement Learning from Human Feedback (RLHF)
2024 – Present
Inverse Reinforcement Learning
2023 – Present
Deep Reinforcement Learning
2021 – Present
Publications
Co-Authors
- Abhinav Agarwalla
- Alexander M Rush
- Alexander M. Rush
- Amit S
- Amit Singh
- Annabelle Martin
- Arpit Tarang Saxena
- Atiksh Bhardwaj
- Avisek Lahiri
- Danijar Hafner
- Deepak Saini
- Divyasri Nadendla
- Glen Berseth
- Gokul Swamy
- Gonzalo Gonzalez-Pumariega
- Gundeep Arora
- Hadia Mohmmed Osman Ahmed Samil
- Harley Wiltzer
- Irina Rish
- Jayanta Mukhopadhyay
- Jesse Farebrother
- Jian Jiao
- Juntao Ren
- Justin T Chiu
- K. V. Manohar