OpenReview
.net
OpenReview
.net
Login
OpenReview
.net
Login
Yoshi Suhara
Researcher, NVIDIA
Joined
July 2019
Names
Yoshi Suhara
(Preferred)
,
Yoshihiko Suhara
Emails
****@megagon.ai
(Confirmed)
,
****@gmail.com
(Confirmed)
,
****@acm.org
(Confirmed)
,
****@grammarly.com
(Confirmed)
,
****@gmail.com
(Confirmed)
,
****@nvidia.com
(Confirmed)
Personal Links
Homepage
Google Scholar
DBLP
ORCID
LinkedIn
Semantic Scholar
ACL Anthology
Career & Education History
Researcher
NVIDIA
(nvidia.com)
2023
–
Present
Researcher
Grammarly
(grammarly.com)
2022
–
2023
Researcher
Megagon Labs
(megagon.ai)
2017
–
2022
Researcher
Massachusetts Institute of Technology
(mit.edu)
2014
–
2017
Researcher
NTT
(lab.ntt.co.jp)
2008
–
2014
Advisors, Relations & Conflicts
No relations added
Expertise
Large language models
,
small language models
2023
–
Present
text summarization
,
information extraction
,
deep learning for tables
2017
–
2023
computational social science
,
affective computing
2014
–
2016
information retrieval
,
learning to rank
,
geospatial data mining
2008
–
2014
Publications
Orak: A Foundational Benchmark for Training and Evaluating LLM Agents on Diverse Video Games
Dongmin Park
,
Minkyu Kim
,
Beongjun Choi
,
Junhyuck Kim
,
Keon Lee
,
Jonghyun Lee
,
Inkyu Park
,
Byeong-Uk Lee
,
Jaeyoung Hwang
,
Jaewoo Ahn
,
Ameya Sunil Mahabaleshwarkar
,
Bilal Kartal
,
Pritam Biswas
,
Yoshi Suhara
,
Kangwook Lee
,
Jaewoong Cho
ICLR 2026 Poster
Readers:
Everyone
Llama-Nemotron: Efficient Reasoning Models
Soumye Singhal
,
Jiaqi Zeng
,
Alexander Bukharin
,
Yian Zhang
,
Gerald Shen
,
Ameya Sunil Mahabaleshwarkar
,
Bilal Kartal
,
Yoshi Suhara
,
Akhiad Bercovich
,
Itay Levy
,
Izik Golan
,
Mohammed Dabbah
,
Ran El-Yaniv
,
Somshubra Majumdar
,
Igor Gitman
,
Evelina Bakhturina
,
Jimmy J. Zhang
,
Bor-Yiing Su
,
Guyue Huang
,
Izzy Putterman
et al. (7 additional authors not shown)
EXAIT@ICML 2025 Poster
Readers:
Everyone
Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
Ali Taghibakhshi
,
Sharath Turuvekere Sreenivas
,
Saurav Muralidharan
,
Marcin Chochowski
,
Yashaswi Karnati
,
Raviraj Bhuminand Joshi
,
Ameya Sunil Mahabaleshwarkar
,
ZIJIA CHEN
,
Yoshi Suhara
,
Oluwatobi Olabiyi
,
Daniel Korzekwa
,
Mostofa Patwary
,
Mohammad Shoeybi
,
Jan Kautz
,
Bryan Catanzaro
,
Ashwath Aithal
,
Nima Tajbakhsh
,
Pavlo Molchanov
NeurIPS 2025 poster
Readers:
Everyone
Nemotron-CLIMB: Clustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training
Shizhe Diao
,
Yu Yang
,
Yonggan Fu
,
Xin Dong
,
Dan SU
,
Markus Kliegl
,
ZIJIA CHEN
,
Peter Belcak
,
Yoshi Suhara
,
Hongxu Yin
,
Mostofa Patwary
,
Yingyan Celine Lin
,
Jan Kautz
,
Pavlo Molchanov
NeurIPS 2025 Datasets and Benchmarks Track spotlight
Readers:
Everyone
LLM Pruning and Distillation in Practice
Sharath Turuvekere Sreenivas
,
Saurav Muralidharan
,
Raviraj Bhuminand Joshi
,
Marcin Chochowski
,
Mostofa Patwary
,
Pavlo Molchanov
,
Mohammad Shoeybi
,
Jan Kautz
,
Ameya Sunil Mahabaleshwarkar
,
Gerald Shen
,
Jiaqi Zeng
,
Oleksii Kuchaiev
,
ZIJIA CHEN
,
Yoshi Suhara
,
Shizhe Diao
,
Chenhan D. Yu
,
Wei-Chun Chen
,
Hayley Ross
,
Daniel Korzekwa
,
Oluwatobi Olabiyi
et al. (2 additional authors not shown)
Submitted to ICLR 2025
Readers:
Everyone
Hymba: A Hybrid-head Architecture for Small Language Models
Xin Dong
,
Yonggan Fu
,
Shizhe Diao
,
Wonmin Byeon
,
ZIJIA CHEN
,
Ameya Sunil Mahabaleshwarkar
,
Shih-Yang Liu
,
Matthijs Van keirsbilck
,
Min-Hung Chen
,
Yoshi Suhara
,
Yingyan Celine Lin
,
Jan Kautz
,
Pavlo Molchanov
ICLR 2025 Spotlight
Readers:
Everyone
Extracting Salient Facts from Company Reviews with Scarce Labels
Jinfeng Li
,
Nikita Bhutani
,
Alexander Whedon
,
Chieh-Yang Huang
,
Estevam Hruschka
,
Yoshihiko Suhara
20 Jan 2022
OpenReview Archive Direct Upload
Readers:
Everyone
Annotating Columns with Pre-trained Language Models
Yoshihiko Suhara
,
Jinfeng Li
,
Yuliang Li
,
Dan Zhang
,
Çagatay Demiralp
,
Chen Chen
,
Wang-Chiew Tan
2022 (modified: 17 Apr 2023)
SIGMOD Conference 2022
Readers:
Everyone
Comparative Opinion Summarization via Collaborative Decoding
Hayate Iso
,
Xiaolan Wang
,
Stefanos Angelidis
,
Yoshihiko Suhara
2022 (modified: 17 Apr 2023)
ACL (Findings) 2022
Readers:
Everyone
Annotating Columns with Pre-trained Language Models
Yoshihiko Suhara
,
Jinfeng Li
,
Yuliang Li
,
Dan Zhang
,
Çagatay Demiralp
,
Chen Chen
,
Wang-Chiew Tan
2021 (modified: 16 Jun 2021)
CoRR 2021
Readers:
Everyone
View all 53 publications
Co-Authors
Aaron Feng
Aaron Traylor
Akari Asai
Akhiad Bercovich
Akito Sakurai
Alex 'Sandy' Pentland
Alexander Bukharin
Alexander Whedon
Ali Taghibakhshi
Alon Y. Halevy
Ameya Sunil Mahabaleshwarkar
AnHai Doan
Andrei Lopatenko
Andrey Bogomolov
Ashwath Aithal
Behzad Golshan
Beongjun Choi
Bilal Kartal
Bor-Yiing Su
Boris Ginsburg
Bruno Lepri
Bruno V. Ferreira
Bryan Catanzaro
Burçin Bozkaya
Byeong-Uk Lee
View all 137 co-authors