Toggle navigation
OpenReview
.net
Login
×
Go to
DBLP
homepage
BAQET: BRAM-aware Quantization for Efficient Transformer Inference via Stream-based Architecture on an FPGA
LingChi Yang
,
Chi-Jui Chen
,
Trung Le
,
Bo-Cheng Lai
,
Scott Hauck
,
Shih-Chieh Hsu
Published: 01 Jan 2025, Last Modified: 12 May 2025
FPGA 2025
Everyone
Revisions
BibTeX
CC BY-SA 4.0
Loading