ISCA 2025 Workshop MLArchSys Submissions
How Low Can LoRA Go: System-Level Throughput, Energy, and Model Quality Tradeoffs when Fine-Tuning Adapters
Connor Espenshade, Umesh Deshpande, Yue Zhu, Eun Kyung Lee, Martha A Kim
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
Minstrel: Application-Aware SLM Inference Optimization on Edge Devices
Bakshree Mishra
Published: 21 May 2025, Last Modified: 21 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
ProtocolLLM: RTL Benchmark for SystemVerilog Code Generation of Communication Protocols
Arnav Miteshkumar Sheth, Ivaxi Sheth, Mario Fritz
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
SPADA: Secure, Performant, and Distributed LLM Inference
Kexin Chu, Jianchang Su, Wei Zhang
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Poster
Readers: Everyone
Runtime Attestation for Secure LLM Serving in Cloud-Native Trusted Execution Environments
Jianchang Su, Wei Zhang
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
SafeKV: Safe KV-Cache Sharing in LLM Serving
Kexin Chu, Zixu Shen, Dawei Xiang, Wei Zhang
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
Constrained bit allocation for mixed-precision deep neural networks
Souleyman Boudouh, Simla Burcu Harma, Abdulrahman Mahmoud, Babak Falsafi
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
Empirical Training Time Prediction for LLM Fine-Tuning Using Scaling Laws
Chianing Wang, Younghyun Cho
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
GraphSnapShot: A System for Graph Machine Learning Acceleration
Dong Liu, Yanxuan Yu
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation
Charles Hong, Brendan Roberts, Huijae An, Alex Um, Advay Ratan, Sophia Shao
Published: 21 May 2025, Last Modified: 21 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
Autocomp: LLM-Driven Code Optimization for Tensor Accelerators
Charles Hong, Sahil Bhatia, Alvin Cheung, Sophia Shao
Published: 21 May 2025, Last Modified: 21 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
EQuARX: Efficient Quantized AllReduce in XLA for Distributed Machine Learning Acceleration
Ibrahim Ahmed, Clemens JS Schaefer, Gil Tabak, Zenong Zhang, Felix Chern, Anatoliy Yevtushenko, Andy Davis
Published: 21 May 2025, Last Modified: 21 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
Efficient RL-based Cache Vulnerability Exploration by Penalizing Useless Agent Actions
Kanato Nakanishi, Soramichi Akiyama
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
Benchmarking Efficiency Techniques in GenAI Foundation Models Using an Elo-Based Performance Evaluation Framework
Saman Keon, Summer Deng, Bram Wasti, Joshua Wolff Fromm
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Poster
Readers: Everyone
A Uniform, Tessellated Architecture for Energy-Efficient Learning and Inference
Jerry Felix, Steve Brunker, Carol Hibbard
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Poster
Readers: Everyone
Leveraging LLMs to Improve Hardware-Software Co-Design Workflow Productivity and Accessibility
Kavya Sreedhar, Josh Ogbonda, Pengqi Yin, Narges Shahidi, Kanthi Nagaraj, Zhijie Deng, Rami Cohen, Ton Kalker, Sameer Kumar, Amir Yazdanbakhsh, Suvinay Subramanian
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Poster
Readers: Everyone
RECAP: Training-Free Compensation for Coarse Activation Channel Pruning in Compressed LLMs
Mingyu Lee, Akshat Ramachandran, Tushar Krishna
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
Limiting Network Bandwidth to Unleash Throughput for Serverless Systems
Prasoon Sinha, Sidharth N. Babu, Neeraja J. Yadwadkar
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone
Profile-Guided Quantization: a compiler solution to automate quantization for efficient LLM training
Gil Tabak, Clemens JS Schaefer, Xiaofan Zhang, Denali Molitor, Jinliang Wei, Zongwei Zhou, Philip G Hendrix, Mitchelle Rasquinha
Published: 21 May 2025, Last Modified: 17 Jun 2025
MLArchSys 2025 Oral
Readers: Everyone