Toggle navigation
OpenReview
.net
Login
×
Back to
NeurIPS
NeurIPS 2024 Workshop Compression Submissions
QIANets: Quantum-Integrated Adaptive Networks for Reduced Latency and Improved Inference Times in CNN Models
Zhumazhan Balapanov
,
Vanessa Matvei
,
Olivia Holmberg
,
Edward Magongo
,
Kevin Zhu
,
Jonathan Pei
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Interactions Across Blocks in Post-Training Quantization of Large Language Models
Khasmamad Shabanovi
,
Lukas Wiest
,
Vladimir Golkov
,
Daniel Cremers
,
Thomas Pfeil
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
LORC: Low-Rank Compression for LLMs KV Cache with a Progressive Compression Strategy
Rongzhi Zhang
,
Kuan Wang
,
Liyuan Liu
,
Shuohang Wang
,
Hao Cheng
,
Chao Zhang
,
yelong shen
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
BinaryDM: Accurate Weight Binarization for Efficient Diffusion Models
Xingyu Zheng
,
Xianglong Liu
,
Haotong Qin
,
Xudong Ma
,
Mingyuan Zhang
,
Haojie Hao
,
Jiakai Wang
,
Zixiang Zhao
,
Jinyang Guo
,
Michele Magno
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Latent Probabilistic Dataset Distillation with Theoretical Guarantees
Progyan Das
,
Shrutimoy Das
,
Anirban Dasgupta
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
CDQuant: Greedy Coordinate Descent for Accurate LLM Quantization
Pranav Ajit Nair
,
Arun Suggala
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Non-interactive Remote Coordination
Yassine Hamdi
,
Xueyan Niu
,
Bo Bai
,
Deniz Gunduz
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Layer-wise Quantization for Distributed Variational Inequalities
Anh Duc Nguyen
,
Ilia Markov
,
Ali Ramezani-Kebrya
,
Kimon Antonakopoulos
,
Dan Alistarh
,
Volkan Cevher
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Communication Compression for Tensor Parallel LLM Inference
Jan Hansen-Palmus
,
Michael Truong Le
,
Oliver Hausdörfer
,
Alok Verma
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Large Language Model Compression with Neural Architecture Search
Rhea Sanjay Sukthanker
,
Benedikt Staffler
,
Frank Hutter
,
Aaron Klein
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
LLM Vocabulary Compression for Low-Compute Environments
Sreeram Vennam
,
Anish R Joishy
,
Ponnurangam Kumaraguru
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Benchmarking neural lossless compression algorithms on multi-purpose astronomical image data
Tuan Truong
,
Rithwik Sudharsan
,
Yibo Yang
,
Peter Xiangyuan Ma
,
Ruihan Yang
,
Stephan Mandt
,
Joshua S. Bloom
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
LiteVAR: Compressing Visual Autoregressive Modelling with Efficient Attention and Quantization
Rui Xie
,
Tianchen Zhao
,
Zhihang Yuan
,
Rui Wan
,
Wenxi Gao
,
Zhenhua Zhu
,
Xuefei Ning
,
Yu Wang
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
A Theory for Compressibility of Graph Transformers for Transductive Learning
Hamed Shirzad
,
Honghao Lin
,
Ameya Velingker
,
Balaji Venkatachalam
,
David Woodruff
,
Danica J. Sutherland
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Bridging the Gap between Diffusion Models and Universal Quantization for Image Compression
Lucas Relic
,
Roberto Azevedo
,
Yang Zhang
,
Markus Gross
,
Christopher Schroers
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Towards Scalable Compression with Universally Quantized Diffusion Models
Yibo Yang
,
Justus Will
,
Stephan Mandt
Published: 09 Oct 2024, Last Modified: 13 Dec 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Deep Clustering with Associative Memories
Bishwajit Saha
,
Dmitry Krotov
,
Mohammed J Zaki
,
Parikshit Ram
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
LSH-E Tells You What To Discard: An Adaptive Locality-Sensitive Strategy for KV Cache Compression
Tahseen Rabbani
,
Minghui Liu
,
Tony O'Halloran
,
Ananth Sankaralingam
,
Mary-Anne Hartley
,
Furong Huang
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Exploiting Temporal Priors for Efficient Real-time Compression and Feedback of Wireless Channels
Akshay Malhotra
,
Mohamed Salah Ibrahim
,
Keya Patani
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Learning to Compress: Local Rank and Information Compression in Deep Neural Networks
Niket Nikul Patel
,
Ravid Shwartz-Ziv
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Simple LLM Compression Recovery Using Dynamic Prompting with Theoretical Analysis
Duc N.M Hoang
,
Minsik Cho
,
Thomas Merth
,
Mohammad Rastegari
,
Zhangyang Wang
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
FinerCut: Finer-grained Interpretable Layer Pruning for Large Language Models
Yang Zhang
,
Yawei Li
,
Xinpeng Wang
,
Qianli Shen
,
Barbara Plank
,
Bernd Bischl
,
Mina Rezaei
,
Kenji Kawaguchi
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Adaptive Quantization and Pruning of Deep Neural Networks via Layer Importance Estimation
Tushar Shinde
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Learnable Fourier-based Activations for Implicit Signal Representations
Parsa Mojarad Adi
,
Ali Mehrabian
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
Randomly Pivoted V-optimal Design: Fast Data Selection under Low Intrinsic Dimension
Yijun Dong
,
Xiang Pan
,
Hoang Phan
,
Qi Lei
Published: 09 Oct 2024, Last Modified: 19 Nov 2024
Compression Workshop @ NeurIPS 2024
Readers:
Everyone
«
‹
1
2
3
4
›
»