Deep learning efficiency strategies for edge device deployment
2024 – Present
Extreme compression methods for LLMs, VLMs, ViTs, and related architectures
2024 – Present
Hardware acceleration approaches for multi-modal models on edge devices
2024 – Present
Post-training optimization techniques for multi-modal model efficiency
2023 – Present