Towards Characterizing Knowledge Distillation of PPG Heart Rate Estimation Models

Kanav Arora; Girish Narayanswamy; Shwetak Patel; Richard Li

Towards Characterizing Knowledge Distillation of PPG Heart Rate Estimation Models

Kanav Arora, Girish Narayanswamy, Shwetak Patel, Richard Li

Published: 23 Sept 2025, Last Modified: 01 Dec 2025TS4H NeurIPS 2025 PosterEveryoneRevisionsBibTeXCC BY 4.0

Keywords: PPG, heart rate, knowledge distillation, scaling law

TL;DR: We explore and characterize how a large PPG heart rate estimation model might be distilled into a smaller model appropriate for running on-device, in real-time, on wearable devices.

Abstract: Heart rate estimation from photoplethysmography (PPG) signals generated by wearable devices such as smartwatches and fitness trackers has significant implications for the health and well-being of individuals. Although prior work has demonstrated deep learning models with strong performance in the heart rate estimation task, in order to deploy these models on wearable devices, these models must also adhere to strict memory and latency constraints. In this work, we explore and characterize how large pre-trained PPG models may be distilled to smaller models appropriate for real-time inference on the edge. We evaluate four distillation strategies through comprehensive sweeps of teacher and student model capacities: (1) hard distillation, (2) soft distillation, (3) decoupled knowledge distillation (DKD), and (4) feature distillation. We present a characterization of the resulting scaling laws describing the relationship between model size and performance. This early investigation lays the groundwork for practical and predictable methods for building edge-deployable models for physiological sensing.

Submission Number: 19

Loading