Scaling Frog Monitoring with FrogID: A Robust Classification Pipeline for Citizen Science Using Bioacoustic Foundation Models

Published: 02 Oct 2025, Last Modified: 02 Dec 2025NeurIPS 2025 AiForAnimalComms WorkshopEveryoneRevisionsBibTeXCC BY 4.0
Keywords: bioacoustics, AI, machine learning, species identification, citizen science, foundation models, biodiversity monitoring, frog conservation
TL;DR: Citizen science frog monitoring is scaled by a pipeline combining source separation, an audio-language foundation model, and cross-taxa transfer learning, reducing manual validation bottlenecks and advancing biodiversity conservation.
Abstract: Amidst global biodiversity declines, audio-based citizen science projects offer significant potential for biodiversity monitoring, but the need for manual validation limits scalability. The FrogID project has gathered over 1.3 million frog records from over 800,000 audio submissions, advancing amphibian research and conservation in Australia, yet manual species identification remains time-consuming, creating backlogs, delaying conservation action, and reducing user engagement. We present a frog species identification pipeline that combines unsupervised source separation with an audio-language foundation model to refine coarse annotations, followed by transfer learning from cross-taxa embeddings with a hybrid classifier. The method achieves strong per-species performance even on non-quality audio, enabling scalable frog monitoring.
Submission Number: 16
Loading