BeetleFlow: An Integrative Deep Learning Pipeline for Beetle Image Processing

Published: 09 Oct 2025, Last Modified: 09 Oct 2025NeurIPS 2025 Workshop ImageomicsEveryoneRevisionsBibTeXCC BY 4.0
Submission Track: Short papers presenting ongoing research or work submitted to other venues (up to 5 pages, excluding references)
Keywords: Computer Vision, Image Processing, Open-Vocabulary Object Detection, Fine-grained Morphological Segmentation
TL;DR: We propose an integrative pipeline that uses an open-vocabulary detector and a segmentation model to identify, crop, and analyze beetle specimens from large-scale tray images.
Abstract: In entomology and ecology research, biologists often need to collect a large number of insects, among which beetles are the most common species. A common practice for biologists to organize beetles is to place them on trays and take a picture of each tray to digitize them. Given the images of thousands of such trays, it is important to have an automated pipeline to process the large-scale image data for further research. Therefore, we develop a 3-stage pipeline to detect all the beetles on each tray, sort and crop the image of each beetle, and do morphological segmentation on the cropped beetles. For detection, we design an iterative process utilizing a transformer-based open-vocabulary object detector and a vision-language model to comprehensively detect all beetles in the tray. For segmentation, we manually labeled 670 beetle images and fine-tuned two variants of a transformer-based segmentation model to achieve fine-grained segmentation of beetles with relatively high accuracy. The pipeline integrates multiple deep learning methods and is specialized for beetle image processing, which can greatly improve the efficiency to process large-scale beetle data and accelerate biological research.
Submission Number: 60
Loading