Real-time Breast Lesion Detection in Videos via Spatial-temporal Feature Aggregation

Published: 27 Mar 2025, Last Modified: 02 Jun 2025
Venue: MIDL 2025 Poster
License: CC BY 4.0
Keywords: Breast lesion, Ultrasound video, Real-time detection, Transformer
Abstract: Recently, transformer-based detectors have shown impressive performance for breast lesion detection in ultrasound videos. However, these methods often require substantial computational resources and exhibit low inference speed, which limits their real-time applicability. To address this issue, we introduce a fast yet accurate spatial-temporal transformer, named FA-DETR, that efficiently aggregates multi-scale spatial-temporal features for breast lesion detection in ultrasound videos. FA-DETR is built on a lightweight spatial-temporal self-attention module, which fuses the spatial and temporal features extracted from each video frame. In the decoding phase, we employ IoU-aware query selection to generate independent queries for each frame. These queries gain access to rich spatial-temporal information through encoder-embedding cross-attention and frame-aware cross-attention mechanisms. Experiments conducted on a public breast lesion ultrasound video dataset demonstrate that FA-DETR achieves state-of-the-art performance, with an absolute gain of 3.8% in overall AP while being 2.5 times faster than the best existing approach in the literature. Our code and models will be publicly released.
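The per-frame query selection mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes each encoder token already carries a single score that reflects both class confidence and localization quality (a common reading of "IoU-aware"), and it simply keeps the top-k tokens of each frame independently. The function name `select_queries_per_frame`, the score layout, and the value of `k` are hypothetical.

```python
from typing import List


def select_queries_per_frame(scores: List[List[float]], k: int) -> List[List[int]]:
    """Return, for each frame, the indices of its k highest-scoring tokens.

    scores[f][i] is an assumed IoU-aware score for token i of frame f.
    Frames are handled independently, so each frame gets its own queries.
    """
    selected = []
    for frame_scores in scores:
        # Sort token indices by score, highest first.
        order = sorted(
            range(len(frame_scores)),
            key=lambda i: frame_scores[i],
            reverse=True,
        )
        # Keep at most k indices; shorter frames return all their tokens.
        selected.append(order[:k])
    return selected


# Toy example: two frames with four encoder tokens each.
toy_scores = [
    [0.1, 0.9, 0.4, 0.7],
    [0.8, 0.2, 0.6, 0.3],
]
print(select_queries_per_frame(toy_scores, k=2))  # [[1, 3], [0, 2]]
```

In a real detector the selected indices would be used to gather the corresponding encoder embeddings as initial decoder queries; that step is omitted here because the abstract does not specify it.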
Primary Subject Area: Detection and Diagnosis
Secondary Subject Area: Application: Radiology
Paper Type: Both
Submission Number: 60