Universal and extensible language-vision models for organ segmentation and tumor detection from abdominal computed tomography
Abstract: Highlights•A universal framework adapting a single model to multiple datasets and new classes.•A language-driven parameter generator leverages embeddings from CLIP.•We develop a class-specific, lightweight head to ease the addition of new classes.•Universal Model is efficient, generalizable, transferable, and extensible.•Universal Model ranks first in MSD and BTCV competitions for medical segmentation.
Loading