Developing Generalist Foundation Models from a Multimodal Dataset for 3D Computed Tomography

Published: 28 Oct 2024, Last Modified: 06 May 2026CrossrefEveryoneRevisionsCC BY-SA 4.0
Abstract: While computer vision has achieved tremendous success with multimodal encoding and direct textual interaction with images via chat-based large language models, similar advancements in medical imaging AI—particularly in 3D imaging—have been limited due to the scarcity of comprehensive datasets. To...
Loading