FishDetectLLM: Multimodal instruction tuning with large language models for fish detection

Published: 01 Jan 2025, Last Modified: 15 May 2025Knowl. Based Syst. 2025EveryoneRevisionsBibTeXCC BY-SA 4.0
Abstract: Highlights•We present the first lightweight Multimodal Large Language Model named FishDetectLLM for fish detection.•We develop a new instruction-based conversation dataset on fish detection for instruction tuning.•FishDetectLLM shows superior performance and robust generalization.
Loading