Abstract: Highlights
• Method for eliminating the influence of shooting conditions in UAV object detection.
• Eliminate the influence of shooting conditions through language guidance.
• Pretrain vision-language model to provide a latent space for vision and language.
• Method can be used as a plug-in at training stage for various algorithms.