Multimodal Large Language Models
2025 – Present
Vision Language Models, Remote Sensing
2024 – Present
Pose Estimation, Human Motion Segmentation, Neural Machine Translation, Motion Captioning
2020 – Present
Image Classification, Object detection
2019 – 2020