from .clip_encoder import build_vision_tower