Vision language model for interpretable and fine-grained detection of safety compliance in diverse workplaces
Abstract: Highlights
• Introduce Clip2Safety for improved safety compliance in diverse workplaces.
• Integrate scene recognition, prompts, and fine-grained verification in detection models.
• Enhance real-time safety monitoring through a robust and adaptable compliance detection framework.
• Achieve improved accuracy and speed in detecting PPE compliance across six real-world scenarios.
External IDs: dblp:journals/eswa/ChenCICI25