Human Description in the Wild: Description of the Scene with Ensembles of AI Models

Vincenzo Dentamaro, Vincenzo Gattulli, Paolo Giglio, Donato Impedovo, Giuseppe Pirlo

Published: 2022, Last Modified: 10 May 2023S+SSPR 2022Readers: Everyone

Abstract: Describing an image scene in Natural Language is a very complex procedure for a machine. Many researchers have used Natural Language Processing approaches. In this paper Machine Learning and Computer Vision models will be illustrated with the purpose of describing a picture in the wild. Action Recognition models, Face Recognition with gender and age and Clothing Recognition will be performed in combination with the purpose of generating a textual sentence belonging to natural language describing the scene in the picture. The proposed technique can target multiple domains, specifically useful for preventing cyberbullying situations. In addition, an attempt will be made to exceed for each model the current SoA.

0 Replies