Abstract: Highlights•Vision and language modalities are mostly used input modalities in SLP.•The lack of a large annotated dataset is a major challenge in SLP.•The proposed works in SLP can be categorized into five categories.•One limitation in SLP is the generation of high-resolution images/videos.•A fast processing model in an uncontrolled environment is necessary for SLP.
Loading