Open Peer Review. Open Publishing. Open Access. Open Discussion. Open Directory. Open Recommendations. Open API. Open Source.
Signs in time: Encoding human motion as a temporal image
Joon Son Chung, Andrew Zisserman
Aug 09, 2016 (modified: Aug 09, 2016)ECCV2016 BNMW submissionreaders: everyone
Abstract:The goal of this work is to recognise and localise short temporal signals in image time series, where strong supervision is not available for training.
To this end we propose an image encoding that concisely represents human motion in a video sequence in a form that is suitable for learning with a ConvNet. The encoding reduces the pose information from an image to a single column, dramatically diminishing the input requirements for the network, but retaining the essential information for recognition.
The encoding is applied to the task of recognizing and localizing signed gestures in British Sign Language (BSL) videos. We demonstrate that using the proposed encoding, signs as short as 10 frames duration can be learnt from clips lasting hundreds of frames using only weak (clip level) supervision and with considerable label noise.
Enter your feedback below and we'll get back to you as soon as possible.