Coconstructions in spoken Universal Dependencies: guidelines and first results

Published: 27 May 2026, Last Modified: 27 May 2026, Venue: UniDive 2026, License: CC BY-SA 4.0
Keywords: spoken language, coconstruction, backchannel, reformulation, syntactic treebank, French, Italian, Slovenian
Working Group: WG1: Corpus annotation
WG1 Tasks: Task 1.5: Annotation of Spoken data
Abstract: The paper proposes a set of guidelines for annotating coconstructions and backchannels in the syntactic treebanks of spoken data in the Universal Dependencies collection. Two representations are proposed: a speaker-based representation that follows the segmentation into speech turns, and a dependency-based representation with dependencies across speech turns. New proposals are also put forward for distinguishing reformulations from repairs and for promoting elements in unfinished phrases.
Tracks For Type Of Contribution: Work in progress
Data Release: We authorize the release of our submission and author names to the public in the event of acceptance.
Submission Number: 7