Abstract: The text presents the basic concept, methodology and annotation structure of the Olomouc Spoken Corpus (OSC), the project that has been built at the Department of Czech Studies at Palacký University since 2002. The text deals with the main principles of the data collection, an annotation structure of spoken corpora of the Czech National Corpus as well as OSC, the transcription rules, the main stages of building of OSC and finally presents an overview of transcription symbols and structural metasymbols of OSC transcripts.
Loading