Obsah

CzeSL – a Learner Corpus of Czech

Available versions

Thousands of tokens in Annotation Metadata Access Year
non-native ethnolect 𝚺 error linguistic
essays theses Tags TH T0 T1 T2
CzeSL-plain 1,315 732 428 2,475 SD 2012
CzeSL-SGT 1,147 1,147 F K M M yes SD 2014
CzeSL-man v0, a1 134 192 326 F+G 2T M M SD 2012
CzeSL-man v0, a2 59 149 208 F+G 2T M M S 2012
CzeSL-man v1 134 134 F+G T2 M M+S yes SD 2016
CzeSL-man v2 134 134 F+G 2T M M M yes SD 2020
CzeSL-TH 180 180 2T yes D 2018
CzeSL-MD 12 12 MD T2 D 2018
CzeSL-UD 10 10 M+S D 2018
CzeSL-GEC ? ? 108 2T D 2017
AKCES-GEC 336 168 504 G 2T D 2019
CzeSL in TEITOK 299 299 F+I 2T+ M M M+S yes S 2020

CzeSL-plain

CzeSL-SGT

CzeSL-man

CzeSL-man v0
CzeSL-man v1
CzeSL-man v2

CzeSL-TH

CzeSL-MD

CzeSL-UD

CzeSL-GEC

AKCES-GEC

CzeSL in TEITOK

Tools

Bibliography

Bibliography

NEW:

Rosen, A., Hana, J., Hladká, B., Jelínek, T., Škodová, S., and Štindlová, B. (2020). Compiling and annotating a learner corpus for a morphologically rich language – CzeSL, a corpus of non-native Czech. Karolinum, Charles University Press, Praha. Print copy, e-book CU Digital Repository

Acknowledgement

This work was supported by the European Regional Development Fund project “Creativity and Adaptability as Conditions of the Success of Europe in an Interrelated World” (reg. no.: CZ.02.1.01/0.0/0.0/16_019/0000734).