Full text loading...
and Stefanie Wulff2
Abstract
This paper provides a detailed account of the Turkish Learner Corpus (TURLEC). Building on the first author’s doctoral dissertation project, which aimed to identify proficiency descriptors for four skills (listening, reading, writing, and speaking) for learners of Turkish as a second language (L2) at various CEFR levels, the main motivation to build a learner corpus is to outline the language learners actually use at different proficiency levels. With the written and spoken texts of learners of Turkish L2 at university level coming from various countries with numerous L1 backgrounds, TURLEC comprises 735 texts and approximately 97,000 tokens. After rigorous anonymisation, annotation, and error-tagging efforts, TURLEC contains ~21,000 word forms with 3,561 lemmas, which will be profiled based on the CEFR levels. TURLEC is the first learner corpus built to offer a vocabulary profile for L2 Turkish, which is an ever-growing field of study with an increasing number of students.
Article metrics loading...
Full text loading...
References
Data & Media loading...