1887
image of Introducing TURLEC
USD
Buy:$35.00 + Taxes

Abstract

This paper provides a detailed account of the Turkish Learner Corpus (TURLEC). Building on the first author’s doctoral dissertation project, which aimed to identify proficiency descriptors for four skills (listening, reading, writing, and speaking) for learners of Turkish as a second language (L2) at various CEFR levels, the main motivation to build a learner corpus is to outline the language learners actually use at different proficiency levels. With the written and spoken texts of learners of Turkish L2 at university level coming from various countries with numerous L1 backgrounds, TURLEC comprises 735 texts and approximately 97,000 tokens. After rigorous anonymisation, annotation, and error-tagging efforts, TURLEC contains ~21,000 word forms with 3,561 lemmas, which will be profiled based on the CEFR levels. TURLEC is the first learner corpus built to offer a vocabulary profile for L2 Turkish, which is an ever-growing field of study with an increasing number of students.

Loading

Article metrics loading...

/content/journals/10.1075/ijcl.24022.sav
2026-05-07
2026-06-07
Loading full text...

Full text loading...

References

  1. Abel, A., Glaznieks, A., Nicolas, L., & Stemle, E. W.
    (2014) KoKo: An L1 learner corpus for German. InN. Calzolari, K. Choukri, T. Declerck, H. Loftsson, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, & S. Piperidis (Eds) Proceedings of the ninth international conference on Language Resources and Evaluation (LREC’14) (pp.–). European Language Resources Association. https://aclanthology.org/L14-1710/
    [Google Scholar]
  2. Anthony, L.
    (2026) AntConc (Version 4.4.0) [Computer software]. Waseda University. https://www.laurenceanthony.net/software
    [Google Scholar]
  3. Biber, D., Reppen, R., Staples, S., & Egbert, J.
    (2020) Exploring the longitudinal development of grammatical complexity in the disciplinary writing of L2-English university students. International Journal of Learner Corpus Research, (), –. 10.1075/ijlcr.18007.bib
    https://doi.org/10.1075/ijlcr.18007.bib [Google Scholar]
  4. Boyd, A., Hana, J., Nicolas, L., Meurers, D., Wisniewski, K., Abel, A., Schöne, K., Štindlová, B., & Vettori, C.
    (2014) The MERLIN corpus: Learner language and the CEFR. InN. Calzolari, K. Choukri, T. Declerck, H. Loftsson, B. Maegaard, J. Mariani, A. Moreno, J. Odijk, & S. Piperidis (Eds) Proceedings of the ninth international conference on Language Resources and Evaluation (LREC’14) (pp.–). European Language Resources Association. https://aclanthology.org/L14-1488/
    [Google Scholar]
  5. Brunfaut, T., & Harding, L.
    (2020) International language proficiency standards in the local context: Interpreting the CEFR in standard setting for exam reform in Luxembourg. Assessment in Education: Principles, Policy & Practice, (), –. 10.1080/0969594X.2019.1700213
    https://doi.org/10.1080/0969594X.2019.1700213 [Google Scholar]
  6. Centre for English Corpus Linguistics (CECL)
    Centre for English Corpus Linguistics (CECL) (2023) Learner corpora around the world. Université catholique de Louvain. https://uclouvain.be/en/research-institutes/ilc/cecl/learner-corpora-around-the-world.html
    [Google Scholar]
  7. Common Language Resources and Technology Infrastructure (CLARIN)
    Common Language Resources and Technology Infrastructure (CLARIN) (2023) L2 learner corpora. Common Language Resources and Technology Infrastructure. https://www.clarin.eu/resource-families/L2-corpora
    [Google Scholar]
  8. Council of Europe (CoE)
    Council of Europe (CoE) (2020) Common European framework of reference for languages: Learning, teaching, assessment — Companion volume. Council of Europe Publishing. https://www.coe.int/lang-cefr
    [Google Scholar]
  9. Council of Higher Education (CoHE)
    Council of Higher Education (CoHE) (2023) Yükseköğretim Bilgi Yönetim Sistemi [Higher Education Information Management System]. Yükseköğretim Kurulu. https://istatistik.yok.gov.tr/
    [Google Scholar]
  10. Çanga, D.
    (2022) Word frequencies in the written texts of learners of Turkish as a foreign language at CEFR Level A. Cankaya University Journal of Humanities and Social Sciences, (), –. 10.47777/cankujhss.1153478
    https://doi.org/10.47777/cankujhss.1153478 [Google Scholar]
  11. Deygers, B.
    (2021) The CEFR companion volume: Between research-based policy and policy-based research. Applied Linguistics, (), –. 10.1093/applin/amz024
    https://doi.org/10.1093/applin/amz024 [Google Scholar]
  12. Díez-Bedmar, B.
    (2021) Error analysis. InN. Tracy-Ventura & M. Paquot (Eds), The Routledge handbook of second language acquisition and corpora (pp.–). Routledge. 10.4324/9781351137904
    https://doi.org/10.4324/9781351137904 [Google Scholar]
  13. Egbert, J., Biber, D., & Gray, B.
    (2022) Designing and evaluating language corpora: A Practical framework for corpus representativeness. Cambridge University Press. 10.1017/9781316584880
    https://doi.org/10.1017/9781316584880 [Google Scholar]
  14. Friginal, E.
    (2018) Corpus linguistics for English teachers: Tools, online resources, and classroom activities. Routledge. 10.4324/9781315649054
    https://doi.org/10.4324/9781315649054 [Google Scholar]
  15. Gablasova, D., & Bottini, R.
    (2022) Spoken learner corpora for language teaching. InR. R. Jablonkai & E. Csomay (Eds), The Routledge handbook of corpora and English language teaching and learning (pp.–). Routledge. 10.4324/9781003002901‑24
    https://doi.org/10.4324/9781003002901-24 [Google Scholar]
  16. Gilquin, G.
    (2015) From design to collection of learner corpora. InS. Granger, G. Gilquin, & F. Meunier (Eds), The Cambridge handbook of learner corpus research (pp.–). Cambridge University Press. 10.1017/CBO9781139649414.002
    https://doi.org/10.1017/CBO9781139649414.002 [Google Scholar]
  17. Glaznieks, A., Frey, J. C., Stopfner, M., Zanasi, L., & Nicolas, L.
    (2022) Leonide: A longitudinal trilingual corpus of young learners of Italian, German and English. International Journal of Learner Corpus Research, (), –. 10.1075/ijlcr.21004.gla
    https://doi.org/10.1075/ijlcr.21004.gla [Google Scholar]
  18. Golynskaia, A.
    (2022) An Error coding system for the Turkish Learner corpus. The Journal of Linguistics, , –. 10.26650/jol.2022.1216218
    https://doi.org/10.26650/jol.2022.1216218 [Google Scholar]
  19. Granger, S., Swallow, H., & Thewissen, J.
    (2022) The Louvain error tagging manual. Version 2.0. Centre for English Corpus Linguistics. https://repository.uantwerpen.be/docstore/d:irua:12456
    [Google Scholar]
  20. Kabak, B.
    (2011) Turkish vowel harmony. The Blackwell companion to phonology. Wiley. 10.1002/9781444335262.wbctp0118
    https://doi.org/10.1002/9781444335262.wbctp0118 [Google Scholar]
  21. Kilgarriff, A., Baisa, V., Bušta, J., Jakubíček, M., Kovář, V., Michelfeit, J., Rychlý, P., & Suchomel, V.
    (2014) The Sketch Engine: Ten years on. Lexicography, , –. 10.1007/s40607‑014‑0009‑9
    https://doi.org/10.1007/s40607-014-0009-9 [Google Scholar]
  22. Lüdeling, A., & Hirschmann, H.
    (2015) Error annotation systems. InS. Granger, F. Meunier, & G. Gilquin (Eds), The Cambridge handbook of learner corpus research (pp.–). Cambridge University Press. 10.1017/CBO9781139649414.007
    https://doi.org/10.1017/CBO9781139649414.007 [Google Scholar]
  23. Naismith, B., Han, N. R., & Juffs, A.
    (2022) The University of Pittsburgh English Language Institute Corpus (PELIC). International Journal of Learner Corpus Research, (), –. 10.1075/ijlcr.21002.nai
    https://doi.org/10.1075/ijlcr.21002.nai [Google Scholar]
  24. Paquot, M.
    (2022) Corpora and second language acquisition. InR. R. Jablonkai, & E. Csomay (Eds), The Routledge handbook of corpora and English language teaching and learning. (pp.–). Routledge. 10.4324/9781003002901‑4
    https://doi.org/10.4324/9781003002901-4 [Google Scholar]
  25. Polgardi, K.
    (1999) Vowel harmony and disharmony in Turkish. The Linguistic Review, (), –. 10.1515/tlir.1999.16.2.187
    https://doi.org/10.1515/tlir.1999.16.2.187 [Google Scholar]
  26. Rosen, A., Hana, J., Štindlova, B., & Feldman, A.
    (2014) Evaluating and automating the annotation of a learner corpus. Language Resources and Evaluation, (), –. 10.1007/s10579‑013‑9226‑3
    https://doi.org/10.1007/s10579-013-9226-3 [Google Scholar]
  27. Savuran, Y., & Çubukçu, Z.
    (2021) Yabancı Dil Olarak Türkçe Öğretiminde Performans Betimleyicileri Geliştirme: Temel ve Ara Düzeyler [Developing performance descriptors in teaching Turkish as a foreign language: Basic and intermediate levels]. Türk Eğitim Bilimleri Dergisi, (), –. 10.37217/tebd.876422
    https://doi.org/10.37217/tebd.876422 [Google Scholar]
  28. Savuran, Y., & Wulff, S.
    (2025) Developing a vocabulary profile for Turkish L2 based on learner corpus. 10.17605/OSF.IO/BNV3P
    https://doi.org/10.17605/OSF.IO/BNV3P [Google Scholar]
  29. Sommer-Farias, B., Novikov, A., Picoral, A., Centanin-Bertho, M., & Staples, S.
    (2022) A multilingual learner corpus for less commonly taught languages. International Journal of Learner Corpus Research, (), –. 10.1075/ijlcr.21001.som
    https://doi.org/10.1075/ijlcr.21001.som [Google Scholar]
  30. TS Corpus Project
    TS Corpus Project (2026) TS POS tagger. TS Corpus Project. https://tools.tscorpus.com/tools/postagger/
    [Google Scholar]
  31. Türkiye Maarif Vakfı (TMV)
    Türkiye Maarif Vakfı (TMV) (2026) Establishment purpose. Turkish Maarif Foundation. https://turkiyemaarif.org/page/about-us
    [Google Scholar]
  32. Türkiye Scholarships
    Türkiye Scholarships (2023) Turkish language education. Türkiye Scholarships. https://www.turkiyeburslari.gov.tr/en/page/current-students/turkish-language-education
    [Google Scholar]
  33. Yunus Emre Enstitüsü (YEI)
    Yunus Emre Enstitüsü (YEI) (2026) Türkiye Scholarship applications. Yunus Emre Enstitüsü. https://yeelondon.org.uk/event/turkiye-scholarship-applications/
    [Google Scholar]
/content/journals/10.1075/ijcl.24022.sav
Loading
/content/journals/10.1075/ijcl.24022.sav
Loading

Data & Media loading...

  • Article Type: Research Article
Keywords: learner corpus research ; CEFR levels ; L2 Turkish ; vocabulary profile
This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error