- Home
- e-Journals
- International Journal of Learner Corpus Research
- Fast Track Listing
International Journal of Learner Corpus Research - Online First
Online First articles are the published Version of Record, made available as soon as they are finalized and formatted. They are in general accessible to current subscribers, until they have been included in an issue, which is accessible to subscribers to the relevant volume
-
-
The influence of L1 Dutch on connective use in L2 German academic writing : A contrastive corpus-based analysis
Author(s): Helena Wedig, Carola Strobl, Jim J. J. Ureel, Tanja Mortelmans and Larissa WeberAvailable online: 04 November 2025More LessAbstractThe present study provides a comparative corpus-based analysis of summaries written by three groups: first-language (L1) German writers, second-language (L2) German writers with L1 Dutch, and L2 German writers with other L1s. The aim is to determine whether there are differences in connective use between L1 and L2 writers in summary writing and whether there are L1 Dutch-specific differences. The results show that L2 German writers with non-Dutch L1s use fewer connectives than L1 German writers, whereas L2 German writers with L1 Dutch use more connectives, especially expansion and contingency connectives. In addition, L2 German writers prefer certain connectives (e.g., und (and), weil (because)) and L2 German writers with L1 Dutch aber (but). Overall, this study highlights the importance of (contrastively) analysing summary writing as well as considering under-researched language pairs such as German and Dutch.
-
-
-
Automatic discourse segmentation of L1 and L2 spoken English transcripts
Author(s): Linsey C. Yang, Wenwei Dong, Nathan Vandeweerd and Jet HoekAvailable online: 07 October 2025More LessAbstractNatural language processing (NLP) tools, primarily trained on L1 written English, have achieved remarkable performance, but are rarely used in L2 learner data. This study leverages a rule-based segmenter to automatically segment spoken English discourse by both L1 speakers and learners, presenting novel preparatory data-cleaning steps that combine a state-of-the-art disfluency detector and additional rules to improve segmentation performance. In three successive segmentation tests on data from the Louvain Corpus of Native English Conversation (LOCNEC; De Cock, 2004) and the Louvain International Database of Spoken English Interlanguage (LINDSEI; Gilquin et al. 2010), we achieve an enhanced segmentation performance that is similar for both the L1 and L2 data (.84). Our approach highlights the effectiveness of leveraging existing NLP tools to process disfluent L2 spoken transcripts, facilitating automatic discourse analysis in Learner Corpus Research (LCR). The code for executing our pipeline is publicly available for future research.
-
-
-
SEEFLEX : The Corpus of Secondary English as a Foreign Language (EFL) Exams
Author(s): Tobias PaulsAvailable online: 21 August 2025More LessAbstractThis report presents the Corpus of Secondary School English as a Foreign Language (EFL) Exams (SEEFLEX). In Germany, upper secondary school EFL exams feature recurring tasks targeting diverse text types. The SEEFLEX was developed to investigate how students complete these tasks linguistically and whether they meet the curricular requirements. The corpus contains data from 575 transcribed authentic curriculum-based examinations (1,979 texts, ~625.000 words). The metadata include standardized receptive vocabulary assessments, a cognition scale, the participants’ reading habits, social background, and their language experience and proficiency. Extensive xml mark-up was added to investigate the influence of inter alia source material, structural text features, and selected language mistakes. An online repository provides full-text access as well as ample additional resources, including an interactive Shiny application to investigate register variation in the corpus.
-
Most Read This Month Most Read RSS feed
-
-
The Trinity Lancaster Corpus
Author(s): Dana Gablasova, Vaclav Brezina and Tony McEnery
-
- More Less