- Home
- e-Journals
- International Journal of Corpus Linguistics
- Fast Track Listing
International Journal of Corpus Linguistics - Online First
Online First articles are the published Version of Record, made available as soon as they are finalized and formatted. They are in general accessible to current subscribers, until they have been included in an issue, which is accessible to subscribers to the relevant volume
-
-
Measuring divergence in migration-related terminology between EU legal discourse and press articles in English and French
Author(s): Edward ClayAvailable online: 09 February 2026More LessAbstractThis paper sets out a bilingual (English and French) corpus-based approach to quantify divergence in terminology between legal discourse and news articles, triangulating a series of complementary indicators of frequency difference, predominant terms and absent terms. This methodology is then applied to purpose-built corpora consisting of EU legal discourse and newspaper articles on the subject of migration in English and French, illustrating the relevance of an approach to measuring shifts in terminological distance. The results of such a study can provide insights into the level of comprehensibility of legal discourse, which is fundamental to ensuring access to justice. This context makes it vitally important to develop such a methodology, which empirically measures whether the terminology used in EU legal discourse is continuing to diverge from language used in non-specialist settings.
-
-
-
Dimensions of variation across institutional legal and administrative registers : An MDA analysis of the Polish Eurolect and the national variety
Author(s): Łucja Biel, Katarzyna Wasilewska and Dariusz KoźbiałAvailable online: 19 December 2025More LessAbstractThis study applies full Multidimensional Analysis (MDA) to examine linguistic variation in the Polish Eurolect — a hybrid variety shaped by translation and institutional constraints within the European Union — by comparing it to the national variety. Using a corpus of key institutional registers (legal acts, judgments, administrative reports, and citizen-oriented websites), we identify four dimensions of variation: Argumentative vs Informational, Engaged Instruction vs Distanced Authority, Prescriptive vs Narrative, and Lexical Richness. The findings reveal notable differences between how supranational and national institutions communicate. EU legal acts and judgments show greater prescriptiveness, legal referencing, and argumentative structuring compared to their Polish counterparts. EU websites have less engagement and explanatory strategies while EU reports favour a less distanced style. The findings map variation and group institutional registers, thereby visualizing similarities and differences between supranational and national institutional communication.
-
-
-
Is human translation more conservative than machine translation? : A corpus-based study measuring formality across translation varieties and registers
Author(s): Jia Li and Xianyao HuAvailable online: 14 November 2025More LessAbstractThe present study investigates whether conservatism exists in human- and machine-translated texts from Chinese into English, and whether this tendency is consistently observable across different registers and multiple lexico-grammatical features by applying profile-based correspondence analysis and mixed-effects logistic regression modelling. The results reveal that human translation is characterised by a higher level of conservatism than both machine translation and original writing, irrespective of registers and lexico-grammatical features. In contrast, machine translation tends to be more conservative compared to non-translations only in journalistic and fictional texts, and the degree of conservatism varies across machine translation platforms. These findings suggest that human translators are more risk avoidant than original writers are, providing strong support for the risk aversion hypothesis. Moreover, the lack of understanding of translation norms or standards in machine translation, as well as the linguistic distinctions from human translation, implies the immense potential of future human-machine collaborative translation models.
-
-
-
From theory to data : Testing introspective claims on synonymous French adjectives ‘prochain’ and ‘suivant’ using corpus-based methods
Author(s): Jarvis Looi, Patricia Nora Riget, Alex Boulton and Roshidah HassanAvailable online: 31 October 2025More LessAbstractThis paper presents a corpus-based study that evaluates variables identified introspectively by Berthonneau (2002) in relation to the alternation between two French synonymous: prochain (‘next’ or ‘upcoming’) and suivant (‘following’ or ‘next’). Employing a multifactorial and behavioural profiles approach, we explore the asymmetry in their temporal and spatial applications. Our findings highlight distinct uses with temporal nouns, whereas uses with material nouns overlap. Binomial logistic regression and multiple correspondence analysis reveal a general divide between the deictic use of prochain and the anaphoric use of suivant, corroborating Berthonneau’s descriptions. Additionally, event predictability, which Berthonneau addressed only fleetingly, significantly influences form choice, with prochain often associated with general, eventual contexts and suivant with specific, predictable events. This study contributes to the understanding of how corpus-based methods can refine existing linguistic hypotheses by illuminating the intersections and divergences between empirical and introspective research.
-
Most Read This Month Most Read RSS feed
-
-
The Spoken BNC2014
Author(s): Robbie Love, Claire Dembry, Andrew Hardie, Vaclav Brezina and Tony McEnery
-
- More Less