- Home
- e-Journals
- International Journal of Corpus Linguistics
- Previous Issues
- Volume 26, Issue 2, 2021
International Journal of Corpus Linguistics - Volume 26, Issue 2, 2021
Volume 26, Issue 2, 2021
-
Beyond base and collocate
Author(s): Pascual Cantos and Moisés Almela-Sánchezpp.: 161–186 (26)More LessAbstractSupport verb constructions figure among the most frequently investigated topics in the literature on collocation. So far, most studies of this kind have focused on bipartite structures, consisting of a verbal collocate and a nominal base. Accordingly, the analysis of how support verbs are distributed has concentrated almost exclusively on the lexical control exerted by the base. In this article, we draw attention towards the influence exerted by the participation of verb and noun in more complex patterns of lexical co-occurrence. We contend that the distribution of the support verb collocate is contingent not only on the base noun but also on other elements of the lexical context. This highlights the need to enrich the theoretical framework of collocation analysis with the additional descriptive category of ‘second-order collocate’. The proposal is illustrated with two case studies using a large-scale web corpus of English.
-
Subcategorization frame identification for learner English
Author(s): Yan Huang, Akira Murakami, Theodora Alexopoulou and Anna Korhonenpp.: 187–218 (32)More LessAbstractAs large-scale learner corpora become increasingly available, it is vital that natural language processing (NLP) technology is developed to provide rich linguistic annotations necessary for second language (L2) research. We present a system for automatically analyzing subcategorization frames (SCFs) for learner English. SCFs link lexis with morphosyntax, shedding light on the interplay between lexical and structural information in learner language. Meanwhile, SCFs are crucial to the study of a wide range of phenomena including individual verbs, verb classes and varying syntactic structures. To illustrate the usefulness of our system for learner corpus research and second language acquisition (SLA), we investigate how L2 learners diversify their use of SCFs in text and how this diversity changes with L2 proficiency.
-
Shell nouns as register-specific discourse devices
Author(s): Alex Chengyu Fang and Min Dongpp.: 219–247 (29)More LessAbstractThis article provides a corpus-based investigation into shell nouns. Shell nouns perform a variety of referential functions and express speaker stance. The investigation was motivated by the fact that past research in this area has been primarily based on written texts. Very little is known about the use of shell nouns in speech. The study used the ICE-GB corpus of contemporary British English and investigated cataphoric shell nouns complemented by appositive that-clauses across fine-grained spoken and written registers. It has revealed that the deployment of shell nouns is governed by the principle of register formality definable in terms of contextual configurations of the Field-Tenor-Mode complex rather than the mode of production. Additionally, the study has uncovered the frequent use of a small core set of shell nouns common across speech and writing. Hence it argues that shell nouns are part and parcel of spoken and written discourse and that they pertain more to grammar than to lexis.
-
Two subjunctives or three?
Author(s): Gustavo Guajardopp.: 248–283 (36)More LessAbstractThis paper examines the use of the three non-periphrastic subjunctives in Spanish in embedded clauses under obligatory subjunctive predicates in the past tense in three Spanish varieties: Argentinean, Mexican and Peninsular Spanish. By means of random forest and logistic regression analyses, I demonstrate that a grammar where the two “past” subjunctives make up one group, such that the variation can be modeled on a binary opposition between (morphologically) past vs. (morphologically) present, achieves better prediction accuracy and goodness-of-fit parameters than a grammar with a three-way split. The results suggest that, at least in complement clauses of obligatory subjunctive predicates, there appear to be no semantic differences between the two past subjunctives but there are still relatively large differences in how the three subjunctive forms are used across the three Spanish varieties studied.1
-
Concordance line sorting in The Prime Machine
Author(s): Stephen Jeacopp.: 284–297 (14)More LessAbstractCorpus data provide evidence of the patterning of language, and one way word usage can be analysed is through the study of concordance lines. While popular concordancers provide different sorting methods, they are typically only able to display lines in the order in which they occur in the corpus, randomly, or alphabetically by words in slots to the left or right of the word of interest. Less sophisticated users may find recognising patterns from these orderings quite challenging. This paper considers possible needs of language learners in terms of concordance ranking and introduces two methods which have been adopted and developed for The Prime Machine. The first method uses repeated patterns, measuring the number of matches made with other lines in the set. The second method incorporates collocation scores, providing examples with strong collocations from the entire corpus at the top of sampled concordance lines.
-
Review of Wołk (2019): Machine Learning in Translation Corpora Processing
Author(s): Xiangtao Du and Kanglong Liupp.: 298–303 (6)More LessThis article reviews Machine Learning in Translation Corpora Processing
Volumes & issues
-
Volume 29 (2024)
-
Volume 28 (2023)
-
Volume 27 (2022)
-
Volume 26 (2021)
-
Volume 25 (2020)
-
Volume 24 (2019)
-
Volume 23 (2018)
-
Volume 22 (2017)
-
Volume 21 (2016)
-
Volume 20 (2015)
-
Volume 19 (2014)
-
Volume 18 (2013)
-
Volume 17 (2012)
-
Volume 16 (2011)
-
Volume 15 (2010)
-
Volume 14 (2009)
-
Volume 13 (2008)
-
Volume 12 (2007)
-
Volume 11 (2006)
-
Volume 10 (2005)
-
Volume 9 (2004)
-
Volume 8 (2003)
-
Volume 7 (2002)
-
Volume 6 (2001)
-
Volume 5 (2000)
-
Volume 4 (1999)
-
Volume 3 (1998)
-
Volume 2 (1997)
-
Volume 1 (1996)
Most Read This Month

-
-
The Spoken BNC2014
Author(s): Robbie Love, Claire Dembry, Andrew Hardie, Vaclav Brezina and Tony McEnery
-
- More Less