- Home
- Book Series
- [, Studies in Corpus Linguistics]
[, Studies in Corpus Linguistics]
<p>SCL focuses on the use of corpora throughout language study, the development of a quantitative approach to linguistics, the design and use of new tools for processing language texts, and the theoretical implications of a data-rich discipline. </p>
61 - 80 of 127 results
-
-
Exploring Language and Society with Big Data
Editor(s): Minna Korhonen, Haidee Kotze and Jukka TyrkköPublication Date November 2023show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:As the legislative bodies of democratic nations, parliaments play a fundamental role in society. Consequently the linguistic practices observed in parliamentary discourse are of importance to everyone. This volume brings together leading researchers in areas of corpus linguistics, big data, parliamentary discourse, and historical linguistics in a truly interdisciplinary exploration at the vanguard of big data and corpus methods with the aim to investigate the intersection between linguistic and social change. Making use of both quantitative and qualitative methods, the studies included in this volume range from a focus on explicitly linguistic phenomena to topics that contribute to our understanding of language and society more generally. It breaks new ground in its critical reflection on the conceptual and methodological challenges of using large corpora of parliamentary discourse to study both the specialised language of parliamentary speech and the societies that the parliaments in question represent and govern.
-
-
-
Exploring Newspaper Language
Editor(s): Gisle AndersenPublication Date March 2012show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:This book describes new methodological and technological approaches to corpus building and presents recent research based on the Norwegian Newspaper Corpus. This is a large monitor corpus of contemporary Norwegian language, compiled through daily harvesting of web newspapers. The book gives an overview of the corpus and its system architecture, and presents tools used for tasks such as text harvesting, annotation, topic classification and extraction and frequency profiling of new words and phrases. Among the innovative technologies is Corpuscle, a corpus query engine and management system which is flexible enough to handle very large corpora in an efficient way. The individual research contributions based on the corpus explore different aspects of Norwegian, including the occurrence of anglicisms, neologisms and terminology, and the use of metonymy and metaphor in newspaper language. The book also describes an innovative method of applying correspondence analysis and implicational analysis to investigate interdependencies between morphosyntactic variants.
-
-
-
Exploring Second-Language Varieties of English and Learner Englishes
Editor(s): Joybrato Mukherjee and Marianne HundtPublication Date May 2011show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:The articles in this volume are intended to bridge what Sridhar and Sridhar (1986) have called the 'paradigm gap' between traditional SLA research on the one hand and research into institutionalised second-language varieties in former colonial territories on the other. Since both learner Englishes and second-language varieties are typically non-native forms of English that emerge in language contact situations, it is high time that they are described and compared on an empirical basis in order to draw conceptual and theoretical conclusions with regard to their form, function and acquisition. The present collection of articles places special emphasis on empirical evidence obtained from large-scale analyses of computerised corpora of learner Englishes (such as the International Corpus of Learner English) and of second-language varieties of English (such as the International Corpus of English). It addresses questions such as ‘Are the phenomena we find in ESL and EFL varieties features or errors?’ or ‘How common and wide-spread are features across contact varieties of English?’
-
-
-
Exploring the Lexis–Grammar Interface
Editor(s): Ute Römer and Rainer SchulzePublication Date March 2009show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:This volume showcases studies that recognize and provide evidence for the inseparability of lexis and grammar. The contributors explore in what ways these two areas, often treated separately in linguistic theory and description, form an organic whole. The papers in Section I (Setting the Scene) introduce some of the key methodological approaches and theoretical positions at the lexis-grammar interface, while Section II (Considering the Particulars) contains papers that report on case studies and show concrete applications of the central methods and theories. Exploring the Lexis-Grammar Interface is a stimulating collection of papers for anyone who wishes to learn more about and get fresh state-of-the-art perspectives on language patterning.
-
-
-
Fluency in Native and Nonnative English Speech
Author(s): Sandra GötzPublication Date March 2013show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:This book takes a new and holistic approach to fluency in English speech and differentiates between productive, perceptive, and nonverbal fluency. The in-depth corpus-based description of productive fluency points out major differences of how fluency is established in native and nonnative speech. It also reveals areas in which even highly advanced learners of English still deviate strongly from the native target norm and in which they have already approximated to it. Based on these findings, selected learners are subjected to native speakers' ratings of seven perceptive fluency variables in order to test which variables are most responsible for a perception of oral proficiency on the sides of the listeners. Finally, language-pedagogical implications derived from these findings for the improvement of fluency in learner language are presented. This book is conceptually and methodologically relevant for corpus-linguistics, learner corpus research and foreign language teaching and learning.
-
-
-
Frequency, Dispersion, Association, and Keyness
Author(s): Stefan Th. GriesPublication Date July 2024show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:This book is an attempt to revisit the main specifically corpus-linguistic statistics/measures the field has been relying on for decades: frequency, dispersion, association, and keyness. The book first discusses the purpose of these measures and how they have been measured. Then, the book makes three main proposals: First, that many measures of dispersion, association, and keyness are too confounded with frequency and how to 'take frequency out of them' to obtain conceptually cleaner and more interpretable measures. Second, that many existing measures can be replaced by the simple information-theoretic measure of the Kullback-Leibler divergence and that it, too, can have frequency 'removed' from it. Third, that corpus linguistics should abandon the tradition of trying to describe its findings with a single number and adopt a tupleization approach instead, where we use several separate dimensions of information for description and interpretation. The book is written in an informal, hands-on style and comes with its own R package featuring functions, example data, and several thousand lines of code exemplifying all applications.
-
-
-
Grammatical Change in English World-Wide
Editor(s): Peter CollinsPublication Date February 2015show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:The contributions to this volume apply and extend the techniques of corpus linguistics and diachronic linguistics to the challenge of describing and explaining grammatical change in varieties of English world-wide. The book is divided into two parts, with ten chapters on ‘Inner Circle’ varieties such as Australian, Canadian, and Irish English, and eight on ‘Outer Circle’ varieties such as Philippine, Indian, and Nigerian English. Contributors examine a range of topics including the progressive aspect, modal auxiliaries, do-support, verb morphology, and quotatives, using a wide variety of corpus resources. Overarching research questions addressed include the following: Do diachronic tendencies observed in a particular variety converge with, diverge from, or run in parallel with, those in the parent variety? What are the possible causes of changes observed (e.g. English teaching traditions, Americanisation, internal changes in registers)? This book will appeal to linguists, particularly those interested in grammatical description, corpus linguistics and World Englishes.
-
-
-
How to Do Corpus Pragmatics on Pragmatically Annotated Data
Author(s): Martin WeisserPublication Date April 2018show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:This book introduces a methodology and research tool (DART) that make it possible to carry out advanced corpus pragmatics research using dialogue corpora enriched with pragmatics-relevant annotations. It first explores the general use of spoken corpora for pragmatics research, as well as issues revolving around their representation and annotation, and then goes on to describe the resources required for such an annotation process. Based on data from three different corpora, ranging from highly constrained, task-oriented, ones (SPAADIA Trainline & Trains 93) to unconstrained dialogues (Switchboard), it next presents an in-depth discussion and illustration of the potential contributions of syntax, semantics, and semantico-pragmatics towards pragmatic force. This is followed by a description of the largely automatic annotation process itself, and finally an analysis of how a set of more than 110 potential speech acts defined in DART contributes towards establishing the specific communicative characteristics of the three corpora.
-
-
-
How to Use Corpora in Language Teaching
Editor(s): John McH. SinclairPublication Date April 2004show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:After decades of being overlooked, corpus evidence is becoming an important component of the teaching and learning of languages. Above all, the profession needs guidance in the practicalities of using corpora, interpreting the results and applying them to the problems and opportunities of the classroom. This book is intensely practical, written mainly by a new generation of language teachers who are acknowledged experts in central aspects of the discipline. It offers advice on what to do in the classroom, how to cope with teachers' queries about language, what corpora to use including learner corpora and spoken corpora and how to handle the variability of language; it reports on some current research and explains how the access software is constructed, including an opportunity for the practitioner to write small but useful programs; and it takes a look into the future of corpora in language teaching.
-
-
-
The Idiom Principle and L1 Influence
Author(s): Ying WangPublication Date October 2016show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:This book examines delexical verb + noun collocations such as make a decision, give rise to and take care of in Swedish and Chinese learner English. Using a methodological framework that combines learner corpus research with a contrastive perspective, the study is one of the very few in the field to incorporate corpora of the learner’s L1 to investigate the effects of L1 influence. The book provides a highly detailed and multi-faceted analysis of delexical verb + noun collocations in terms of frequency of occurrence, lexical preferences and morphosyntactic patterns. Quantitative and qualitative results on overuse, underuse and errors are presented with linguistically and pedagogically relevant interpretations that include cultural and discourse aspects. More importantly, the book throws light on how L2 learners may alternate between the open-choice principle and the idiom principle as well as the extent and nature of L1 influence on their collocational use.
-
-
-
In Search of Basic Units of Spoken Language
Editor(s): Shlomo Izre'el, Heliana Mello, Alessandro Panunzi and Tommaso RasoPublication Date June 2020show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:What is the best way to analyze spontaneous spoken language? In their search for the basic units of spoken language the authors of this volume opt for a corpus-driven approach. They share a strong conviction that prosodic structure is essential for the study of spoken discourse and each bring their own theoretical and practical experience to the table. In the first part of the book they segment spoken material from a range of different languages (Russian, Hebrew, Central Pomo (an indigenous language from California), French, Japanese, Italian, and Brazilian Portuguese). In the second part of the book each author analyzes the same two spoken English samples, but looking at them from different perspectives, using different methods of analysis as reflected in their respective analyses in Part I. This approach allows for common tendencies of segmentation to emerge, both prosodic and segmental.
-
-
-
Investigating Wikipedia
Editor(s): Céline Poudat, Harald Lüngen and Laura HerzbergPublication Date November 2024show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:The present volume is intended as a reference book on Wikipedia corpus studies, from corpus construction to exploration and analysis. Wikipedia is a complex object, difficult to manipulate for linguists and corpus researchers. In addition to the encyclopedic articles consulted by millions of users, it contains vast spaces of written discussions, aka talk pages, where Wikipedia authors negotiate the collaborative editing of articles, make evaluations, or discuss related topics. The proposed volume covers Wikipedia articles, their revision histories, and discussions, with a focus on discussions, which have not been studied extensively so far and have also been neglected in previous corpus building efforts. Wikipedia discussions are instances of computer-mediated communication (CMC), thus constituting a completely different, interaction-oriented linguistic genre. Sophisticated tools and methods of linguistic annotation and corpus exploration are needed to exploit the huge and valuable corpus resources that can be constructed from the Wikipedia discussions. The present volume aims at encouraging and facilitating Wikipedia corpus studies, providing standards, recommendations, and innovative methods to build and explore Wikipedia corpora, and presenting corpus studies that make the most of the peculiarities of Wikipedia.
-
-
-
The (Ir)reversibility of English Binomials
Author(s): Sandra MollinPublication Date September 2014show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:This book focuses on binomials (word pairs such as heart and soul, rich and poor, or if and when), and in particular on the degree of reversibility that English binomials demonstrate. Detailed and innovative corpus linguistic analyses investigate the correlates of the degree of reversibility, linguistic constraints that influence the ordering and reversibility of binomials and the diachronic development of reversibility. In addition, judgment data are analyzed for their convergence and divergence with corpus data regarding degrees of reversibility. The book thus establishes reversibility as a complex characteristic of the binomial construction, at the same time throwing light on general questions in phraseology, lexicalization, language structure and language processing.
-
-
-
Keyness in Texts
Editor(s): Marina Bondi and Mike ScottPublication Date November 2010show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:This is corpus linguistics with a text linguistic focus. The volume concerns lexical inequality, the fact that some words and phrases share the quality of being key – and thereby reflect or promote important themes – in some textual contexts, while others do not. The patterning of words which differ in their centrality to text meaning is of increasing interest to corpus linguistics. At the same time software resources are yielding increasingly more detailed ways of identifying and studying the linkages between key words and phrases in text databases. This volume brings together work from some of the leading researchers in this field. It presents thirteen studies organized in three sections, the first containing a series of studies exploring the nature of keyness itself, then a set of five studies looking at keyness in specific discourse contexts, and then three studies with an educational focus.
-
-
-
Korean English
Author(s): Glenn HadikinPublication Date July 2014show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:The English language is changing every day and it is us – the individual speakers and writers – that drive those changes in small ways by choosing to use certain strings of words over others. This book discusses and describes some of the choices made by speakers from South Korea by examining the similarities and differences between two Korean communities: one in England and one in South Korea. The book has two overall aims. Firstly, it is intended to begin a discussion about phraseology and Lexical Priming and how these theoretical concepts relate and play out in the context of a New English. Secondly, it provides a model of how a language variety can be explored by detailed analysis of short strings. It delves into a range of areas from World Englishes to phraseology and formulaic language and would be suitable for students, teachers and researchers in all these areas.
-
-
-
Language Acquisition in CLIL and Non-CLIL Settings
Author(s): Verena MöllerPublication Date December 2017show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:Language Acquisition in CLIL and Non-CLIL Settings builds a bridge between Second Language Acquisition and Learner Corpus Research (LCR) methodologies to take the evaluation of Content and Language Integrated Learning (CLIL) to a new level. The study innovates in two main ways. First, it is based on a highly diversified L2 database which includes learner corpus data as well as experimental data from the same learners. These linguistic components of the database are complemented with extensive information on learner variables, including cognitive and affective factors, which are rarely studied in LCR. Second, the study relies on multifactorial statistical analyses to assess the effectiveness of CLIL itself as well as the impact of the selectivity inherent in the CLIL system, which has frequently been ignored. The linguistic focus of the study is the English passive, which is investigated in CLIL and non-CLIL teaching materials, and subsequently related to learner output.
-
-
-
Language and Characterisation in Television Series
Author(s): Monika BednarekPublication Date March 2023show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:This book explores how language is used to create characters in fictional television series. To do so, it draws on multiple case studies from the United States and Australia. Brought together in this book for the first time, these case studies constitute more than the sum of their parts. They highlight different aspects of televisual characterisation and showcase the use of different data, methods, and approaches in its analysis. Uniquely, the book takes a mixed-method approach and will thus not only appeal to corpus linguists but also researchers in sociolinguistics, stylistics, and pragmatics. All corpus linguistic techniques are clearly introduced and explained, and the book is thus accessible to both experienced researchers as well as novice researchers and students. It will be essential reading in linguistics, literature, stylistics, and media/television studies.
-
-
-
The Language of Outsourced Call Centers
Author(s): Eric FriginalPublication Date February 2009show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:The Language of Outsourced Call Centers is the first book to explore a large-scale corpus representing the typical kinds of interactions and communicative tasks in outsourced call centers located in the Philippines and serving American customers. The specific goals of this book are to conduct a corpus-based register comparison between outsourced call center interactions, face-to-face American conversations, and spontaneous telephone exchanges; and to study the dynamics of cross-cultural communication between Filipino call center agents and American callers, as well as other demographic groups of participants in outsourced call center transactions, e.g., gender of speakers, agents’ experience and performance, and types of transactional tasks. The research design relies on a number of analytical approaches, including corpus linguistics and discourse analysis, and combines quantitative and qualitative examination of linguistic data in the investigation of the frequency distribution and functional characteristics of a range of lexico/syntactic features of outsourced call center discourse.
-
-
-
Language Periphery
Publication Date March 2016show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:A full-length study of monocollocable words, i.e. words whose usage is severely restricted to one or a few combinations only (such as English ado in without much/further ado), that brings together corpus-based data from the four languages along with studies analysing, along both general and language-specific lines, monocollocable words in terms of their frequency, lexical as well as morphosyntactic behaviour, and various facets of their peripheral status. Each of the four langauges covered, namely, English, Italian, German and Czech also offers a short introduction of the respective languages written in English, Italian, German and Czech. A rare contribution to our knowledge of an as yet little studied field, the book will attract the attention of, and stimulate a new interest in, all who are ready to acknowledge that collocation is a core phenomenon of language – lexicologists, lexicographers with a focus on phraseology, language typologists, linguists with a contrastive and historical agenda, and language teachers alike.
-
-
-
Learner Corpora and Language Teaching
Editor(s): Sandra Götz and Joybrato MukherjeePublication Date May 2019show More to view fulltext, buy and share links for: show Less to hide fulltext, buy and share links for:While native corpora and corpus linguistic tools and methods have been used and applied for quite some time in the development of learning and teaching materials, learner corpora are only just beginning to impact the field of language teaching, testing and assessment. This volume helps to close this still existing gap and highlights the great potential of learner corpus research for language pedagogy by presenting a selection of 11 original studies on learner corpora, conducted by established experts as well as by excellent young researchers. The papers included in the volume present new corpora and methods; studies on written as well as spoken learner corpora and on using data-driven learning scenarios in the classroom.
All papers include sections on practical and concrete language-pedagogical applications. This volume will be of significant interest to researchers working in corpus linguistics, learner corpus research, second language acquisition and English for Academic and Specific Purposes, as well to language teachers and materials developers.
-



















