Full text loading...
GBP
-
Multi-dimensional register classification using bigrams
-
View Affiliations Hide Affiliations
- Source: International Journal of Corpus Linguistics, Volume 12, Issue 4, Jan 2007, p. 453 - 478
-
- Previous Article
- Table of Contents
- Next Article
Abstract
A corpus linguistic analysis investigated register classification using frequency of bigrams in nine spoken and two written corpora. Four dimensions emerged from a factor analysis using bigram frequencies shared across corpora: (1) Scripted vs. Unscripted Discourse, (2) Deliberate vs. Unplanned Discourse, (3) Spatial vs. Non-Spatial Discourse, and (4) Directional vs. Non-Directional Discourse. These findings were replicated in a second analysis. Both analyses demonstrate the strength of bigrams for classifying spoken and written registers, especially in locating distinct collocations among spoken corpora, as well as revealing syntactic and discourse features through a data-driven approach.
© 2007 John Benjamins Publishing Company