Volume 21, Issue 4
  • ISSN 1384-6655
  • E-ISSN: 1569-9811


This paper explores the effectiveness of Juilland’s D as a measure of vocabulary dispersion in large corpora. Through a series of experiments using the BNC, we explored the influence of three variables: the number of corpus-parts used for the computation of D, the frequency of the target words, and the distributions of those words. The experiments demonstrate that the effective range for D is greatly reduced when computations are based on a large number of corpus-parts: even words with highly skewed distributions have D values indicating a relatively uniform distribution. We also briefly explore an alternative measure, Gries’ DP (Gries 2008), showing that it is a more reliable and effective measure of dispersion in a large corpus divided into many parts. In conclusion, we discuss the implications of these findings for quantitative methods applied to the creation of vocabulary lists, as well as for research questions in other areas of corpus linguistics.
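The effect described above can be illustrated with a small sketch. It assumes the commonly cited formulations, which are not spelled out in this abstract: Juilland's measure as 1 − V/√(n − 1), where V is the coefficient of variation of the per-part frequencies over n equally sized parts, and Gries' measure as half the summed absolute difference between observed and expected proportions per part. The function names and the frequency data are hypothetical and chosen only for illustration; they are not results from the BNC experiments.

```python
import math


def juilland_d(freqs):
    """Juilland's dispersion measure, assuming equally sized corpus-parts.

    freqs: occurrences of the target word in each corpus-part.
    Values near 1 suggest an even distribution; values near 0 suggest skew.
    """
    n = len(freqs)
    if n < 2:
        raise ValueError("at least two corpus-parts are required")
    mean = sum(freqs) / n
    if mean == 0:
        raise ValueError("the word does not occur in the corpus")
    # Population standard deviation of the per-part frequencies.
    sd = math.sqrt(sum((f - mean) ** 2 for f in freqs) / n)
    return 1 - (sd / mean) / math.sqrt(n - 1)


def gries_dp(freqs, part_sizes=None):
    """Gries' deviation of proportions (unnormalised).

    Values near 0 suggest an even distribution; values near 1 suggest skew.
    part_sizes defaults to equally sized parts (an assumption of this sketch).
    """
    if part_sizes is None:
        part_sizes = [1] * len(freqs)
    total = sum(freqs)
    size_total = sum(part_sizes)
    return 0.5 * sum(
        abs(f / total - s / size_total) for f, s in zip(freqs, part_sizes)
    )


# Hypothetical word with 1,000 occurrences confined to 10% of the corpus.
few_parts = [1000] + [0] * 9          # corpus split into 10 parts
many_parts = [10] * 100 + [0] * 900   # corpus split into 1,000 parts

for label, freqs in (("10 parts", few_parts), ("1,000 parts", many_parts)):
    print(label, round(juilland_d(freqs), 3), round(gries_dp(freqs), 3))
```

Under these assumptions, the word is absent from 90% of the parts in both splits, yet the first measure moves from 0 with 10 parts to roughly 0.9 with 1,000 parts, while the second stays at 0.9 in both cases.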




  1. Baker, P., & Egbert, J. (Eds.) (2016) Triangulating Methodological Approaches in Corpus-linguistic Research. New York, NY: Routledge.
  2. Biber, D. (2012) Register as a predictor of linguistic variation. Corpus Linguistics and Linguistic Theory, 8(1), 9–37. https://doi.org/10.1515/cllt-2012-0002
  3. Biber, D., Egbert, J., Gray, B., Oppliger, R., & Szmrecsanyi, B. (Forthcoming). Variationist versus text-linguistic approaches to grammatical change in English: Nominal modifiers of head nouns. In M. Kytö & P. Pahta (Eds.) Cambridge Handbook of English Historical Linguistics. Cambridge: Cambridge University Press.
  4. Brezina, V., & Gablasova, D. (2015) Is there a core general vocabulary? Introducing the New General Service List. Applied Linguistics, 36(1), 1–22. https://doi.org/10.1093/applin/amt018
  5. Coxhead, A. (2000) A new academic word list. TESOL Quarterly, 34(2), 213–238. https://doi.org/10.2307/3587951
  6. Davies, M., & Gardner, D. (2010) A Frequency Dictionary of Contemporary American English: Word Sketches, Collocates, and Thematic Lists. London: Routledge.
  7. Evert, S. (2004) The statistics of word co-occurrences: Word pairs and collocations (Unpublished doctoral dissertation). University of Stuttgart, Germany. Retrieved from elib.uni-stuttgart.de/bitstream/11682/2573/1/Evert2005phd.pdf (last accessed September 2016).
  8. Gardner, D., & Davies, M. (2014) A new academic vocabulary list. Applied Linguistics, 34(5), 1–24.
  9. Gries, S. Th. (2008) Dispersions and adjusted frequencies in corpora. International Journal of Corpus Linguistics, 13(4), 403–437. https://doi.org/10.1075/ijcl.13.4.02gri
  10. Juilland, A.G., Brodin, D.R., & Davidovitch, C. (1970) Frequency Dictionary of French Words. The Hague: Mouton de Gruyter.
  11. Juilland, A., & Chang-Rodriguez, E. (1964) Frequency Dictionary of Spanish Words. The Hague: Mouton de Gruyter.
  12. Lyne, A. (1985) The Vocabulary of French Business Correspondence. Geneva: Slatkine.
  13. Leech, G., Rayson, P., & Wilson, A. (2001) Word Frequencies in Written and Spoken English: Based on the British National Corpus. London: Longman.
  14. Martin, J.D., & Gray, L.N. (1971) Measurement of relative variation: Sociological examples. American Sociological Review, 36(3), 496–502. https://doi.org/10.2307/2093089
  15. Oakes, M. (1998) Statistics for Corpus Linguistics. Edinburgh: Edinburgh University Press.