Volume 25, Issue 3
  • ISSN 1384-6655
  • E-ISSN: 1569-9811



The coronavirus pandemic may be the largest crisis the world has faced since World War II. It comes as no surprise that it is also affecting language, our primary communication tool. In this short paper, we present three interconnected resources designed to capture and illustrate these effects on a subset of the German language: an RSS corpus of German-language newsfeeds (with freely available untruncated frequency lists), a continuously updated HTML page tracking the diversity of the vocabulary in the RSS corpus, and a web application that enables other researchers and the broader public to explore the corpus in terms of basic frequencies.




