Volume 25, Issue 3
  • ISSN: 1384-6655
  • E-ISSN: 1569-9811

Abstract

The coronavirus pandemic may be the largest crisis the world has had to face since World War II. It comes as no surprise that it is also having an impact on language, our primary communication tool. In this short paper, we present three interconnected resources designed to capture and illustrate these effects on a subset of the German language: an RSS corpus of German-language newsfeeds (with freely available untruncated frequency lists), a continuously updated HTML page tracking the diversity of the vocabulary in the RSS corpus, and a web application that enables other researchers and the broader public to explore the corpus in terms of basic frequencies.

/content/journals/10.1075/ijcl.20078.wol
2020-10-14
2024-10-12