1887
image of A cross-linguistic computational study on one new idea per clause

Abstract

We investigate a notion related to Chafe’s One New Idea Constraint (ONICON), namely that clauses only contain one new idea at a time. We investigate both the nominal and the verbal domain in discourse production data from 16 diverse languages. We find no convincing evidence for a hard constraint on the number of new ideas per clause, nor for speakers actively managing information flow in discourse. The absence of an underlying ONICON is suggested by (1) the fact that NPs co-occurring in a single clause do not have more accessible referents, (2) that observed distributions of information can likewise be reproduced by a randomised mechanism, and (3) the absence of a presentational referent activation pattern. A more likely account of observed discourse production pattern is producers’ overall goal to be both contentful and coherent, the former implying the mentioning of a considerable number of entities and the latter implying predications about these entities. It is in this sense that there is audience design, rather than in an avoidance of information overcharge.

Available under the CC BY 4.0 license.
Loading

Article metrics loading...

/content/journals/10.1075/fol.26016.rei
2026-05-05
2026-05-11
Loading full text...

Full text loading...

/deliver/fulltext/10.1075/fol.26016.rei/fol.26016.rei.html?itemId=/content/journals/10.1075/fol.26016.rei&mimeType=html&fmt=ahah

References

  1. Ariel, Mira
    1990Accessing noun-phrase antecedents. London: Routledge.
    [Google Scholar]
  2. Arnold, Jennifer E.
    2010 How speakers refer: The role of accessibility. Language and Linguistics Compass(). –. 10.1111/j.1749‑818X.2010.00193.x
    https://doi.org/10.1111/j.1749-818X.2010.00193.x [Google Scholar]
  3. Barth, Danielle, Kira Davey & Maria Matheas
    2023 Multi-CAST Matukar Panau. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  4. Bogomolova, Natalia, Dmitry Ganenkov & Nils N. Schiborr
    2021 Multi-CAST Tabasaran. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  5. Brennan, Susan E.
    1995 Centering attention in discourse. Language and Cognitive Processes(). –. 10.1080/01690969508407091
    https://doi.org/10.1080/01690969508407091 [Google Scholar]
  6. Chafe, Wallace L.
    1994Discourse, consciousness, and time: The flow and displacement of conscious experience in speaking and writing. Chicago, IL: University of Chicago Press.
    [Google Scholar]
  7. Chafe, Wallace L.
    1987 Cognitive constraints on information flow. InRussell S. Tomlin (ed.), Coherence and grounding in discourse, –. Amsterdam: John Benjamins. 10.1075/tsl.11.03cha
    https://doi.org/10.1075/tsl.11.03cha [Google Scholar]
  8. Christiansen, Morten H. & Nick Chater
    2016 The now-or-never bottleneck: A fundamental constraint on language. Behavioral and Brain Sciences. . 10.1017/S0140525X1500031X
    https://doi.org/10.1017/S0140525X1500031X [Google Scholar]
  9. Clark, Herbert H. & Gregory L. Murphy
    1982 Audience design in meaning and reference. InJean-François Le Ny & Walter Kintsch (eds.), Advances in psychology, –. Amsterdam: North-Holland.
    [Google Scholar]
  10. Comrie, Bernard, Martin Haspelmath & Balthasar Bickel
    2008The Leipzig glossing rules: Conventions for interlinear morpheme-by-morpheme glosses. Leipzig: Max Planck Institute for Evolutionary Anthropology.
    [Google Scholar]
  11. Croft, William
    2007 Intonation units and grammatical structure in Wardaman and in cross-linguistic perspective. Australian Journal of Linguistics(). –. 10.1080/07268600601172934
    https://doi.org/10.1080/07268600601172934 [Google Scholar]
  12. Du Bois, John. W.
    1987 The discourse basis of ergativity. Language(). –. 10.2307/415719
    https://doi.org/10.2307/415719 [Google Scholar]
  13. 2003 Discourse and grammar. InMichael Tomasello (ed.), The new psychology of language: Cognitive and functional approaches to language structure, –. Mahwah, MA: Lawrence Erlbaum.
    [Google Scholar]
  14. Egurtzegi, Aitor, Damián E. Blasi, Sebastian Sauppe, Balthasar Bickel & Stefan Schnell
    2025 Discourse ergativity and human reference in Basque. Studies in Language(). –. 10.1075/sl.23005.egu
    https://doi.org/10.1075/sl.23005.egu [Google Scholar]
  15. Evans, Nicholas & Stephen C. Levinson
    2009 The myth of language universals: Language diversity and its importance for cognitive science. Behavioral and Brain Sciences(). –. 10.1017/S0140525X0999094X
    https://doi.org/10.1017/S0140525X0999094X [Google Scholar]
  16. Forker, Diana & Nils N. Schiborr
    2019 Multi-CAST Sanzhi Dargwa. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  17. Ganenkov, Dmitry & Nils N. Schiborr
    2025 Multi-CAST Chirag. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2507. Bamberg: University of Bamberg.
    [Google Scholar]
  18. Givón, Talmy
    1975 Focus and scope of assertion: Some Bantu evidence. Studies in African Linguistics(). –.
    [Google Scholar]
  19. 1983 Topic continuity in discourse: An introduction. InTalmy Givón (ed.), Topic continuity in discourse, –. Amsterdam: John Benjamins. 10.1075/tsl.3.01giv
    https://doi.org/10.1075/tsl.3.01giv [Google Scholar]
  20. 1984Syntax: A functional-typological introduction (Vol.). Amsterdam: John Benjamins. 10.1075/z.17
    https://doi.org/10.1075/z.17 [Google Scholar]
  21. Goldberg, Adele E.
    2004 Discourse and argument structure. InLaurence R. Horn & Gregory Ward (eds.), The handbook of pragmatics, –. Malden, MA: Blackwell.
    [Google Scholar]
  22. Gundel, Jeanette K., Nancy Hedberg & Ron Zacharski
    1993 Cognitive status and the form of referring expressions in discourse. Language(). –. 10.2307/416535
    https://doi.org/10.2307/416535 [Google Scholar]
  23. Hadjidas, Harris & Maria Vollmer
    2015 Multi-CAST Cypriot Greek. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  24. Haig, Geoffrey & Stefan Schnell
    2014Annotations using GRAID (Grammatical relations and animacy in discourse): Introduction and guidelines for annotators (version 7.0). Bamberg: University of Bamberg.
    [Google Scholar]
  25. 2016 The discourse basis of ergativity revisited. Language(). –. 10.1353/lan.2016.0049
    https://doi.org/10.1353/lan.2016.0049 [Google Scholar]
  26. 2023Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  27. Haig, Geoffrey, Stefan Schnell & Nils Norman Schiborr
    2021 Universals of reference in discourse and grammar: Evidence from the Multi-CAST collection of spoken corpora. InGeoffrey Haig, Stefan Schnell & Frank Seifart (eds.), Doing corpus-based typology with spoken language corpora: State of the art, –. Honolulu, HI: University of Hawai’i Press.
    [Google Scholar]
  28. Haig, Geoffrey, Maria Vollmer & Hanna Thiele
    2019 Multi-CAST Northern Kurdish. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  29. Himmelmann, Nikolaus P.
    2022 Prosodic phrasing and the emergence of phrase structure. Linguistics(). –. 10.1515/ling‑2020‑0135
    https://doi.org/10.1515/ling-2020-0135 [Google Scholar]
  30. Himmelmann, Nikolaus P., Meytal Sandler, Jan Strunk & Volker Unterladstetter
    2018 On the universality of intonational phrases: A cross-linguistic interrater study. Phonology(). –. 10.1017/S0952675718000039
    https://doi.org/10.1017/S0952675718000039 [Google Scholar]
  31. Hymes, Dell
    1972 On communicative competence. InJohn B. Pride & Janet Holmes (eds.), Sociolinguistics: Selected readings, –. Harmondsworth: Penguin.
    [Google Scholar]
  32. Inbar, Maya, Shir Genzer, Anat Perry, Eitan Grossman & Ayelet N. Landau
    2023 Intonation units in spontaneous speech evoke a neural response. Journal of Neuroscience(). –. 10.1523/JNEUROSCI.0235‑23.2023
    https://doi.org/10.1523/JNEUROSCI.0235-23.2023 [Google Scholar]
  33. Jaeger, T. Florian
    2010 Redundancy and reduction: Speakers manage syntactic information density. Cognitive Psychology(). –. 10.1016/j.cogpsych.2010.02.002
    https://doi.org/10.1016/j.cogpsych.2010.02.002 [Google Scholar]
  34. Karttunen, Lauri
    1976 Discourse referents. InJames D. McCawley (ed.), Notes from the linguistic underground, –. New York, NY: Academic Press. 10.1163/9789004368859_021
    https://doi.org/10.1163/9789004368859_021 [Google Scholar]
  35. Konopka, Agnieszka E.
    2012 Planning ahead: How recent experience with structures and words changes the scope of linguistic planning. Journal of Memory and Language(). –. 10.1016/j.jml.2011.08.003
    https://doi.org/10.1016/j.jml.2011.08.003 [Google Scholar]
  36. Kurabe, Keita
    2021 Multi-CAST Jinghpaw. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  37. Labov, William & Joshua Waletzky
    1967 Narrative analysis: Oral versions of personal experience. InJune Helm (ed.), Proceedings of the 1966 Annual Spring Meeting of the American Ethnological Society, –. Seattle, WA: University of Washington Press.
    [Google Scholar]
  38. Lambrecht, Knud
    1994Information structure and sentence form: Topic, focus, and the mental representations of discourse referents. Cambridge: CUP. 10.1017/CBO9780511620607
    https://doi.org/10.1017/CBO9780511620607 [Google Scholar]
  39. Levy, Roger & T. Florian Jaeger
    2007 Speakers optimize information density through syntactic reduction. InBernhard Schölkopf, John Platt & Thomas Hofmann (eds.), Advances in neural information processing systems 19: Proceedings of the 2006 conference, –. Cambridge, MA: MIT Press. 10.7551/mitpress/7503.003.0111
    https://doi.org/10.7551/mitpress/7503.003.0111 [Google Scholar]
  40. Linders, Guido M. & Stefan Schnell
    2025 Functional factors predict referential choice similarly across languages: A cross-linguistic computational analysis. PsyArXiv. 23p4m_v1. 10.31234/osf.io/23p4m_v1
    https://doi.org/10.31234/osf.io/23p4m_v1 [Google Scholar]
  41. MacDonald, Maryellen C.
    2013 How language production shapes language form and comprehension. Frontiers in Psychology(). –. 10.3389/fpsyg.2013.00226
    https://doi.org/10.3389/fpsyg.2013.00226 [Google Scholar]
  42. Meng, Chenxi
    2019 Multi-CAST Tulil. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  43. Mosel, Ulrike & Stefan Schnell
    2015 Multi-CAST Teop. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  44. Pawley, Andrew & Frances H. Syder
    2000 The one-clause-at-a-time hypothesis. InHeidi Riggenbach (ed.), Perspectives on fluency, –. Ann Arbor, MI: University of Michigan Press.
    [Google Scholar]
  45. Peck, Naomi & Laura Becker
    2024 Syntactic pausing? Re-examining the associations. Linguistics Vanguard(). –. 10.1515/lingvan‑2022‑0156
    https://doi.org/10.1515/lingvan-2022-0156 [Google Scholar]
  46. Polanyi, Livia
    2005 The linguistic structure of discourse. InDeborah Schiffrin, Deborah Tannen & Heidi E. Hamilton (eds.), The handbook of discourse analysis, –. Malden, MA: Blackwell. 10.1002/9780470753460.ch15
    https://doi.org/10.1002/9780470753460.ch15 [Google Scholar]
  47. R Core Team
    R Core Team 2025R: A language and environment for statistical computing. R Foundation for Statistical Computing. https://www.R-project.org
    [Google Scholar]
  48. Schiborr, Nils N.
    2015 Multi-CAST English. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  49. 2021 MulticastR: A companion to the Multi-CAST collection. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Bamberg: University of Bamberg.
    [Google Scholar]
  50. 2023Lexical anaphora. A corpus-based typological study of referential choice. Bamberg: University of Bamberg Press. 10.20378/irb‑59773
    https://doi.org/10.20378/irb-59773 [Google Scholar]
  51. Schiborr, Nils N., Stefan Schnell & Hanna Thiele
    2018RefIND — Referent indexing in natural-language discourse: Annotation guidelines (v1.1). Bamberg: University of Bamberg.
    [Google Scholar]
  52. Schnell, Stefan
    2015 Multi-CAST Vera’a. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  53. Schnell, Stefan, Nils N. Schiborr & Geoffrey Haig
    2021 Efficiency in discourse processing: Does morphosyntax adapt to accommodate new referents?Linguistics Vanguard(). . 10.1515/lingvan‑2019‑0064
    https://doi.org/10.1515/lingvan-2019-0064 [Google Scholar]
  54. Seifart, Frank & Tai Hong
    2022 Multi-CAST Bora. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  55. Seifart, Frank, Jan Strunk, Swintha Danielsen, Iren Hartmann, Brigitte Pakendorf, Søren Wichmann, Alena Witzlack-Makarevich, Nivja H. de Jong & Balthasar Bickel
    2018 Nouns slow down speech across structurally and culturally diverse languages. Proceedings of the National Academy of Sciences(). –. 10.1073/pnas.1800708115
    https://doi.org/10.1073/pnas.1800708115 [Google Scholar]
  56. Shannon, Claude E.
    1948 A mathematical theory of communication. The Bell System Technical Journal(). –. 10.1002/j.1538‑7305.1948.tb01338.x
    https://doi.org/10.1002/j.1538-7305.1948.tb01338.x [Google Scholar]
  57. Shiohara, Asako
    2022 Multi-CAST Sumbawa. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  58. Smith, Mark & Linda Wheeldon
    1999 High level processing scope in spoken sentence production. Cognition(). –. 10.1016/S0010‑0277(99)00053‑0
    https://doi.org/10.1016/S0010-0277(99)00053-0 [Google Scholar]
  59. Thieberger, Nick & Timothy Brickell
    2019 Multi-CAST Nafsan. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  60. Thompson, Chad L.
    1990 On the treatment of topical objects in Chepang: Passive or inverse?Studies in Language. International Journal sponsored by the Foundation “Foundations of Language”(). –. 10.1075/sl.14.2.06tho
    https://doi.org/10.1075/sl.14.2.06tho [Google Scholar]
  61. Tily, Harry & Steven Piantadosi
    2009 Refer efficiently: Use less informative expressions for more predictable meanings. InKees van Deemter, Albert Gatt, Roger P. van Gompel & Emiel J. Krahmer (eds.), Proceedings of the Workshop on the Production of Referring Expressions: Bridging the gap between computational and empirical approaches to reference (PRE-CogSci 2009), –. Tilburg: Tilburg University.
    [Google Scholar]
  62. Venhuizen, Noortje J. & Harm Brouwer
    2025 Referential retrieval and integration in language comprehension: An electrophysiological perspective. Psychological Review. Published online. 10.1037/rev0000530
    https://doi.org/10.1037/rev0000530 [Google Scholar]
  63. Visser, Eline
    2021 Multi-CAST Kalamang. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  64. Vollmer, Maria
    2020 Multi-CAST Mandarin. InGeoffrey Haig & Stefan Schnell (eds.), Multi-CAST: Multilingual corpus of annotated spoken texts. Version 2311. Bamberg: University of Bamberg.
    [Google Scholar]
  65. Zipf, George K.
    1949Human behavior and the principle of least effort. Cambridge, MA: Addison-Wesley.
    [Google Scholar]
/content/journals/10.1075/fol.26016.rei
Loading
/content/journals/10.1075/fol.26016.rei
Loading

Data & Media loading...

  • Article Type: Research Article
Keywords: discourse production ; corpus-based typology ; ONICON ; information management
This is a required field
Please enter a valid email address
Approval was successful
Invalid data
An Error Occurred
Approval was partially successful, following selected items could not be processed due to error