Choice and complexity: In naturally occurring data, absolute complexity does not necessarily trigger relative complexity

Authors

DOI:

https://doi.org/10.60923/issn.2785-0943/19727

Keywords:

variation, disfluency, isomorphism, alternation, repeated measures correlation, switchboard

Abstract

This article interrogates two related assumptions widespread in many approaches to language: (1) languages do not like synonymy; (2) absolute complexity (i.e. the length of the grammatical description of a language) tends to be proportional to relative complexity (i.e. difficulty). Against this backdrop, we explore the link between syntactic synonymy (i.e., grammatical variation and optionality) and relative complexity (i.e., cognitive load) using methods from both corpus and psycholinguistics. We test two predictions: First, if synonymy avoidance is a design feature of human language, then grammatical variation should be sub-optimal and cause a measurable increase in production difficulty. Second, optionality will necessarily increase the absolute complexity of a language system. This increased absolute complexity will, in turn, increase relative complexity, i.e., cognitive load, also measured by increased production difficulty. Contrary to these predictions, analyses based on the SWITCHBOARD corpus of American English shows that the presence of choice contexts does not positively correlate with two metrics of production difficulty, namely filled pauses (um and uh) and unfilled pauses (speech planning time), not even when a typology of grammatical alternation type (insertion/deletion, substitution, permutation) is taken into account. These results challenge the view that grammatical optionality is sub-optimal and difficult for speakers, and that absolute complexity is necessarily proportional to relative complexity. 

Downloads

Download data is not yet available.

References

Abel, Jennifer Colleen. 2015. The effect of task difficulty on speech convergence. Doctoral dissertation, University of British Columbia.

Bakdash, Jonathan Z. & Laura R. Marusich. 2017. Repeated Measures Correlation. Frontiers in Psychology 8. 456. https://doi.org/10.3389/fpsyg.2017.00456.

Bakdash, Jonathan Z. & Laura R. Marusich. 2022. rmcorr: Repeated measures correlation. Manual.

Berthold, André. 1998. Repräsentation und Verarbeitung sprachlicher Indikatoren für kognitive Ressourcenbeschränkungen. MA disseration, Universität des Saarlandes.

Berthold, André & Anthony Jameson. 1999. Interpreting symptoms of cognitive load in speech input. In Judy Kay (ed.), User Modeling: Proceedings of the Seventh International Conference, UM99, 235–244. New York: Springer.

Bland, J Martin. & Douglas G. Altman. 1995. Statistics notes: Calculating correlation coefficients with repeated observations: Part 1--correlation within subjects. BMJ 310(6977). 446–446. https://doi.org/10.1136/bmj.310.6977.446.

Blumenthal-Dramé, Alice. 2021. The online processing of causal and concessive relations: Comparing native speakers of English and German. Discourse Processes 58(7). 642–661. https://doi.org/10.1080/0163853X.2020.1855693.

Boersma, Paul & David Weenink. 2023. Praat: Doing phonetics by computer. www.praat.org.

Bolinger, Dwight L. 1968. Entailment and the meaning of structures. Glossa 2. 119–127.

Bresnan, Joan, Anna Cueni, Tatiana Nikitina & Harald Baayen. 2007. Predicting the dative alternation. In Gerlof Boume, Irene Krämer & Joost Zwarts (eds.), Cognitive foundations of interpretation, 69–94. Amsterdam: Royal Netherlands Academy of Science.

Bresnan, Joan & Marilyn Ford. 2010. Predicting syntax. Language 86(1). 168–213. https://doi.org/10.1353/lan.0.0189.

Christodoulides, George. 2016. Effects of cognitive load on speech production and perception. Doctoral dissertation, Université catholique de Louvain, Louvain-la-Neuve.

Clark, Eve V. 1987. The principle of contrast: A constraint on language acquisition. In Brian MacWhinney (ed.), Mechanisms of language acquisition, 1–33. Hillsdale, NJ: Lawrence Erlbaum Associates.

Clark, Herbert H. & Jean E. Fox Tree. 2002. Using uh and um in spontaneous speaking. Cognition 84(1). 73–111. https://doi.org/10.1016/S0010-0277(02)00017-3.

Clark, Herbert H. & Thomas Wasow. 1998. Repeating words in spontaneous speech. Cognitive Psychology. Elsevier BV 37(3). 201–242. https://doi.org/10.1006/cogp.1998.0693.

Cooper, William E. & Jeanne Paccia-Cooper. 1980. Syntax and speech. Syntax and Speech. Cambridge, MA: Harvard University Press.

Dammalapati, Samvit, Rajakrishnan Rajkumar & Sumeet Agarwal. 2019. Expectation and locality effects in the prediction of disfluent fillers and repairs in English speech. In American Chapter of the Association for Computational Linguistics: Student research workshop, 103–109. Minneapolis, MN: Association for Computational Linguistics. https://doi.org/10.18653/v1/N19-3015.

De Smet, Hendrik. 2019. The motivated unmotivated: Variation, function and context. In Kristin Bech & Ruth Möhlig-Falke (eds.), Grammar – Discourse – Context, 305–332. Berlin: De Gruyter. https://doi.org/10.1515/9783110682564-011.

De Smet, Hendrik, Frauke D’hoedt, Lauren Fonteyn & Kristel Van Goethem. 2018. The changing functions of competing forms: Attraction and differentiation. Cognitive Linguistics 29(2). 197–234. https://doi.org/10.1515/cog-2016-0025.

De Troij, Robbert. 2022. Natiolectal variation in Dutch grammar: A data-driven approach. Doctoral dissertation, Radboud University, Nijmegen; KU Leuven.

Eckert, Penelope. 2008. Variation and the indexical field. Journal of Sociolinguistics 12(4). 453–76. https://doi.org/10.1111/j.1467-9841.2008.00374.x.

Eckert, Penelope. 2012. Three waves of variation study: The emergence of meaning in the study of sociolinguistic variation. Annual Review of Anthropology 41. 87–100.

Eckert, Penelope. 2016. Variation, meaning and social change. In Nikolas Coupland (ed.), Sociolinguistics: Theoretical debates, 68–85. Cambridge: Cambridge University Press. https://doi.org/10.1017/CBO9781107449787.004.

Ehret, Katharina, Alice Blumenthal-Dramé, Christian Bentz & Aleksandrs Berdicevskis. 2021. Meaning and measures: Interpreting and evaluating complexity metrics. Frontiers in Communication 6. 640510. https://doi.org/10.3389/fcomm.2021.640510.

Eitelmann, Matthias. 2016. Support for end-weight as a determinant of linguistic variation and change. English Language and Linguistics 20(3). 395–420. https://doi.org/10.1017/S1360674316000356.

Erdmann, Karl-Otto. 1910. Die Bedeutung des Wortes: Aufsätze aus dem Grenzgebiet der Sprachpsychologie und Logik. 2nd edn. Leipzig: Avenarius.

Ferreira, Fernanda. 1991. Effects of length and syntactic complexity on initiation times for prepared utterances. Journal of Memory and Language 30(2). 210–233. https://doi.org/10.1016/0749-596x(91)90004-4.

Fox Tree, Jean E. & Herbert H Clark. 1997. Pronouncing “the” as “thee” to signal problems in speaking. Cognition 62(2). 151–167. https://doi.org/10.1016/S0010-0277(96)00781-0.

Freeman, Valerie. 2015. The phonetics of stance-taking. Doctoral dissertation, University of Washington.

Gardner, Matt Hunt & Benedikt Szmrecsanyi. 2022. Um, uh, and variation in American English. (Paper presented at Methods XVII - Methods in Dialectology & Language Diversity, Johannes Gutenberg-University, Mainz, Germany, August 1-5, 2022).

Gardner, Matt Hunt, Eva Uffing, Nicholas Van Vaeck & Benedikt Szmrecsanyi. 2021. Variation isn’t that hard: Morphosyntactic choice does not predict production difficulty. (Ed.) Stefan Th. Gries. PLOS ONE 16(6). e0252602. https://doi.org/10.1371/journal.pone.0252602.

Godfrey, John J., Edward C. Holliman & Jane McDaniel. 1992. SWITCHBOARD: Telephone speech corpus for research and development. In IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 1, 517–520. IEEE Computer Society. https://doi.org/10.1109/ICASSP.1992.225858

Goldberg, Adele E. 1995. Constructions. Chicago: University of Chicago Press.

Goldberg, Adele E. 2019. Explain me this: Creativity, competition, and the partial productivity of constructions. Princeton (N.J.): Princeton University Press.

Gries, Stefan Th. 2017. Syntactic alternation research: Taking stock and some suggestions for the future. Belgian Journal of Linguistics 31. 8–29. https://doi.org/10.1075/bjl.00001.gri.

Gropen, Jess, Steven Pinker, Michelle Hollander, Richard Goldberg & Ronald Wilson. 1989. The learnability and acquisition of the dative alternation in English. Language 65(2). 203. https://doi.org/10.2307/415332.

Grosjean, François, Lysiane Grosjean & Harlan Lane. 1979. The patterns of silence: Performance structures in sentence production. Cognitive Psychology. Elsevier BV 11(1). 58–81. https://doi.org/10.1016/0010-0285(79)90004-5.

Gunitsky, Seva. 2019. Rival visions of parsimony. International Studies Quarterly 63(3). 707–716. https://doi.org/10.1093/isq/sqz009.

Haiman, John. 1980. The iconicity of grammar: Isomorphism and motivation. Language 56(3). 515–540. https://doi.org/10.2307/414448.

Haiman, John. 1985. Natural syntax: Iconicity and erosion. Cambridge: Cambridge University Press.

Hawkins, John A. 2019. Word-external properties in a typology of Modern English: a comparison with German. English Language and Linguistics 23(3). 701–727. https://doi.org/10.1017/S1360674318000060.

Hieke, Adolf E., Sabine Kowal & Daniel C. O’Connell. 1983. The trouble with “articulatory” pauses. Language and Speech 26(3). 203–214. https://doi.org/10.1177/002383098302600302.

Hout, Roeland van & Pieter Muysken. 2016. Taming chaos. Chance and variability in the language sciences. In Klaas Landsman & Ellen van Wolde (eds.), The Challenge of Chance, 249–266. Cham: Springer. https://doi.org/10.1007/978-3-319-26300-7_14.

Kusters, Wouter. 2003. Linguistic complexity: The influence of social change on verbal inflection (LOT 77). Utrecht: LOT.

Labov, William. 1972a. Sociolinguistic patterns. Philadelphia: University of Pennsylvania Press.

Labov, William. 1972b. Some principles of linguistic methodology. Language in Society 1. 97–120.

Labov, William. 1978. Where does the linguistic variable stop? A response to Batriz R. Lavandera. Working Papers in Sociolinguistics 44. 1–22.

Labov, William. 1984. Field methods of the project of linguistic change and variation. In John Baugh & Joel Sherzer (eds.), Language in use. Readings in sociolinguistics, 28–53. Englewood Cliffs, NJ: Prentice Hall.

Lavandera, Beatriz R. 1978. Where does the linguistic variable stop? Language in Society 7(2). 171–182.

Le Grézause, Esther. 2017. Um and uh, and the expression of stance in conversational speech. Doctoral dissertation, University of Washington.

Leclercq, Benoît & Cameron Morin. 2023. No equivalence: A new principle of no synonymy. Constructions 1–16. https://doi.org/10.24338/CONS-535.

Levshina, Natalia & David Lorenz. 2022. Communicative efficiency and the Principle of No Synonymy: Predictability effects and the variation of want to and wanna. Language and Cognition 14(2). 249–274. https://doi.org/10.1017/langcog.2022.7.

Levy, Roger & T Florian Jaeger. 2007. Speakers optimize information density through syntactic reduction. In Bernard Schölkopf, John Platt & Thomas Hoffman (eds.), Advances in Neural Information Processing Systems 19, 849–856. Cambridge, MA: MIT Press. http://books.nips.cc/papers/files/nips19/NIPS2006_0515.pdf.

Lickley, Robin J. 2015. Fluency and disfluency. In Melissa A. Redford (ed.), The handbook of speech production, 445–469. Oxford: Wiley-Blackwell.

Lieberman, Philip. 1963. Some effects of semantic and grammatical context on the production and perception of speech.Language and Speech 6(3). 172–187. https://doi.org/10.1177/002383096300600306.

Ma, Ruiming, Thomas Van Hoey & Benedikt Szmrecsanyi. 2025. Isomorphism-inspired theorising about optionality and variation: no empirical support from English grammar. English Language and Linguistics. https://doi.org/10.1017/S1360674325000097.

MacDonald, Maryellen C. 2013. How language production shapes language form and comprehension. Frontiers in Psychology 4. 1–16. https://doi.org/10/gbfpt3.

Maclay, Howard & Charles E. Osgood. 1959. Hesitation phenomena in spontaneous English speech. WORD 15(1). 19–44. https://doi.org/10.1080/00437956.1959.11659682.

MacWhinney, Brian. 1989. Competition and lexical categorization. In Roberta Corrigan, Fred Eckman & Michael Noonan (eds.), Current issues in linguistic theory. Vol 61: Linguistic categorization (Amsterdam Studies in the Theory and History of Linguistic Science 4), 195–241. Amsterdam: John Benjamins.

Mair, Christian. 2002. Three changing patterns of verb complementation in Late Modern English: A real-time study based on matching text corpora. English Language and Linguistics 6(1). 105–131. https://doi.org/10.1017/s1360674302001065.

Meister, Clara, Tiago Pimentel, Patrick Haller, Lena Jäger, Ryan Cotterell & Roger Levy. 2021. Revisiting the Uniform Information Density Hypothesis. In Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing, 963–980. Online and Punta Cana, Dominican Republic: Association for Computational Linguistics. https://doi.org/10.18653/v1/2021.emnlp-main.74.

Merlo, Sandra & Letícia Lessa Mansur. 2004. Descriptive discourse. Journal of Communication Disorders 37(6). 489–503. https://doi.org/10.1016/j.jcomdis.2004.03.002.

Miestamo, Matti. 2008. Grammatical complexity in a cross-linguistic perspective. In Matti Miestamo, Kaius Sinnemäki & Fred Karlsson (eds.), Language complexity: typology, contact, change, 23–42. Amsterdam, Philadelphia: Benjamins.

Miestamo, Matti. 2017. Linguistic diversity and complexity. Lingue e Linguaggio 16(2). 227–253.

Moore, Emma. 2021. The social meaning of syntax. In Lauren Hall-Lew, Emma Moore & Robert J. Podesva (eds.), Social meaning and linguistic variation, 54–79. 1st edn. Cambridge: Cambridge University Press. https://doi.org/10.1017/9781108578684.003.

Oomen, Claudy & Albert Postma. 2001. Effects of divided attention on the production of filled pauses and repetitions. Journal of Speech, Language, and Hearing Research 44. 997–1004. https://doi.org/10.1044/1092-4388(2001/078).

Oviatt, Sharon. 1995. Predicting spoken disfluencies during human–computer interaction. Computer Speech & Language9(1). 19–35. https://doi.org/10.1006/csla.1995.0002.

Poplack, Shana & Nathalie Dion. 2009. Prescription vs praxis: The evolution of future temporal reference in French. Language. JSTOR 85(3). 557–587.

R Core Team. 2023. R: A language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. https://www.R-project.org/.

Rezaii, Neguine, Kyle Mahowald, Rachel Ryskin, Bradford Dickerson & Edward Gibson. 2022. A syntax–lexicon trade-off in language production. Proceedings of the National Academy of Sciences 119(25). e2120203119. https://doi.org/10.1073/pnas.2120203119.

Rodríguez Louro, Celeste, Glenys Dale Collard, Madeleine Clews & Matt Hunt Gardner. 2023. Quotation in earlier and contemporary Australian Aboriginal English. Language Variation and Change 35(2). 129–152. https://doi.org/10.1017/S0954394523000169.

Rohdenburg, Günter. 1996. Cognitive complexity and increased grammatical explicitness in English. Cognitive Linguistics 7(2). 149–182. https://doi.org/10.1515/cogl.1996.7.2.149.

Shih, Stephanie, Jason Grafmiller, Richard Futrell & Joan Bresnan. 2015. Rhythm’s role in genitive construction choice in spoken English. In Ralf Vogel & Ruben Vijver (eds.), Rhythm in cognition and grammar. Berlin: De Gruyter. https://doi.org/10.1515/9783110378092.207.

Shriberg, Elizabeth. 1994. Preliminaries to a theory of speech disfluencies. Doctoral dissertation, University of California, Berkeley.

Shriberg, Elizabeth E. 1996. Disfluencies in Switchboard. In H. Timothy Bunnell & Richard A. Foulds (eds.), International Conference on Spoken Language Processing, Addendum. Wilmington, DW: Alfred I. duPont Institute.

Smith, Vicki L. & Herbert H. Clark. 1993. On the course of answering questions. Journal of Memory and Language32(1). 25–38. https://doi.org/10.1006/jmla.1993.1002.

Suikkanen, Jussi. 2018. Deontic modality. Analysis 78(2). 354–363. https://doi.org/10.1093/analys/any015.

Szmrecsanyi, Benedikt. 2017. Variationist sociolinguistics and corpus-based variationist linguistics: Overlap and cross-pollination potential. Canadian Journal of Linguistics/Revue canadienne de linguistique 62(4). 685–701. https://doi.org/10.1017/cnj.2017.34.

Szmrecsanyi, Benedikt, Matt Hunt Gardner, Ruiming Ma & Thomas Van Hoey. in print. Empirical accountability meets theorizing about language variation. In Patricia Cukor-Avila, Sali A. Tagliamonte & Guy Bailey (eds.), Empirical accountability in variation linguistics: Taking the next step. Cambridge: Cambridge University Press.

Szmrecsanyi, Benedikt & Jason Grafmiller. 2023. Comparative variation analysis: Grammatical alternations in world Englishes (Studies in Language Variation and Change). Cambridge: Cambridge University Press.

Szmrecsanyi, Benedikt, Jason Grafmiller, Joan Bresnan, Anette Rosenbach, Sali Tagliamonte & Simon Todd. 2017. Spoken syntax in a comparative perspective: The dative and genitive alternation in varieties of English. Glossa: a journal of general linguistics 2(1). 86. https://doi.org/10.5334/gjgl.310.

Tagliamonte, Sali A. 2012. Variationist sociolinguistics: Change, observation, interpretation (Language in Society 40). Malden, MA: Wiley-Blackwell.

Tagliamonte, Sali A. & Katharina Pabst. 2020. A cool comparison: Adjectives of positive evaluation in Toronto, Canada and York, England. Journal of English Linguistics 48(1). 3–30. https://doi.org/10.1177/0075424219881487.

Tannenbaum, Percy H. & Frederick Williams. 1968. Generation of active and passive sentences as a function of subject or object focus. Journal of Verbal Learning and Verbal Behavior 7(1). 246–250. https://doi.org/10.1016/S0022-5371(68)80197-5.

Tily, Harry, Susanne Gahl, Inbal Arnon, Neal Snider, Anubha Kothari & Joan Bresnan. 2009. Syntactic probabilities affect pronunciation variation in spontaneous speech. Language and Cognition 1(2). 147–165. https://doi.org/10.1515/LANGCOG.2009.008.

Uhrig, Peter. 2015. Why the Principle of No Synonymy is overrated. Zeitschrift für Anglistik und Amerikanistik; Leipzig. Leipzig, Germany, Leipzig: Walter de Gruyter GmbH 63(3). 323–337. http://doi.org/10.1515/zaa-2015-0030.

Van Hoey, Thomas, Matt H. Gardner, Ruiming Ma & Benedikt Szmrecsanyi. Accepted. Unpredictable grammatical choices are not harder than predictable ones. Language Variation and Change.

Wälchli, Bernhard & Anna Sjöberg. 2025. A law of meaning. Linguistic Typology at the Crossroads. Linguistic Typology at the Crossroads 4(2). 1–71. https://doi.org/10.6092/ISSN.2785-0943/18920.

Wieling, Martijn, Jack Grieve, Gosse Bouma, Josef Fruehwald, John Coleman & Mark Liberman. 2016. Variation and change in the use of hesitation markers in Germanic languages. Language Dynamics and Change 6(2). 199–234. https://doi.org/10.1163/22105832-00602001.

Downloads

Published

2026-02-25

How to Cite

1.
Van Hoey T, Szmrecsanyi B, Gardner MH. Choice and complexity: In naturally occurring data, absolute complexity does not necessarily trigger relative complexity. LTC [Internet]. 2025 Jan. 1 [cited 2026 Feb. 27];5(2):323-51. Available from: https://typologyatcrossroads.unibo.it/article/view/19727

Issue

Section

Research articles