Is complementation a universal strategy? A cross-linguistic corpus study
DOI: https://doi.org/10.60923/issn.2785-0943/21288
Keywords: complementation, clause combining, corpus-based typology, clustering, parallax corpora, quoted speech, propositional framing
Abstract
This article examines whether complementation structures are cross-linguistically universal, using two cross-linguistic corpora that each draw on the same thirteen languages, spanning every continent. One is SCOPIC, the Social Cognition Parallax Interview Corpus, specifically designed to elicit material rich in grammatical categories relevant to social cognition; for each language in our sample this was balanced by a “general corpus” of roughly the same size with no specific targeting of domains. We find that, while complementation is widespread, it is not universal within the languages in our sample: in some it is absent entirely and in others it is extremely rare. Of the structural alternatives used to achieve the same functional goal, by far the commonest is quoted speech. This suggests that, in the evolution of linguistic structures, heteroglossia (the embedding of one person’s words in another’s) is the more basic phenomenon, from which complementation structures then evolve in many but not all languages.
License
Copyright (c) 2025 Nicholas Evans, Danielle Barth, Wayan Arka, Henrik Bergqvist, Christian Döhler, Sonja Gipper, Yukinori Kimoto, Dominique Knuchel, Daniel Majchrzak, Hitomi Ono, Eka Pratiwi, Saskia van Putten, Andrea C. Schalley, Asako Shiohara, Yanti

This work is licensed under a Creative Commons Attribution 4.0 International License.