Morphological Complexity of Children Narratives in Eight Languages (CROSBI ID 720393)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Hržica, Gordana ; Liebeskind, Chaya ; Štrkalj Despot, Kristina ; Dontcheva-Navratilova, Olga ; Kamandulytė-Merfeldienė, Laura ; Košutar, Sara ; Kramarić, Matea ; Valūnaitė-Oleškevičienė, Giedrė
engleski
Morphological Complexity of Children Narratives in Eight Languages
The aim of this study was to compare the morphological complexity in a corpus representing the language production of younger and older children across different languages. The language samples were taken from the Frog Story subcorpus of the CHILDES corpora, which comprises oral narratives collected by various researchers between 1990 and 2005. We extracted narratives from typically developing, monolingual, middle- class children. Additionally, samples of the Lithuanian language, collected according to the same principles, were added. The corpus comprises 249 narratives evenly distributed across eight languages: Croatian, English, French, German, Italian, Lithuanian, Russian and Spanish. Two subcorpora were formed for each language: a younger children corpus and an older children corpus. Four measures of morphological complexity were calculated for each subcorpus: Bane, Kolmogorov, Word entropy and Relative entropy of word structure. The results showed that younger children's corpora had lower morphological complexity than older children's corpora for all four measures for Spanish and Russian. Reversed results were obtained for English and French, and the results for the remaining four languages showed variation. Relative entropy of word structure proved to be indicative of age differences. Word entropy and relative entropy of word structure show the potential to demonstrate typological differences.
language development, language sample analysis, morphological complexity, measurement
This work has been supported by the project NexusLinguarum – European network for Web-centred linguistic data science (COST Action CA18209) and it has been supported in part by the Croatian Science Foundation under the project Multilevel approach to spoken discourse in language development (UIP-2017-05-6603).
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
4729-4738.
2022.
objavljeno
Podaci o matičnoj publikaciji
Proceedings of the 13th Conference on Language Resources and Evaluation (LREC 2022)
Calzolari, Nicoletta ; Béchet, Frédéric ; Blache, Philippe ; Choukri, Khalid ; Cieri, Christopher ; Declerck, Thierry ; Goggi, Sara ; Isahara, Hitoshi ; Maegaard, Bente ; Mariani, Joseph ; Hélène, Mazo ; Odijk, Jan ; Piperidis, Stelios
Pariz: European Language Resources Association (ELRA)
979-10-95546-72-6
Podaci o skupu
13th Language Resources and Evaluation Conference (LREC2022)
poster
20.06.2022-25.06.2022
Marseille, Francuska