Bibliographic record number: 919003
Croatian Adult Spoken Language Corpus (HrAL): overview and first analysis
Croatian Adult Spoken Language Corpus (HrAL): overview and first analysis // 12th Slavic Linguistics Society Meeting Book of Abstracts
Ljubljana, Slovenia, 2017, p. 179 (poster, international peer review, abstract, other)
CROSBI ID: 919003
Title
Croatian Adult Spoken Language Corpus (HrAL): overview and first analysis
Authors
Hržica, Gordana ; Kuvač Kraljević, Jelena
Type, subtype and category of work
Conference abstracts, abstract, other
Source
12th Slavic Linguistics Society Meeting Book of Abstracts
2017, p. 179
Conference
12th Slavic Linguistics Society Meeting
Place and date
Ljubljana, Slovenia, 21.09.2017 - 24.09.2017
Type of participation
Poster
Type of peer review
International peer review
Keywords
conversational analysis, spoken language corpora, Croatian
Abstract
Spoken-language corpora are based on spontaneous, unscripted speech that spans a variety of styles, registers and dialects. Consequently, these corpora are the most comprehensive source of data on the everyday language of ordinary speakers. This paper has two main goals:
1. To present the first Croatian spoken-language corpus, the Croatian Adult Spoken Language Corpus (HrAL; Kuvač Kraljević, Hržica, 2016), its structure and its possible applications in different linguistic disciplines. HrAL was built by sampling spontaneous conversations of 617 speakers from all Croatian counties, and it comprises more than 250 000 tokens and more than 100 000 types.
2. To present research on linguistic complexity in adult speakers of Croatian. The interrelation between two syntactic complexity measures was analysed: length of the production unit, as measured by the mean length of communication unit (MLCU); and syntactic sophistication, as measured by the ratio of relative clauses (RRC) to the total number of C-units. Results indicate a significant positive correlation between these two measures, confirming that speakers who produce longer utterances also use rarer and more complex syntactic structures.
Since HrAL reflects the actual use of language in everyday situations, it is expected to provide objective information about the Croatian language and deeper insights into its usage. HrAL is available within TalkBank, a large database of spoken-language corpora covering different languages (https://talkbank.org), in the Conversational Analyses corpora within the subsection Conversational Banks. Data were transcribed, coded and segmented using the transcription format Codes for Human Analysis of Transcripts (CHAT) and the Computerised Language Analysis (CLAN) suite of programmes within the TalkBank toolkit. Such open access should provide opportunities for the use of HrAL in research on spoken Croatian and its varieties, as well as in cross-linguistic studies comparing various linguistic properties.
KUVAČ KRALJEVIĆ, Jelena, HRŽICA, Gordana (2016). Croatian Adult Spoken Language Corpus (HrAL). Fluminensia: Journal for philological research. 28/2.
MACWHINNEY, Brian (2007). The TalkBank Project. In Creating and Digitizing Language Corpora: Synchronic Databases. Edited by J. C. Beal, K. P. Corrigan & H. L. Moisl. Vol. 1. Houndmills: Palgrave-Macmillan. 163-180.
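The two complexity measures named in the abstract, MLCU and RRC, and the correlation between them can be illustrated with a minimal Python sketch. It is not the authors' analysis pipeline (the abstract states that the data were processed with CLAN). The sketch assumes that C-unit length is counted in words and that the correlation is Pearson's r; the abstract specifies neither. The CUnit class, the function names and the toy data are hypothetical and are not drawn from HrAL.

```python
from dataclasses import dataclass
from math import sqrt


@dataclass
class CUnit:
    """One communication unit (C-unit) of a speaker. Hypothetical structure."""
    n_words: int             # length in words (assumed unit of length)
    n_relative_clauses: int  # relative clauses counted in this C-unit


def mlcu(units):
    """Mean length of C-unit: total words divided by number of C-units."""
    return sum(u.n_words for u in units) / len(units)


def rrc(units):
    """Ratio of relative clauses to the total number of C-units."""
    return sum(u.n_relative_clauses for u in units) / len(units)


def pearson(xs, ys):
    """Pearson correlation coefficient (an assumed choice of statistic)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sd_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    sd_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (sd_x * sd_y)


# Invented toy data for three speakers; NOT taken from the corpus.
speakers = {
    "A": [CUnit(4, 0), CUnit(6, 0), CUnit(5, 0), CUnit(5, 1)],
    "B": [CUnit(8, 1), CUnit(10, 1), CUnit(6, 0), CUnit(8, 0)],
    "C": [CUnit(12, 1), CUnit(9, 1), CUnit(11, 1), CUnit(8, 0)],
}

mlcu_values = [mlcu(units) for units in speakers.values()]
rrc_values = [rrc(units) for units in speakers.values()]
r = pearson(mlcu_values, rrc_values)

print("MLCU per speaker:", mlcu_values)
print("RRC per speaker:", rrc_values)
print("Pearson r:", round(r, 3))
```

In this toy data, speakers with longer C-units also have a higher share of relative clauses, so r comes out strongly positive. This only illustrates the direction of the relationship reported in the abstract, not its size.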
Original language
English
Scientific fields
Speech and language pathology, Interdisciplinary social sciences, Interdisciplinary humanities
RELATED TO THIS WORK
Projects:
HRZZ-UIP-2013-11-2421 - Adult Language Processing (ALP) (Kuvač Kraljević, Jelena, HRZZ - 2013-11) (CroRIS)
Institutions:
Faculty of Education and Rehabilitation Sciences, Zagreb