Bibliographic record number: 397524
Lexical and morphological data from Croatian Child Frequency Dictionary: a developmental perspective
Lexical and morphological data from Croatian Child Frequency Dictionary: a developmental perspective // Learning & Perception / Mihály Racsmány (ed.).
Budapest: Akadémiai Kiadó, 2009. p. 24 (poster, international peer review, abstract, other)
CROSBI ID: 397524
Title
Lexical and morphological data from Croatian Child Frequency Dictionary: a developmental perspective
Authors
Hržica, Gordana ; Kuvač Kraljević, Jelena ; Šnajder, Jan
Type, subtype and category of work
Conference abstracts, abstract, other
Source
Learning & Perception
/ Mihály Racsmány - Budapest : Akadémiai Kiadó, 2009, 24-24
Conference
1st Dubrovnik Conference on Cognitive Science DUCOG
Place and date
Dubrovnik, Croatia, 22.05.2009. - 24.05.2009
Type of participation
Poster
Type of review
International peer review
Keywords
Croatian corpus of child language ; Croatian Child Frequency Dictionary ; CHILDES ; lemmatization ; tagging of corpus
Abstract
The Croatian corpus of child language consists of recordings of the spontaneous speech of three monolingual children, taken from age 1;5 to 2;8, approximately twice a month. The recordings were transcribed in the CLAN program according to the CHAT rules and are available online in the CHILDES database (http://childes.psy.cmu.edu/data/Slavic/). In the second stage of data processing, the complete corpus was morphologically tagged using a morphological lexicon developed specifically for Croatian child language, and then post-edited. The tagged corpus was used as the basis for compiling the first Croatian Child Frequency Dictionary. The data from the corpus were lemmatized and then analyzed with regard to the specific features of child language corpora, preserving in particular the developmental (time) component. The Croatian Child Frequency Dictionary allows for the analysis of the most frequent lemmas in all three sub-corpora, according to frequency, alphabetical order, time of appearance, and part of speech. It also preserves the morphological encoding of types and the numbers of types and tokens. It therefore incorporates more information than traditional corpora of written language, enabling users to extract relevant information on child language development, such as type/token ratio, lexical diversity, and morphological diversity.
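The measures named in the abstract (lemma frequency, time of first appearance, type/token ratio) can be illustrated with a minimal Python sketch. It is not the authors' implementation. The row layout, the function names, and the six sample rows are all assumptions invented for illustration and are not taken from the corpus or the dictionary.

```python
from collections import Counter

# Hypothetical toy sample: (word form, lemma, part of speech, child's age).
# These rows are invented for illustration and are NOT taken from the corpus.
TAGGED = [
    ("mama", "mama", "N", "1;5"),
    ("mame", "mama", "N", "1;6"),
    ("pije", "piti", "V", "1;6"),
    ("pijem", "piti", "V", "2;0"),
    ("mama", "mama", "N", "2;0"),
    ("lopta", "lopta", "N", "2;1"),
]


def age_in_months(age):
    """Convert a 'years;months' age label such as '1;5' to months."""
    years, months = age.split(";")
    return int(years) * 12 + int(months)


def lemma_frequencies(rows):
    """Count how many tokens belong to each lemma."""
    return Counter(lemma for _, lemma, _, _ in rows)


def type_token_ratio(rows):
    """Number of distinct word forms divided by the number of tokens."""
    forms = [form for form, _, _, _ in rows]
    return len(set(forms)) / len(forms)


def first_appearance(rows):
    """Earliest age (in months) at which each lemma is attested."""
    first = {}
    for _, lemma, _, age in rows:
        months = age_in_months(age)
        if lemma not in first or months < first[lemma]:
            first[lemma] = months
    return first
```

Here types are counted over word forms. Counting over lemmas instead would give a lemma/token ratio, which is a different measure of lexical diversity.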
Original language
English
Scientific fields
Speech and language pathology, Interdisciplinary social sciences, Interdisciplinary humanities
WORK AFFILIATIONS
Projects:
013-0131484-1488 - Higher cortical functions and language: developmental and acquired disorders (Kovačević, Melita, MZOS) (CroRIS)
036-1300646-1986 - Knowledge discovery in textual data (Dalbelo-Bašić, Bojana, MZO) (CroRIS)
Institutions:
Faculty of Education and Rehabilitation Sciences, Zagreb,
Faculty of Electrical Engineering and Computing, Zagreb