Vocabulary size prediction of Croatian texts

Tuđman, Miroslav; Mikelić, Nives; Boras, Damir

izvor podataka: crosbi ✓

Vocabulary size prediction of Croatian texts (CROSBI ID 493518)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Tuđman, Miroslav ; Mikelić, Nives ; Boras, Damir Vocabulary size prediction of Croatian texts // Proceedings of the 25th International Conference on Information Technology Interfaces / Budin, Leo ; Lužar-Stiffler, Vesna ; Bekić, Zoran et al. (ur.). Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2003. str. 223-228

Podaci o odgovornosti

Autori

Tuđman, Miroslav ; Mikelić, Nives ; Boras, Damir

Osnovni podaci na izvornom jeziku
Osnovni podaci na ostalim jezicima

Jezik

engleski

Naslov

Vocabulary size prediction of Croatian texts

Sažetak

The preliminary research of the vocabulary size of the Croatian lexical corpora shows that the distribution of types is regular and that deviations of the calculated values are within theoretically acceptable limit. The research also brought us to conclusion that Zipf's Law in Croatian language is not applicable because the lexical density is different, i.e. the proportion of types and tokens in different languages is different and the parameters of that proportion need to be calculated for every language separately.

Ključne riječi

Lexical items; vocabulary size; Zipf law; lexical density; token; type; Croatian text corpus

Napomena

nije evidentirano

Jezik

nije evidentirano

Naslov

nije evidentirano

Sažetak

nije evidentirano

Ključne riječi

nije evidentirano

Napomena

nije evidentirano

Podaci o prilogu

Stranice rada

223-228.

Godina izdavanja

2003.

Status objave rada

objavljeno

Podaci o matičnoj publikaciji

Naslov

Proceedings of the 25th International Conference on Information Technology Interfaces

Urednici

Budin, Leo ; Lužar-Stiffler, Vesna ; Bekić, Zoran ; Hljuz Dobrić, Vesna

Izdavač

Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce)

Podaci o skupu

Skup

International Conference on Information Technology Interfaces (25 ; 2003)

Vrsta sudjelovanja

predavanje

Datum održavanja skupa

18.06.2003-19.06.2003

Mjesto održavanja skupa

Cavtat, Hrvatska

Povezanost rada

Povezane osobe

Damir Boras (autor/i)

Nives Mikelić Preradović (autor/i)

Miroslav Tuđman (autor/i)

Povezane ustanove

Filozofski fakultet u Zagrebu (130) (autorova ustanova)

Područje

Informacijske i komunikacijske znanosti