Bibliographic record no. 132968

Vocabulary size prediction of Croatian texts


Tuđman, Miroslav; Mikelić, Nives; Boras, Damir
Vocabulary size prediction of Croatian texts // Proceedings of the 25th International Conference on Information Technology Interfaces / Budin, Leo ; Lužar-Stiffler, Vesna ; Bekić, Zoran ; Hljuz Dobrić, Vesna (eds.).
Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2003. pp. 223-228 (lecture, international peer review, full paper (in extenso), scientific)


CROSBI ID: 132968

Title
Vocabulary size prediction of Croatian texts

Authors
Tuđman, Miroslav ; Mikelić, Nives ; Boras, Damir

Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific

Source
Proceedings of the 25th International Conference on Information Technology Interfaces / Budin, Leo ; Lužar-Stiffler, Vesna ; Bekić, Zoran ; Hljuz Dobrić, Vesna - Zagreb : Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2003, 223-228

Conference
International Conference on Information Technology Interfaces (25 ; 2003)

Place and date
Cavtat, Croatia, 18-19 June 2003

Type of participation
Lecture

Type of review
International peer review

Keywords
Lexical items; vocabulary size; Zipf law; lexical density; token; type; Croatian text corpus.

Abstract
Preliminary research on the vocabulary size of Croatian lexical corpora shows that the distribution of types is regular and that deviations of the calculated values are within theoretically acceptable limits. The research also led us to the conclusion that Zipf's law is not applicable to the Croatian language because the lexical density is different: the proportion of types to tokens differs between languages, and the parameters of that proportion need to be calculated for each language separately.
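
The quantities named in the abstract (tokens, types, their proportion, and the rank-frequency relation behind Zipf's law) can be illustrated with a minimal Python sketch. It is not the authors' procedure: the tokenizer, the function names `tokenize`, `vocabulary_profile` and `zipf_slope`, and the log-log least-squares fit are all assumptions made for illustration, and the record does not say how the paper computed its values.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split text into lowercase word tokens (assumed tokenization rule).

    The pattern matches runs of Unicode letters, so Croatian characters
    such as č, ć, đ, š and ž are kept inside words.
    """
    return re.findall(r"[^\W\d_]+", text.lower())


def vocabulary_profile(text):
    """Return (number of tokens, number of types, type/token proportion)."""
    tokens = tokenize(text)
    types = Counter(tokens)
    n_tokens = len(tokens)
    n_types = len(types)
    proportion = n_types / n_tokens if n_tokens else 0.0
    return n_tokens, n_types, proportion


def zipf_slope(frequencies):
    """Least-squares slope of log(frequency) against log(rank).

    Under an ideal Zipf distribution (frequency proportional to 1/rank)
    the slope is -1; a fitted value far from -1 signals a deviation.
    """
    freqs = sorted(frequencies, reverse=True)
    if len(freqs) < 2 or freqs[-1] <= 0:
        raise ValueError("need at least two positive frequencies")
    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


if __name__ == "__main__":
    sample = "Riječ do riječi i još jedna riječ"
    print(vocabulary_profile(sample))
    print(zipf_slope(Counter(tokenize(sample)).values()))
```

Running the same functions on corpora in two languages and comparing the type/token proportion and the fitted slope is one simple way to see why such parameters may need to be estimated per language, as the abstract argues.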

Original language
English

Scientific fields
Information and communication sciences



WORK AFFILIATIONS


Projects:
0130443

Institutions:
Filozofski fakultet, Zagreb


Cite this publication:

Tuđman, Miroslav; Mikelić, Nives; Boras, Damir
Vocabulary size prediction of Croatian texts // Proceedings of the 25th International Conference on Information Technology Interfaces / Budin, Leo ; Lužar-Stiffler, Vesna ; Bekić, Zoran ; Hljuz Dobrić, Vesna (eds.).
Zagreb: Sveučilišni računski centar Sveučilišta u Zagrebu (Srce), 2003. pp. 223-228 (lecture, international peer review, full paper (in extenso), scientific)
Tuđman, M., Mikelić, N. & Boras, D. (2003) Vocabulary size prediction of Croatian texts. In: Budin, L., Lužar-Stiffler, V., Bekić, Z. & Hljuz Dobrić, V. (eds.) Proceedings of the 25th International Conference on Information Technology Interfaces.
@inproceedings{article,
  author    = {Tu{\dj}man, Miroslav and Mikeli\'{c}, Nives and Boras, Damir},
  title     = {Vocabulary size prediction of Croatian texts},
  booktitle = {Proceedings of the 25th International Conference on Information Technology Interfaces},
  editor    = {Budin, Leo and Lu\v{z}ar-Stiffler, Vesna and Beki\'{c}, Zoran and Hljuz Dobri\'{c}, Vesna},
  year      = {2003},
  pages     = {223--228},
  keywords  = {Lexical items, vocabulary size, Zipf law, lexical density, token, type, Croatian text corpus},
  publisher = {Sveu\v{c}ili\v{s}ni ra\v{c}unski centar Sveu\v{c}ili\v{s}ta u Zagrebu (Srce)},
  address   = {Zagreb}
}