Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1034196

A corpus-based approach to reevaluation of Croatian verb classification


Blazsetin, Danijel; Bago, Petra
A corpus-based approach to reevaluation of Croatian verb classification // 7th International ConferenceThe Future of Information Sciences INFuture2019: Knowledge in the Digital Age : proceedings / Bago, Petra ; Hebrang Grgić, Ivana ; Ivanjko, Tomislav ; Juričić, Vedran ; Miklošević, Željka ; Stublić, Helena (ur.).
Zagreb: Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu, 2019. str. 40-47 doi:10.17234/INFUTURE.2019.6 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 1034196 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
A corpus-based approach to reevaluation of Croatian verb classification

Autori
Blazsetin, Danijel ; Bago, Petra

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
7th International ConferenceThe Future of Information Sciences INFuture2019: Knowledge in the Digital Age : proceedings / Bago, Petra ; Hebrang Grgić, Ivana ; Ivanjko, Tomislav ; Juričić, Vedran ; Miklošević, Željka ; Stublić, Helena - Zagreb : Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu, 2019, 40-47

Skup
7th International Conference The Future of Information Sciences (INFuture 2019)

Mjesto i datum
Zagreb, Hrvatska, 21.11.2019. - 22.11.2019

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
corpus linguistics ; natural language processing ; verb classification ; grammar textbooks ; Croatian language

Sažetak
Croatian grammar textbooks have a long tradition of classifying verbs based on their morphosyntactic characteristics. Conclusions, such as the frequency or productiveness of a class, were drawn without having the insight into a big corpus. Corpora used in such descriptions were not described and were presumably made of literary works which is, in our opinion, describing a form of the Croatian language distant from its everyday use. The corpus used for analyzing verbs in this paper is hrWaC which contains 1.9 billion tokens and about 90, 000 verbs. This corpus was selected with the intention of describing and analyzing a less formal and less standardized language This paper offers a corpus-based approach to the problem of verb classification and emphasizes the importance of NLP methods in the process of classification as they fasten and simplify it. The paper gives a brief introduction to verbs, their morphological characteristics and their classification. By extracting verbs from the Croatian web corpus hrWaC and processing them computationally, the paper gives an insight into the verb distribution in the Croatian language and points out some difficulties that were encountered during this study. Even though this paper aimed to reevaluate the existing data data, the present findings mostly confirm the claims of previous researches. A number of recommendations for future research are given, foremost, the need of the extension of the language material.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti, Interdisciplinarne društvene znanosti, Interdisciplinarne humanističke znanosti



POVEZANOST RADA


Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Petra Bago (autor)

Poveznice na cjeloviti tekst rada:

doi infoz.ffzg.hr openbooks.ffzg.unizg.hr

Citiraj ovu publikaciju:

Blazsetin, Danijel; Bago, Petra
A corpus-based approach to reevaluation of Croatian verb classification // 7th International ConferenceThe Future of Information Sciences INFuture2019: Knowledge in the Digital Age : proceedings / Bago, Petra ; Hebrang Grgić, Ivana ; Ivanjko, Tomislav ; Juričić, Vedran ; Miklošević, Željka ; Stublić, Helena (ur.).
Zagreb: Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu, 2019. str. 40-47 doi:10.17234/INFUTURE.2019.6 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Blazsetin, D. & Bago, P. (2019) A corpus-based approach to reevaluation of Croatian verb classification. U: Bago, P., Hebrang Grgić, I., Ivanjko, T., Juričić, V., Miklošević, Ž. & Stublić, H. (ur.)7th International ConferenceThe Future of Information Sciences INFuture2019: Knowledge in the Digital Age : proceedings doi:10.17234/INFUTURE.2019.6.
@article{article, author = {Blazsetin, Danijel and Bago, Petra}, year = {2019}, pages = {40-47}, DOI = {10.17234/INFUTURE.2019.6}, keywords = {corpus linguistics, natural language processing, verb classification, grammar textbooks, Croatian language}, doi = {10.17234/INFUTURE.2019.6}, title = {A corpus-based approach to reevaluation of Croatian verb classification}, keyword = {corpus linguistics, natural language processing, verb classification, grammar textbooks, Croatian language}, publisher = {Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveu\v{c}ili\v{s}ta u Zagrebu}, publisherplace = {Zagreb, Hrvatska} }
@article{article, author = {Blazsetin, Danijel and Bago, Petra}, year = {2019}, pages = {40-47}, DOI = {10.17234/INFUTURE.2019.6}, keywords = {corpus linguistics, natural language processing, verb classification, grammar textbooks, Croatian language}, doi = {10.17234/INFUTURE.2019.6}, title = {A corpus-based approach to reevaluation of Croatian verb classification}, keyword = {corpus linguistics, natural language processing, verb classification, grammar textbooks, Croatian language}, publisher = {Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveu\v{c}ili\v{s}ta u Zagrebu}, publisherplace = {Zagreb, Hrvatska} }

Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font