Temporal Expression Tagging for Croatian Texts (CROSBI ID 402594)
Thesis | university undergraduate final thesis
Statement of responsibility
Skukan, Luka
Šnajder, Jan
Glavaš, Goran
English
Temporal Expression Tagging for Croatian Texts
Temporal expression extraction has recently become a popular area of natural language processing. The task consists of locating temporal expressions in text and normalising them into a canonical form. Recent successes of rule-based temporal taggers, HeidelTime in particular, and the lack of a good temporal tagger for Croatian motivated the construction of a HeidelTime rule set for the Croatian language. Additionally, WikiWarsHR, a corpus of Wikipedia-based historical narratives annotated with TIMEX3 tags, has been developed, both to support development of the HeidelTime rule set and for future use. The results achieved by the Croatian implementation of HeidelTime are comparable to those of implementations for other languages, while WikiWarsHR is close in size and tag density to other publicly available corpora.
TimeML; TIMEX3; rule-based tagging; temporal expression normalisation; natural language processing
Publication details
47
03.07.2014.
defended
Details of the institution that awarded the academic degree
Fakultet elektrotehnike i računarstva
Zagreb