Pregled bibliografske jedinice broj: 807404
Temporal Expression Tagging for Croatian Texts
Temporal Expression Tagging for Croatian Texts, 2014., diplomski rad, preddiplomski, Fakultet elektrotehnike i računarstva, Zagreb
CROSBI ID: 807404 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Temporal Expression Tagging for Croatian Texts
Autori
Skukan, Luka
Vrsta, podvrsta i kategorija rada
Ocjenski radovi, diplomski rad, preddiplomski
Fakultet
Fakultet elektrotehnike i računarstva
Mjesto
Zagreb
Datum
03.07
Godina
2014
Stranica
47
Mentor
Šnajder, Jan
Neposredni voditelj
Glavaš, Goran
Ključne riječi
TimeML; TIMEX3; rule-based tagging; temporal expression normalisation; natural language processing
Sažetak
Temporal expression extraction has recently become a popular field of natural language processing. This task consists of locating temporal expressions and normalising them into a canonical form. Recent successes achieved by rule-based temporal taggers, HeidelTime in particular, and a lack of a good temporal tagger for Croatian have inspired the construction of HeidelTime rule set for the Croatian language. Additionally, WikiWarsHR, a corpus of Wikipedia-based historical narratives tagged with TIMEX3, has been developed, both for the purpose of developing the HeidelTime rule set and for future use. Results achieved by the Croatian implementation of HeidelTime are comparable to implementations for other languages, while WikiWarsHR is close in size and density of tags to other publicly available corpora.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb