Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 1037995

Parallel Corpus of Croatian-Italian Administrative Texts


Brkic Bakaric, Marija; Lalli Pacelat, Ivana
Parallel Corpus of Croatian-Italian Administrative Texts // Proceedings of the 2nd Workshop on Human- Informed Translation and Interpreting Technology (HiT-IT 2019) / Temnikova, I. ; Orăsan, C. ; Corpas Pastor, G. ; Mitkov, R. (ur.).
Varna, Bugarska, 2019. str. 11-18 doi:10.26615/issn.2683-0078.2019_002 (predavanje, međunarodna recenzija, sažetak, ostalo)


CROSBI ID: 1037995 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Parallel Corpus of Croatian-Italian Administrative Texts

Autori
Brkic Bakaric, Marija ; Lalli Pacelat, Ivana

Vrsta, podvrsta i kategorija rada
Sažeci sa skupova, sažetak, ostalo

Izvornik
Proceedings of the 2nd Workshop on Human- Informed Translation and Interpreting Technology (HiT-IT 2019) / Temnikova, I. ; Orăsan, C. ; Corpas Pastor, G. ; Mitkov, R. - , 2019, 11-18

Skup
2nd Workshop on Human-Informed Translation and Interpreting Technology

Mjesto i datum
Varna, Bugarska, 05.09.2019. - 06.09.2019

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
parallel corpora ; sentence alignment, automatic dictionary

Sažetak
Parallel corpora constitute a unique resource for providing assistance to human translators. The selection and preparation of the parallel corpora also conditions the quality of the resulting MT engine. Since Croatian is a national language and Italian is officially recognized as a minority language in seven cities and twelve municipalities of Istria County, a large amount of parallel texts is produced on a daily basis. However, there have been no attempts in using these texts for compiling a parallel corpus. A domain-specific sentencealigned parallel Croatian-Italian corpus of administrative texts would be of high value in creating different language tools and resources. The aim of this paper is, therefore, to explore the value of parallel documents which are publicly available mostly in pdf format and to investigate the use of automatically-built dictionaries in corpus compilation. The effects that a document format and, consequently sentence splitting, and the dictionary input have on the sentence alignment process are manually evaluated.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti, Filologija



POVEZANOST RADA


Ustanove:
Sveučilište Jurja Dobrile u Puli,
Fakultet informatike i digitalnih tehnologija, Rijeka

Poveznice na cjeloviti tekst rada:

doi rgcl.wlv.ac.uk acl-bg.org

Citiraj ovu publikaciju:

Brkic Bakaric, Marija; Lalli Pacelat, Ivana
Parallel Corpus of Croatian-Italian Administrative Texts // Proceedings of the 2nd Workshop on Human- Informed Translation and Interpreting Technology (HiT-IT 2019) / Temnikova, I. ; Orăsan, C. ; Corpas Pastor, G. ; Mitkov, R. (ur.).
Varna, Bugarska, 2019. str. 11-18 doi:10.26615/issn.2683-0078.2019_002 (predavanje, međunarodna recenzija, sažetak, ostalo)
Brkic Bakaric, M. & Lalli Pacelat, I. (2019) Parallel Corpus of Croatian-Italian Administrative Texts. U: Temnikova, I., Orăsan, C., Corpas Pastor, G. & Mitkov, R. (ur.)Proceedings of the 2nd Workshop on Human- Informed Translation and Interpreting Technology (HiT-IT 2019) doi:10.26615/issn.2683-0078.2019_002.
@article{article, author = {Brkic Bakaric, Marija and Lalli Pacelat, Ivana}, year = {2019}, pages = {11-18}, DOI = {10.26615/issn.2683-0078.2019\_002}, keywords = {parallel corpora, sentence alignment, automatic dictionary}, doi = {10.26615/issn.2683-0078.2019\_002}, title = {Parallel Corpus of Croatian-Italian Administrative Texts}, keyword = {parallel corpora, sentence alignment, automatic dictionary}, publisherplace = {Varna, Bugarska} }
@article{article, author = {Brkic Bakaric, Marija and Lalli Pacelat, Ivana}, year = {2019}, pages = {11-18}, DOI = {10.26615/issn.2683-0078.2019\_002}, keywords = {parallel corpora, sentence alignment, automatic dictionary}, doi = {10.26615/issn.2683-0078.2019\_002}, title = {Parallel Corpus of Croatian-Italian Administrative Texts}, keyword = {parallel corpora, sentence alignment, automatic dictionary}, publisherplace = {Varna, Bugarska} }

Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font