Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 101837

Building the Croatian-English Parallel Corpus


Tadić, Marko
Building the Croatian-English Parallel Corpus // Second International Conference on Language Resources and Evaluation LREC2000 / Gavrilidou, M., Carayannis, G., Markantonatou, S., Piperidis S.. (ur.).
Pariz : Atena: European Language Resources Association (ELRA), 2000. str. 523-530


CROSBI ID: 101837 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Building the Croatian-English Parallel Corpus

Autori
Tadić, Marko

Vrsta, podvrsta i kategorija rada
Poglavlja u knjigama, znanstveni

Knjiga
Second International Conference on Language Resources and Evaluation LREC2000

Urednik/ci
Gavrilidou, M., Carayannis, G., Markantonatou, S., Piperidis S..

Izdavač
European Language Resources Association (ELRA)

Grad
Pariz : Atena

Godina
2000

Raspon stranica
523-530

ISBN
0-000-00000-0

Ključne riječi
Alignment, Corpus Linguistics, Croatian, English, Parallel Corpus, XCES, XML

Sažetak
The contribution gives a survey of procedures and formats used in building the Croatian-English parallel corpus which is being collected in the Institute of Linguistics at the Philosophical Faculty, University of Zagreb. The primary text source is newspaper Croatia Weekly which has been published from the beginning of 1998 by HIKZ (Croatian Institute for Information and Culture). After quick survey of existing English-Croatian parallel corpora, the article copes with procedures involved in text conversion and text encoding, particularly the alignment. There are several recent suggestions for alignment encoding and they are elaborated. Preliminary statistics on numbers of S and W elements in each language is given at the end of the article.

Izvorni jezik
Engleski

Znanstvena područja
Filologija



POVEZANOST RADA


Projekti:
130718

Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Marko Tadić (autor)


Citiraj ovu publikaciju:

Tadić, Marko
Building the Croatian-English Parallel Corpus // Second International Conference on Language Resources and Evaluation LREC2000 / Gavrilidou, M., Carayannis, G., Markantonatou, S., Piperidis S.. (ur.).
Pariz : Atena: European Language Resources Association (ELRA), 2000. str. 523-530
Tadić, M. (2000) Building the Croatian-English Parallel Corpus. U: Gavrilidou, M., Carayannis, G., Markantonatou, S., Piperidis S.. (ur.) Second International Conference on Language Resources and Evaluation LREC2000. Pariz : Atena, European Language Resources Association (ELRA), str. 523-530.
@inbook{inbook, author = {Tadi\'{c}, Marko}, year = {2000}, pages = {523-530}, keywords = {Alignment, Corpus Linguistics, Croatian, English, Parallel Corpus, XCES, XML}, isbn = {0-000-00000-0}, title = {Building the Croatian-English Parallel Corpus}, keyword = {Alignment, Corpus Linguistics, Croatian, English, Parallel Corpus, XCES, XML}, publisher = {European Language Resources Association (ELRA)}, publisherplace = {Pariz : Atena} }
@inbook{inbook, author = {Tadi\'{c}, Marko}, year = {2000}, pages = {523-530}, keywords = {Alignment, Corpus Linguistics, Croatian, English, Parallel Corpus, XCES, XML}, isbn = {0-000-00000-0}, title = {Building the Croatian-English Parallel Corpus}, keyword = {Alignment, Corpus Linguistics, Croatian, English, Parallel Corpus, XCES, XML}, publisher = {European Language Resources Association (ELRA)}, publisherplace = {Pariz : Atena} }




Contrast
Increase Font
Decrease Font
Dyslexic Font