Pregled bibliografske jedinice broj: 101936
Procedures in Building the Croatian-English Parallel Corpus
Procedures in Building the Croatian-English Parallel Corpus // International journal of corpus linguistics, 6 (2001), special issue; 107-123 (podatak o recenziji nije dostupan, članak, znanstveni)
CROSBI ID: 101936 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Procedures in Building the Croatian-English Parallel Corpus
Autori
Tadić, Marko
Izvornik
International journal of corpus linguistics (1384-6655) 6
(2001), Special issue;
107-123
Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni
Ključne riječi
corpus linguistics; parallel corpora; Croatian language; English language; corpus encoding; alignment; CES; XML
Sažetak
This contribution gives a survey of procedures and formats used in building the Croatian-English parallel corpus which is being collected at the Institute of Linguistics at the Philosophical Faculty, University of Zagreb. The primary text source is the newspaper Croatia Weekly which has been published from the beginning of 1998 by HIKZ (Croatian Institute for Information and Culture). After a quick survey of existing English-Croatian parallel corpora, the article copes with procedures involved in text conversion and text encoding, particularly the alignment. There are several recent suggestions for alignment encoding, and they are listed and elaborated at the end of the article.
Izvorni jezik
Engleski
Znanstvena područja
Filologija
Citiraj ovu publikaciju:
Uključenost u ostale bibliografske baze podataka::
- Linguistics and Language Behavior Abstracts
- Social Sciences Index
- Sociological Abstracts