Bibliographic record number: 913277

Dealing with data sparseness in SMT with factored models and morphological expansion: a case study on Croatian


Sanchez-Cartagena, Victor M.; Ljubešić, Nikola; Klubička, Filip
Dealing with data sparseness in SMT with factored models and morphological expansion: a case study on Croatian // Baltic Journal of Modern Computing, 4 (2016), 2; 354-360 (international peer review, article, scientific)


CROSBI ID: 913277

Title
Dealing with data sparseness in SMT with factored models and morphological expansion: a case study on Croatian

Authors
Sanchez-Cartagena, Victor M. ; Ljubešić, Nikola ; Klubička, Filip

Source
Baltic Journal of Modern Computing (2255-8942) 4 (2016), 2; 354-360

Type, subtype and category of work
Journal papers, article, scientific

Keywords
data sparseness, factored translation models, morphological expansion

Abstract
This paper describes our experience using available linguistic resources for Croatian in order to address data sparseness when building an English-to-Croatian general-domain phrase-based statistical machine translation system. We report the results obtained with factored translation models and morphological expansion, highlight the impact of the algorithm used for tagging the corpora, and show that the improvement brought by these methods is compatible with the application of data selection on out-of-domain parallel corpora.

Original language
English

Scientific fields
Information and communication sciences



RELATED RECORDS


Institutions:
Filozofski fakultet, Zagreb

Profiles:

Nikola Ljubešić (author)

Cite this publication:

Sanchez-Cartagena, Victor M.; Ljubešić, Nikola; Klubička, Filip
Dealing with data sparseness in SMT with factored models and morphological expansion: a case study on Croatian // Baltic Journal of Modern Computing, 4 (2016), 2; 354-360 (international peer review, article, scientific)
Sanchez-Cartagena, V., Ljubešić, N. & Klubička, F. (2016) Dealing with data sparseness in SMT with factored models and morphological expansion: a case study on Croatian. Baltic Journal of Modern Computing, 4 (2), 354-360.
@article{article,
  author = {Sanchez-Cartagena, Victor M. and Ljube\v{s}i\'{c}, Nikola and Klubi\v{c}ka, Filip},
  title = {Dealing with data sparseness in SMT with factored models and morphological expansion: a case study on Croatian},
  journal = {Baltic Journal of Modern Computing},
  year = {2016},
  volume = {4},
  number = {2},
  pages = {354-360},
  issn = {2255-8942},
  keywords = {data sparseness, factored translation models, morphological expansion}
}

The journal is indexed in:


  • Web of Science Core Collection (WoSCC)
    • Emerging Sources Citation Index (ESCI)




