Nalazite se na CroRIS probnoj okolini. Ovdje evidentirani podaci neće biti pohranjeni u Informacijskom sustavu znanosti RH. Ako je ovo greška, CroRIS produkcijskoj okolini moguće je pristupi putem poveznice www.croris.hr
izvor podataka: crosbi !

Merging Comparable Data Sources for the Discrimination of Similar Languages: The DSL Corpus Collection (CROSBI ID 660057)

Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija

Tan, Liling ; Zampieri. Marcos ; Ljubešić, Nikola ; Tiedemann, Jörg Merging Comparable Data Sources for the Discrimination of Similar Languages: The DSL Corpus Collection // Proceedings of the 7th Workshop on Building and Using Comparable Corpora (BUCC). Reykjavík: European Language Resources Association (ELRA), 2014. str. 20-24

Podaci o odgovornosti

Tan, Liling ; Zampieri. Marcos ; Ljubešić, Nikola ; Tiedemann, Jörg

engleski

Merging Comparable Data Sources for the Discrimination of Similar Languages: The DSL Corpus Collection

This paper presents the compilation of the DSL corpus collection created for the DSL (Discriminating Similar Languages) shared task to be held at the VarDial workshop at COLING 2014. The DSL corpus collection were merged from three comparable corpora to provide a suitable dataset for automatic classification to discriminate similar languages and language varieties. Along with the description of the DSL corpus collection we also present results of baseline discrimination experiments reporting performance of up to 87.4% accuracy.

comparable corpora, similar languages, language discrimination

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

nije evidentirano

Podaci o prilogu

20-24.

2014.

objavljeno

Podaci o matičnoj publikaciji

Proceedings of the 7th Workshop on Building and Using Comparable Corpora (BUCC)

Reykjavík: European Language Resources Association (ELRA)

Podaci o skupu

7th Workshop on Building and Using Comparable Corpora (BUCC)

predavanje

27.05.2014-27.05.2014

Reykjavík, Island

Povezanost rada

nije evidentirano

Indeksiranost