Analysis of Corpus-based Word-Order Typological Methods (CROSBI ID 733943)
Prilog sa skupa u zborniku | izvorni znanstveni rad | međunarodna recenzija
Podaci o odgovornosti
Alves, Diego ; Bekavac, Božo ; Zeman, Daniel ; Tadić, Marko
engleski
Analysis of Corpus-based Word-Order Typological Methods
This article presents a comparative analysis of four different syntactic typological approaches applied to 20 different languages. We compared three specific quantitative methods, using parallel CoNLL-U corpora, to the classification obtained via syntactic features provided by a typological database (lang2vec). First, we analyzed the Marsagram linear approach which consists of extracting the frequency word-order patterns regarding the position of components inside syntactic nodes. The second approach considers the relative position of heads and dependents, and the third is based simply on the relative position of verbs and objects. From the results, it was possible to observe that each method provides different language clusters which can be compared to the classic genealogical classification (the lang2vec and the head and dependent methods being the closest). As different word-order phenomena are considered in these specific typological strategies, each one provides a different angle of analysis to be applied according to the precise needs of the researchers.
dependency parsing ; typology ; multilingualism
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
36-46.
2023.
objavljeno
Podaci o matičnoj publikaciji
Proceedings of the Sixth Workshop on Universal Dependencies (UDW, GURT/SyntaxFest 2023)
Grobol, Loïc ; Tyers, Francis
Washington (MD): Association for Computational Linguistics (ACL)
978-1-959429-34-0
Podaci o skupu
Sixth Workshop on Universal Dependencies (UDW, GURT/SyntaxFest 2023)
predavanje
09.03.2023-12.03.2023
Washington D.C., Sjedinjene Američke Države