Pregled bibliografske jedinice broj: 1203043
Corpus linguistics for low-density varieties. Minority languages and corpus-based morphological investigations
Corpus linguistics for low-density varieties. Minority languages and corpus-based morphological investigations // Corpus, (2022), 23; 1-25 doi:10.4000/corpus.7345 (međunarodna recenzija, članak, znanstveni)
CROSBI ID: 1203043 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Corpus linguistics for low-density varieties.
Minority languages and corpus-based morphological
investigations
Autori
Gaeta, Livio ; Angster, Marco ; Cioffi, Raffaele ; Bellante, Marco
Izvornik
Corpus (1638-9808)
(2022), 23;
1-25
Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni
Ključne riječi
cultural heritage, minority languages, language documentation, verb prefixes, clitic pronouns, inflectional verb classes
Sažetak
Corpus linguistics grew up in the domain of written (and literary) varieties, while its recent methodological revolution is due to the computer-assisted capacity of elaborating massive amounts of text data. On the other hand, the so-called ‘low- density varieties’, including spoken varieties as well as varieties spoken in minority communities, have been confined to a rather marginal role. Among others, this is due to the technical problems connected to the scarce degree of normalization in linguistic –including graphemic– terms, as well as to the scarcity of language resources for automatic processing. In this paper, we will exploit the possibilities opened by corpus linguistics for acquiring and analyzing the textual patrimony of the Walser German communities of Piedmont and Aosta Valley. The varieties of Highest Alemannic spoken there, dramatically exposed to language decay, provide a limited but significant amount of data, which is accompanied by a substantial lexical documentation due to the active collaboration of the speakers’ communities in collecting and compiling local dictionaries. After briefly introducing our archive and discussing the peculiar solutions adopted for the construction of the platform, we will also present corpus-based morphological investigations regarding the representation of verbal prefixes, of the clitic group, as well as of the inflectional behaviour of verb classes.
Izvorni jezik
Engleski
Znanstvena područja
Filologija