Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 857681

Croatian Adult Spoken Language Corpus (HrAL)


Kuvač Kraljević, Jelena; Hržica, Gordana
Croatian Adult Spoken Language Corpus (HrAL) // Fluminensia, 28 (2016), 2; 87-102 (međunarodna recenzija, pregledni rad, znanstveni)


CROSBI ID: 857681 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Croatian Adult Spoken Language Corpus (HrAL)

Autori
Kuvač Kraljević, Jelena ; Hržica, Gordana

Izvornik
Fluminensia (0353-4642) 28 (2016), 2; 87-102

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, pregledni rad, znanstveni

Ključne riječi
Croatian Adult Spoken Language Corpus (HrAL) ; language sampling ; spontaneous speech corpora.

Sažetak
Interest in spoken-language corpora has increased over the last two decades leading to the development of new corpora and new facets about spoken language. These types of corpora represent the most comprehensive data source about language of ordinary speakers. Such corpora are based on spontaneous, unscripted speech defined by varieties of styles, registers and dialects. The aim of this paper is to present the Croatian Adult Spoken Language Corpus (HrAL), its structure and its possible application in different linguistic subfields. HrAL was built by sampling spontaneous conversations among 617 speakers from all Croatian counties, and it comprises more than 250 000 tokens and more than 100 000 types. Data were collected in three time slots: from 2010 to 2012, from 2014 to 2015 and during 2016. HrAL is today available within TalkBank, a large database of spoken-language corpora covering different languages (https://talkbank.org), in the Conversational Analyses corpora within subsection Conversational Banks. Data were transcribed, coded and segmented using the transcription format Codes for Human Analysis of Transcripts (CHAT) and the Computerised Language Analysis (CLAN) suite of programmes within the TalkBank toolkit. Speech streams were segmented into communication units (C-units) based on syntactic criteria. Most transcripts were linked to the source audio. The TalkBank is public free, i.e. all data stored in it can be shared by the wider community according to the basic rules of TalkBank. HrAL provides information about spoken grammar and lexicon, discourse skills, error production and productivity in general. It may be useful for sociolinguistic research and studies of synchronic language changes in Croatian.

Izvorni jezik
Engleski



POVEZANOST RADA


Projekti:
HRZZ-UIP-2013-11-2421 - Jezična obrada u odraslih govornika (ALP) (Kuvač Kraljević, Jelena, HRZZ - 2013-11) ( CroRIS)

Ustanove:
Edukacijsko-rehabilitacijski fakultet, Zagreb

Profili:

Avatar Url Jelena Kuvač (autor)

Avatar Url Gordana Hržica (autor)

Poveznice na cjeloviti tekst rada:

hrcak.srce.hr Hrčak

Citiraj ovu publikaciju:

Kuvač Kraljević, Jelena; Hržica, Gordana
Croatian Adult Spoken Language Corpus (HrAL) // Fluminensia, 28 (2016), 2; 87-102 (međunarodna recenzija, pregledni rad, znanstveni)
Kuvač Kraljević, J. & Hržica, G. (2016) Croatian Adult Spoken Language Corpus (HrAL). Fluminensia, 28 (2), 87-102.
@article{article, author = {Kuva\v{c} Kraljevi\'{c}, Jelena and Hr\v{z}ica, Gordana}, year = {2016}, pages = {87-102}, keywords = {Croatian Adult Spoken Language Corpus (HrAL), language sampling, spontaneous speech corpora.}, journal = {Fluminensia}, volume = {28}, number = {2}, issn = {0353-4642}, title = {Croatian Adult Spoken Language Corpus (HrAL)}, keyword = {Croatian Adult Spoken Language Corpus (HrAL), language sampling, spontaneous speech corpora.} }
@article{article, author = {Kuva\v{c} Kraljevi\'{c}, Jelena and Hr\v{z}ica, Gordana}, year = {2016}, pages = {87-102}, keywords = {Croatian Adult Spoken Language Corpus (HrAL), language sampling, spontaneous speech corpora.}, journal = {Fluminensia}, volume = {28}, number = {2}, issn = {0353-4642}, title = {Croatian Adult Spoken Language Corpus (HrAL)}, keyword = {Croatian Adult Spoken Language Corpus (HrAL), language sampling, spontaneous speech corpora.} }

Časopis indeksira:


  • Web of Science Core Collection (WoSCC)
    • Emerging Sources Citation Index (ESCI)
  • Scopus





Contrast
Increase Font
Decrease Font
Dyslexic Font