Bibliographic record no. 433788

Statistical Language Models for Croatian Weather-domain Corpus


Načinović, Lucia; Martinčić-Ipšić, Sanda; Ipšić, Ivo
Statistical Language Models for Croatian Weather-domain Corpus // InFuture 2009 / Stančić, Hrvoje; Seljan, Sanja; Bawden, David; Lasić-Lazić, Jadranka; Slavić, Aida (eds.).
Zagreb: Vjesnik, 2009, pp. 333-340 (lecture, international peer review, full paper (in extenso), scientific)


CROSBI ID: 433788

Title
Statistical Language Models for Croatian Weather-domain Corpus

Authors
Načinović, Lucia; Martinčić-Ipšić, Sanda; Ipšić, Ivo

Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific

ISBN
978-953-175-355-5

Conference
InFuture 2009

Place and date
Zagreb, Croatia, 4-6 November 2009

Type of participation
Lecture

Type of review
International peer review

Keywords
statistical language modelling; n-gram; smoothing methods; Croatian weather-domain corpus

Abstract
Statistical language modelling estimates the regularities in natural languages. Language models are used in speech recognition, machine translation and other speech and language technology applications. In this paper we present a procedure for building language models for the Croatian weather-domain corpus. Different types of n-gram statistical language models and smoothing methods for language modelling are presented. The models are compared in terms of their estimated perplexity.
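
As an illustration of the terms used in the abstract, the following is a minimal sketch of a bigram language model with add-one (Laplace) smoothing, evaluated by perplexity. It is not the authors' procedure: the record does not say which n-gram orders, smoothing methods or toolkit the paper uses, so the choice of add-one smoothing, the function names and the three-sentence toy corpus are all assumptions made for this example.

```python
import math
from collections import Counter

def train_bigram(sentences):
    """Count unigrams and bigrams over tokenized sentences with boundary markers."""
    unigrams, bigrams = Counter(), Counter()
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        unigrams.update(padded)
        bigrams.update(zip(padded, padded[1:]))
    return unigrams, bigrams

def bigram_prob(prev, word, unigrams, bigrams, vocab_size):
    """Add-one (Laplace) smoothed estimate of P(word | prev)."""
    return (bigrams[(prev, word)] + 1) / (unigrams[prev] + vocab_size)

def perplexity(sentences, unigrams, bigrams, vocab_size):
    """Perplexity = 2 ** (negative average log2 probability per predicted token)."""
    log_prob, n_tokens = 0.0, 0
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        for prev, word in zip(padded, padded[1:]):
            log_prob += math.log2(bigram_prob(prev, word, unigrams, bigrams, vocab_size))
            n_tokens += 1
    return 2 ** (-log_prob / n_tokens)

# Hypothetical toy corpus (invented for illustration, not taken from the paper).
train = [["danas", "sunčano"], ["sutra", "kiša"], ["danas", "kiša"]]
unigrams, bigrams = train_bigram(train)
vocab_size = len(unigrams)  # distinct types, including the boundary markers

seen = perplexity(train, unigrams, bigrams, vocab_size)
unseen = perplexity([["kiša", "danas"]], unigrams, bigrams, vocab_size)
print(f"perplexity on training sentences: {seen:.2f}")
print(f"perplexity on an unseen word order: {unseen:.2f}")
```

Lower perplexity means the model finds the text less surprising, which is why it can serve as a yardstick for comparing models and smoothing methods. The sketch does not handle out-of-vocabulary words separately; an unseen word simply receives the smoothed minimum probability.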

Original language
English

Scientific fields
Computing, Information and communication sciences



RELATED PROJECTS AND INSTITUTIONS


Projects:
009-0361935-0852 - Govorne tehnologije (Speech Technologies)
318-0361935-0852 - Govorne tehnologije (Speech Technologies) (Ipšić, Ivo, MZOS) (CroRIS)

Institutions:
Faculty of Informatics and Digital Technologies, Rijeka


Cite this publication:

Načinović, L., Martinčić-Ipšić, S. & Ipšić, I. (2009) Statistical Language Models for Croatian Weather-domain Corpus. In: Stančić, H., Seljan, S., Bawden, D., Lasić-Lazić, J. & Slavić, A. (eds.) InFuture 2009.
@article{article, author = {Na\v{c}inovi\'{c}, Lucia and Martin\v{c}i\'{c}-Ip\v{s}i\'{c}, Sanda and Ip\v{s}i\'{c}, Ivo}, year = {2009}, pages = {333-340}, keywords = {statistical language modelling, n-gram, smoothing methods, Croatian weather-domain corpus}, isbn = {978-953-175-355-5}, title = {Statistical Language Models for Croatian Weather-domain Corpus}, publisher = {Vjesnik}, publisherplace = {Zagreb, Hrvatska}}



