Bibliographic record no.: 634807

From Digitisation Process to Terminological Digital Resources


Seljan, Sanja; Dunđer, Ivan; Gašpar, Angelina
From Digitisation Process to Terminological Digital Resources // Proceedings of the 36th International Convention MIPRO 2013 / Biljanović, P. (ed.).
Rijeka: Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2013. pp. xx-xx (lecture, international peer review, full paper (in extenso), scientific)


CROSBI ID: 634807

Title
From Digitisation Process to Terminological Digital Resources

Authors
Seljan, Sanja ; Dunđer, Ivan ; Gašpar, Angelina

Type, subtype and category of work
Papers in conference proceedings, full paper (in extenso), scientific

Source
Proceedings of the 36th International Convention MIPRO 2013 / Biljanović, P. - Rijeka : Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2013, pp. xx-xx

Conference
International Convention on Information and Communication Technology, Electronics and Microelectronics MIPRO

Place and date
Opatija, Croatia, 20.05.2013. - 24.05.2013.

Type of participation
Lecture

Type of review
International peer review

Keywords
digitization ; term and collocation extraction ; Multi-Word Unit (MWU) ; statistical and language approaches ; evaluation ; English ; Croatian

Abstract
Monolingual and multilingual terminology and collocation bases are valuable additional electronic resources that can be used in further research, in written communication and in everyday communication. Building such resources can be supported by terminology extraction tools that rely on statistical or language-based approaches, or on a hybrid model, but it still requires considerable human expertise in evaluation and final compilation. The paper describes the whole process: from digitisation of printed material, OCR techniques, sentence alignment and creation of translation memories, up to terminology extraction and evaluation. The performance of the tools and the applied methodology is assessed through the standard statistical measures of precision, recall and F-measure. Experimental results are presented, deficiencies of the semi-automatic statistical and linguistic system are highlighted, and recommendations for further research are suggested.
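
The abstract names precision, recall and F-measure as the evaluation measures. As a minimal sketch of how such measures are commonly computed for term extraction, the Python example below compares a list of extracted term candidates against a reference (gold-standard) list. The function name `evaluate_terms`, the case-insensitive exact matching, and the sample term lists are illustrative assumptions; they are not taken from the paper and do not reproduce its data or results.

```python
def evaluate_terms(extracted, gold):
    """Return (precision, recall, f_measure) for extracted term candidates.

    Matching is exact and case-insensitive; this is an assumption of the
    sketch, not necessarily the matching rule used in the paper.
    """
    extracted_set = {term.lower() for term in extracted}
    gold_set = {term.lower() for term in gold}

    # Candidates that also appear in the reference list.
    true_positives = len(extracted_set & gold_set)

    precision = true_positives / len(extracted_set) if extracted_set else 0.0
    recall = true_positives / len(gold_set) if gold_set else 0.0

    # Balanced F-measure (harmonic mean of precision and recall).
    if precision + recall == 0:
        f_measure = 0.0
    else:
        f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


# Hypothetical sample data, for illustration only.
extracted = [
    "translation memory",
    "sentence alignment",
    "printed material",
    "term extraction",
]
gold = [
    "translation memory",
    "sentence alignment",
    "term extraction",
    "optical character recognition",
    "multi-word unit",
]

precision, recall, f_measure = evaluate_terms(extracted, gold)
print(f"precision={precision:.2f} recall={recall:.2f} F={f_measure:.2f}")
```

With these sample lists, three of the four candidates are in the reference list, which contains five terms, so precision is 3/4 and recall is 3/5.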

Original language
English

Scientific fields
Information and communication sciences



RELATED TO THE WORK


Projects:
130-1300646-0909 - Informacijska tehnologija u prevođenju hrvatskoga i e-učenju jezika [Information technology in the translation of Croatian and in language e-learning] (Seljan, Sanja, MZOS)

Institutions:
Filozofski fakultet, Zagreb [Faculty of Humanities and Social Sciences, Zagreb]

Profiles:

Angelina Gašpar (author)

Sanja Seljan (author)

Ivan Dunđer (author)


Cite this publication:

Seljan, Sanja; Dunđer, Ivan; Gašpar, Angelina
From Digitisation Process to Terminological Digital Resources // Proceedings of the 36th International Convention MIPRO 2013 / Biljanović, P. (ed.).
Rijeka: Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2013. pp. xx-xx (lecture, international peer review, full paper (in extenso), scientific)
Seljan, S., Dunđer, I. & Gašpar, A. (2013) From Digitisation Process to Terminological Digital Resources. In: Biljanović, P. (ed.) Proceedings of the 36th International Convention MIPRO 2013.
@inproceedings{article,
  author = {Seljan, Sanja and Dun{\dj}er, Ivan and Ga\v{s}par, Angelina},
  editor = {Biljanovi\'{c}, P.},
  title = {From Digitisation Process to Terminological Digital Resources},
  booktitle = {Proceedings of the 36th International Convention MIPRO 2013},
  year = {2013},
  pages = {xx-xx},
  keywords = {digitization, term and collocation extraction, Multi-Word Unit (MWU), statistical and language approaches, evaluation, English, Croatian},
  publisher = {Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO},
  publisherplace = {Opatija, Hrvatska}
}



