Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 439599

First Steps Toward Developing a System for Terminology Extraction


Bago, Petra; Boras, Damir; Ljubešić, Nikola
First Steps Toward Developing a System for Terminology Extraction // INFuture2009: Digital Resources and Knowledge Sharing / Stančić, Hrvoje ; Seljan, Sanja ; Bawden, David ; Lasić-Lazić, Jadranka ; Slavić, Aida (ur.).
Zagreb: Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu, 2009. str. 197-206 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)


CROSBI ID: 439599 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
First Steps Toward Developing a System for Terminology Extraction

Autori
Bago, Petra ; Boras, Damir ; Ljubešić, Nikola

Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni

Izvornik
INFuture2009: Digital Resources and Knowledge Sharing / Stančić, Hrvoje ; Seljan, Sanja ; Bawden, David ; Lasić-Lazić, Jadranka ; Slavić, Aida - Zagreb : Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu, 2009, 197-206

ISBN
978-953-175-355-5

Skup
2nd International Conference “The Future of Information Sciences: INFuture2009 – Digital Resources and Knowledge Sharing”

Mjesto i datum
Zagreb, Hrvatska, 04.11.2009. - 06.11.2009

Vrsta sudjelovanja
Predavanje

Vrsta recenzije
Međunarodna recenzija

Ključne riječi
terminology extraction; data sample; log-likelihood ratio test

Sažetak
The aim of this paper is to describe first steps in developing a system for terminology extraction. First a data sample is built from synopses of doctoral theses at the Faculty of Humanities and Social Sciences, University of Zagreb, accepted in the period from 2004 to 2009 written mostly in Croatian language. Data sample consists of 420 documents and 338, 706 tokens. A small sample was manually tagged for terminology to be used in an initial experiment. The approach for terminology extraction is knowledge-driven and consists of differential analysis of reference and domain-specific corpora. Specific method used is log-likelihood ratio test. Experiment deals with different reference corpora and linguistic pre-processing. First results are promising. Further research guidelines are discussed.

Izvorni jezik
Engleski

Znanstvena područja
Informacijske i komunikacijske znanosti



POVEZANOST RADA


Projekti:
130-1301679-1380 - Hrvatska rječnička baština i hrvatski europski identitet (Boras, Damir, MZOS ) ( CroRIS)

Ustanove:
Filozofski fakultet, Zagreb

Profili:

Avatar Url Petra Bago (autor)

Avatar Url Nikola Ljubešić (autor)

Avatar Url Damir Boras (autor)


Citiraj ovu publikaciju:

Bago, Petra; Boras, Damir; Ljubešić, Nikola
First Steps Toward Developing a System for Terminology Extraction // INFuture2009: Digital Resources and Knowledge Sharing / Stančić, Hrvoje ; Seljan, Sanja ; Bawden, David ; Lasić-Lazić, Jadranka ; Slavić, Aida (ur.).
Zagreb: Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveučilišta u Zagrebu, 2009. str. 197-206 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
Bago, P., Boras, D. & Ljubešić, N. (2009) First Steps Toward Developing a System for Terminology Extraction. U: Stančić, H., Seljan, S., Bawden, D., Lasić-Lazić, J. & Slavić, A. (ur.)INFuture2009: Digital Resources and Knowledge Sharing.
@article{article, author = {Bago, Petra and Boras, Damir and Ljube\v{s}i\'{c}, Nikola}, year = {2009}, pages = {197-206}, keywords = {terminology extraction, data sample, log-likelihood ratio test}, isbn = {978-953-175-355-5}, title = {First Steps Toward Developing a System for Terminology Extraction}, keyword = {terminology extraction, data sample, log-likelihood ratio test}, publisher = {Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveu\v{c}ili\v{s}ta u Zagrebu}, publisherplace = {Zagreb, Hrvatska} }
@article{article, author = {Bago, Petra and Boras, Damir and Ljube\v{s}i\'{c}, Nikola}, year = {2009}, pages = {197-206}, keywords = {terminology extraction, data sample, log-likelihood ratio test}, isbn = {978-953-175-355-5}, title = {First Steps Toward Developing a System for Terminology Extraction}, keyword = {terminology extraction, data sample, log-likelihood ratio test}, publisher = {Odsjek za informacijske i komunikacijske znanosti Filozofskog fakulteta Sveu\v{c}ili\v{s}ta u Zagrebu}, publisherplace = {Zagreb, Hrvatska} }




Contrast
Increase Font
Decrease Font
Dyslexic Font