Pregled bibliografske jedinice broj: 402275
Identification of persons and business subjects in text documents based on lexical analysis and scoring system
Identification of persons and business subjects in text documents based on lexical analysis and scoring system // MIPRO 2009, Proceedings Vol. III, CTS & CIS / Bogunović, Nikola ; Ribarić, Slobodan (ur.).
Rijeka: Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2009. str. 35-38 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 402275 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Identification of persons and business subjects in text documents based on lexical analysis and scoring system
Autori
Lončar, Goran ; Bogunović, Nikola
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
MIPRO 2009, Proceedings Vol. III, CTS & CIS
/ Bogunović, Nikola ; Ribarić, Slobodan - Rijeka : Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2009, 35-38
ISBN
978-953-233-045-8
Skup
MIPRO 2009, 32nd International convention on information and communication technology, electronics and microelectronics
Mjesto i datum
Opatija, Hrvatska, 25.05.2009. - 29.05.2009
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
data mining; information retrieval; lexical analysis
Sažetak
The amount of text documents and textual media news that is created every day on the Internet is growing rapidly, making it very difficult to find useful information effectively. The paper presents a system that identifies persons and business subjects in newly published text documents and matches them with persons and businesses previously stored in a database. The implemented system employs lexical analysis and scoring algorithm tagging the input documents with subjects' id from the database and enabling easy and effective search. Consequently, only the search object and the surrounding context is displayed to the end user. The system is currently successfully used in a web portal.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Projekti:
036-0362980-1921 - Računalne okoline za sveprisutne raspodijeljene sustave (Srbljić, Siniša, MZO ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb