Pregled bibliografske jedinice broj: 279281
Genre Document Classification Using Flexible Length Phrares
Genre Document Classification Using Flexible Length Phrares // Proceedings of 17th International Conference on Information and Intelligent Systems / Aurer, Boris ; Bača, Miroslav (ur.).
Varaždin: Fakultet organizacije i informatike Sveučilišta u Zagrebu, 2006. str. 23-28 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 279281 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Genre Document Classification Using Flexible Length Phrares
Autori
Radošević, Danijel ; Dobša, Jasminka ; Mladenić, Dunja ; Stapić, Zlatko ; Novak, Miroslav
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
Proceedings of 17th International Conference on Information and Intelligent Systems
/ Aurer, Boris ; Bača, Miroslav - Varaždin : Fakultet organizacije i informatike Sveučilišta u Zagrebu, 2006, 23-28
Skup
17th International Conference on Information and Intelligent Systems, IIS 2006
Mjesto i datum
Varaždin, Hrvatska, 20.09.2006. - 22.09.2006
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
flexible length phrases; bag of words representation; genre classification
Sažetak
In this paper we investigate possibility of using phrases of flexible length in genre classification of textual documents as an extension to classic bag of words document representation where documents are represented using single words as features. The investigation is conducted on collection of articles from document data base collected from three different sources representing different genres: newspaper reports, abstracts of scientific articles and legal documents. The investigation includes comparison between classification results obtained by using classic bag of words representation and results obtained by using bag of words extended by flexible length phrases.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo