Pretražite po imenu i prezimenu autora, mentora, urednika, prevoditelja

Napredna pretraga

Pregled bibliografske jedinice broj: 537344

Exploring Classification Concept Drift on a Large News Text Corpus


Šilić, Artur; Dalbelo Bašić, Bojana
Exploring Classification Concept Drift on a Large News Text Corpus // Springer Lecture Notes in Computer Science, 7181 (2012), 1; 428-437 doi:10.1007/978-3-642-28604-9 (međunarodna recenzija, članak, znanstveni)


CROSBI ID: 537344 Za ispravke kontaktirajte CROSBI podršku putem web obrasca

Naslov
Exploring Classification Concept Drift on a Large News Text Corpus

Autori
Šilić, Artur ; Dalbelo Bašić, Bojana

Izvornik
Springer Lecture Notes in Computer Science (0302-9743) 7181 (2012), 1; 428-437

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
text classification; concept drift; logistic regression

Sažetak
Concept drift research has regained research interest during recent years as many applications use data sources that are changing over time. We study the classification task using logistic regression on a large news collection of 248K texts during a period of seven years. We present extrinsic methods of concept drift detection and quantification using training set formation with different windowing techniques. On our corpus, we characterize concept drift and show the overestimation of classifier performance if it is neglected. We lay out paths for future work where we plan to refine extrinsic characterization methods and investigate the drifting of learning parameters when few examples are available.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo



POVEZANOST RADA


Projekti:
036-1300646-1986 - Otkrivanje znanja u tekstnim podacima (Dalbelo-Bašić, Bojana, MZO ) ( CroRIS)

Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb

Profili:

Avatar Url Bojana Dalbelo Bašić (autor)

Avatar Url Artur Šilić (autor)

Poveznice na cjeloviti tekst rada:

doi www.springerlink.com

Citiraj ovu publikaciju:

Šilić, Artur; Dalbelo Bašić, Bojana
Exploring Classification Concept Drift on a Large News Text Corpus // Springer Lecture Notes in Computer Science, 7181 (2012), 1; 428-437 doi:10.1007/978-3-642-28604-9 (međunarodna recenzija, članak, znanstveni)
Šilić, A. & Dalbelo Bašić, B. (2012) Exploring Classification Concept Drift on a Large News Text Corpus. Springer Lecture Notes in Computer Science, 7181 (1), 428-437 doi:10.1007/978-3-642-28604-9.
@article{article, author = {\v{S}ili\'{c}, Artur and Dalbelo Ba\v{s}i\'{c}, Bojana}, year = {2012}, pages = {428-437}, DOI = {10.1007/978-3-642-28604-9}, keywords = {text classification, concept drift, logistic regression}, journal = {Springer Lecture Notes in Computer Science}, doi = {10.1007/978-3-642-28604-9}, volume = {7181}, number = {1}, issn = {0302-9743}, title = {Exploring Classification Concept Drift on a Large News Text Corpus}, keyword = {text classification, concept drift, logistic regression} }
@article{article, author = {\v{S}ili\'{c}, Artur and Dalbelo Ba\v{s}i\'{c}, Bojana}, year = {2012}, pages = {428-437}, DOI = {10.1007/978-3-642-28604-9}, keywords = {text classification, concept drift, logistic regression}, journal = {Springer Lecture Notes in Computer Science}, doi = {10.1007/978-3-642-28604-9}, volume = {7181}, number = {1}, issn = {0302-9743}, title = {Exploring Classification Concept Drift on a Large News Text Corpus}, keyword = {text classification, concept drift, logistic regression} }

Časopis indeksira:


  • Scopus


Uključenost u ostale bibliografske baze podataka::


  • Science Citation Index Expanded


Citati:





    Contrast
    Increase Font
    Decrease Font
    Dyslexic Font