Filtering noisy instances and outliers (CROSBI ID 26578)
Prilog u knjizi | izvorni znanstveni rad
Podaci o odgovornosti
Gamberger, Dragan ; Lavrač, Nada
engleski
Filtering noisy instances and outliers
Instance selection methods are aimed at finding a representative data subset that can replace the original dataset but still provide enough information to solve a given data mining task. If instance selection is done by sampling, the sample should preferably exclude noisy instances and outliers. This chapter presents methods for noise and outlier detection that can be incorporated into sampling as filters for data cleaning. The chapter presents the following filtering algorithms: a saturation filter, a classification filter, a combined classification-saturation filter, and a consensus saturation filter. The distinguishing feature of the novel consensus saturation filter is its high reliability which is due to the multiple detection of outliers and/or noisy instances. Medical evaluation in the problem of coronary artery disease diagnosis shows that the detected instances are indeed noisy or non-typical class representatives.
noise, outliers, saturation filter
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
nije evidentirano
Podaci o prilogu
375-394-x.
objavljeno
Podaci o knjizi
Instance Selection and Construction for Data Mining
Liu, Huan. ; Motoda, Hiroshi.
Boston (MA): Kluwer Academic Publishers
2001.
0-7923-7209-3