Pregled bibliografske jedinice broj: 583289
Effects of Data Anonymization on the Data Mining Results
Effects of Data Anonymization on the Data Mining Results // 35. International Convention MIPRO/miproBIS
Opatija: Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2012. str. 1965-1969 (predavanje, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 583289 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Effects of Data Anonymization on the Data Mining Results
Autori
Buratović, Ines ; Miličević, Mario ; Žubrinić, Krunoslav
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
35. International Convention MIPRO/miproBIS
/ - Opatija : Hrvatska udruga za informacijsku i komunikacijsku tehnologiju, elektroniku i mikroelektroniku - MIPRO, 2012, 1965-1969
ISBN
978-953-233-069-4
Skup
35. International Convention MIPRO/miproBIS
Mjesto i datum
Opatija, Hrvatska, 21.05.2012. - 25.05.2012
Vrsta sudjelovanja
Predavanje
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
Data anonymization ; k-anonymization ; data mining
Sažetak
This article examines the possibility of publication of students’ data, such as secondary school success, state graduation exam scores and success during their first year of university study for analyses. In order to discover data patterns and relationships using data mining techniques, the data must be released in the form of original tuples, instead of pre-aggregated statistics. These records contain sensitive and even confidential personal information, which implies significant privacy concerns regarding the disclosure of such data. Removing explicit identifiers prior to data release cannot guarantee anonymity, since the datasets still contain information that can be used for linking the released records with publicly available collections that include students’ identities. One of the privacy preserving techniques proposed in the literature is the k-anonymization. The process of anonymizing a data set usually involves generalizing data records and, consequently, it incurs loss of relevant information. In the primary research undertaken in the University of Dubrovnik’s students’ database the effect of anonymization has been measured by comparing the results of mining the original data set with the results of mining the altered data set to determine if it is possible to use anonymized data for research purposes.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo, Informacijske i komunikacijske znanosti
POVEZANOST RADA
Projekti:
275-0000000-3260 - Integralna kvaliteta usluge komunikacijskih i informacijskih sustava (Lipovac, Vladimir, MZO ) ( CroRIS)
Ustanove:
Sveučilište u Dubrovniku
Citiraj ovu publikaciju:
Časopis indeksira:
- Scopus