Pregled bibliografske jedinice broj: 462865
Generating Data Quality Rules and Integration into ETL Process
Generating Data Quality Rules and Integration into ETL Process // ACM Twelfth International Workshop on Data Warehousing and OLAP
Hong Kong, Kina, 2009. str. 65-72 (ostalo, međunarodna recenzija, cjeloviti rad (in extenso), znanstveni)
CROSBI ID: 462865 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Generating Data Quality Rules and Integration into ETL Process
Autori
Rodić, Jasna ; Baranović, Mirta
Vrsta, podvrsta i kategorija rada
Radovi u zbornicima skupova, cjeloviti rad (in extenso), znanstveni
Izvornik
ACM Twelfth International Workshop on Data Warehousing and OLAP
/ - , 2009, 65-72
Skup
ACM Twelfth International Workshop on Data Warehousing and OLAP
Mjesto i datum
Hong Kong, Kina, 11.2009
Vrsta sudjelovanja
Ostalo
Vrsta recenzije
Međunarodna recenzija
Ključne riječi
Data quality; rules; generator; metadata; oracle
Sažetak
Many data quality projects are integrated into datawarehouse projects without enough time allocated for the data quality part, which leads to a need for a quicker data quality process implementation that can be easily adopted as the first stage of data warehouse implementation. We will see that many data quality rules an be implemented in a similar way, and thus generated based on metadata tables that tore information about the rules.These generated rules are then used to check data in designated tables and mark erroneous records, or to do certain updates of invalid data. We will also store information about the rules violations in order to provide analysis of such data. This could give a significant insight into our source systems. Entire data quality process will be integrated into ETL process in order to achieve load of data warehouse that is as automated, as correct and as quick as possible. Only small number of records would be left for manual inspection and reprocessing.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Projekti:
036-0361983-2012 - Semantička integracija heterogenih izvorišta podataka (Baranović, Mirta, MZO ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb
Profili:
Mirta Baranović
(autor)