Pregled bibliografske jedinice broj: 365260
Automated Integration of Heterogeneous Data Warehouse Schemas
Automated Integration of Heterogeneous Data Warehouse Schemas // International Journal of Data Warehousing and Mining, 4 (2008), 4; 1-21 (međunarodna recenzija, članak, znanstveni)
CROSBI ID: 365260 Za ispravke kontaktirajte CROSBI podršku putem web obrasca
Naslov
Automated Integration of Heterogeneous Data Warehouse Schemas
Autori
Banek, Marko ; Vrdoljak, Boris ; Tjoa, A Min ; Skočir, Zoran
Izvornik
International Journal of Data Warehousing and Mining (1548-3924) 4
(2008), 4;
1-21
Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni
Ključne riječi
Data Warehouse; Data Warehouse Integration; Data Warehouse Federation; Schema Matching; Multidimensional Model; Semantic Similarity; Mapping; Bipartite Graph
Sažetak
A federated data warehouse is a logical integration of data warehouses applicable when physical integration is impossible due to privacy policy or legal restrictions. In healthcare systems federated data warehouses are a most feasible source of data for deducing guidelines for evidence-based medicine based on data material from different participating institutions. In order to enable the translation of queries in a federated approach, schemas of the federated warehouse and the local warehouses must be matched. In this paper we present a procedure that enables the matching process for schema structures specific to the multidimensional model of data warehouses: facts, measures, dimensions, aggregation levels and dimensional attributes. Similarities between warehouse-specific structures are computed by using linguistic and structural comparison. The calculated values are used to create necessary mappings. We present restriction rules and recommendations for aggregation level matching, which builds the most complex part of the process. A software implementation of the entire process is provided in order to perform its verification, as well as to determine the proper selection metric for mapping different multidimensional structures.
Izvorni jezik
Engleski
Znanstvena područja
Računarstvo
POVEZANOST RADA
Projekti:
036-0361983-2012 - Semantička integracija heterogenih izvorišta podataka (Baranović, Mirta, MZO ) ( CroRIS)
036-0362027-1638 - Umrežena ekonomija (Skočir, Zoran, MZO ) ( CroRIS)
Ustanove:
Fakultet elektrotehnike i računarstva, Zagreb
Citiraj ovu publikaciju:
Časopis indeksira:
- Scopus