Napredna pretraga

Pregled bibliografske jedinice broj: 304898

Automating the Schema Matching Process for Heterogeneous Data Warehouses


Banek, Marko; Vrdoljak, Boris; Tjoa, A Min; Skočir, Zoran
Automating the Schema Matching Process for Heterogeneous Data Warehouses // Lecture Notes in Computer Science, 4654 (2007), 45-54 (međunarodna recenzija, članak, znanstveni)


Naslov
Automating the Schema Matching Process for Heterogeneous Data Warehouses

Autori
Banek, Marko ; Vrdoljak, Boris ; Tjoa, A Min ; Skočir, Zoran

Izvornik
Lecture Notes in Computer Science (0302-9743) 4654 (2007); 45-54

Vrsta, podvrsta i kategorija rada
Radovi u časopisima, članak, znanstveni

Ključne riječi
Data warehouse; data warehouse integration; data warehouse federation; schema matching; multidimensional model; semantic similarity; mapping; bipartite graphs

Sažetak
A federated data warehouse is a logical integration of data warehouses applicable when physical integration is impossible due to privacy policy or legal restrictions. In order to enable the translation of queries in a federated approach, schemas of the federated and the local warehouses must be matched. In this paper we present a procedure that enables the matching process for schema structures specific to the multidimensional model of data warehouses: facts, measures, dimensions, aggregation levels and dimensional attributes. Similarities between warehouse-specific structures are computed by using linguistic and structural comparison, where calculated values are used to create necessary mappings. We present restriction rules and recommendations for aggregation level matching, which builds the most complex part of the process. A software implementation of the entire process is provided in order to perform its verification, as well as to determine the proper selection metric for mapping different multidimensional structures.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo



POVEZANOST RADA


Projekt / tema
036-0361983-2012 - Semantička integracija heterogenih izvorišta podataka (Mirta Baranović, )
036-0362027-1638 - Umrežena ekonomija (Zoran Skočir, )

Ustanove
Fakultet elektrotehnike i računarstva, Zagreb

Časopis indeksira:


  • Web of Science Core Collection (WoSCC)
    • SCI-EXP, SSCI i/ili A&HCI
  • Scopus