Napredna pretraga

Pregled bibliografske jedinice broj: 313761

Automating the Process of Schema Integration for Heterogeneous Data Warehouses


Banek, Marko
Automating the Process of Schema Integration for Heterogeneous Data Warehouses 2007., doktorska disertacija, Fakultet elektrotehnike i računarstva, Zagreb


Naslov
Automating the Process of Schema Integration for Heterogeneous Data Warehouses

Autori
Banek, Marko

Vrsta, podvrsta i kategorija rada
Ocjenski radovi, doktorska disertacija

Fakultet
Fakultet elektrotehnike i računarstva

Mjesto
Zagreb

Datum
07.12

Godina
2007

Stranica
192

Mentor
Vrdoljak, Boris ; Tjoa, A Min

Ključne riječi
Data warehouse; data warehouse integration; federated data warehouse; schema matching; multidimensional data model; semantic similarity; structure similarity; mapping; bipartite graphs

Sažetak
This doctoral thesis proposes an approach for automated schema integration of heterogeneous data warehouses. A federated data warehouse is a logical integration of data warehouses applicable when physical integration is impossible due to privacy policy or legal restrictions. In order to enable the translation of queries in a federated approach, heterogeneous schemas of the federated and the local warehouses must be matched. The proposed schema integration procedure is capable of solving heterogeneities among data warehouse structures specific to the multidimensional conceptual model: facts, measures, dimensions, aggregation levels and dimensional attributes. Similarities between warehouse schema structures are computed by using semantic and structural comparison. Filter algorithms, based on bipartite graph matching, use the calculated similarity values for creating necessary mappings between multidimensional structures. Restriction rules are proposed for aggregation level matching, as the partial order in dimension hierarchies must be preserved. A software implementation of the entire process is provided in order to perform its verification and to determine the proper filter algorithms for mapping different multidimensional structures.

Izvorni jezik
Engleski

Znanstvena područja
Računarstvo



POVEZANOST RADA


Projekt / tema
036-0362027-1638 - Umrežena ekonomija (Zoran Skočir, )

Ustanove
Fakultet elektrotehnike i računarstva, Zagreb

Autor s matičnim brojem:
Marko Banek, (267872)