Summary of MSc Thesis

 

Tools for Integrating Conceptual Schemes in a Multi-database Environment using an Object-Oriented Approach.

 

The integration of heterogeneous pre-existing database systems is important to provide access to data distributed across systems. Some difficulties arise in the integration process due to conflicting values for related data in different database systems.

To alleviate the problems of heterogeneity, each component should support an intermediate schema in a common data model. Such a common data model should have some desired features to overcome many obstacles.

Recently, Object-Orientation technology seems promising to solve such obstacles in schema translation and integration because of the ability of  capturing the semantics of  conceptual schemes.

In this project, an Object-Oriented Model has been proposed as a Common Data Model to capture the necessary semantics of the relational, hierarchical, and network data model schemes.

Three transformation rules were designed in order to transform relational, hierarchical, and network data model schemes to the object common data model that represent the  intermediate schemes.

It was difficult to capture clear conceptual schema from Pre-relational systems such as CODASYL network system and IMS hierarchical  system, because most of the semantics are hidden in the application programs. However, such difficulties may be overcome -simply- by user interaction, or by achieving operational mapping that can be supported by object-oriented paradigm.   

The produced intermediate schemes will be integrated through various  integration rules to produce a Global Conceptual Schema that describes the structure of the  whole nodes participating in the Multi-database system. Integration process, in general,  consists of three layers:-

1) The lowest layer compares object  properties( attributes and methods) of pair of schemes. This layer produces different weights that represent the number of equivalent properties between the two objects.

2)The Intermediate layer calculates the weighted sum to decide whether two objects are equivalent or not, taking into consideration the user conformation.

3)The upper layer decide whether the pair of schemes are integratable or not, this will be decided according to the entered  weights and assertions.

Some operators were built to merge the structure of the equivalent objects and relationships in order to generate a minimal, understandable, and complete Global Conceptual Schemes.

An interactive tool (Tools for Schema Transformation and Integration TOOSTI) was developed using Borland C++ under Microsoft Windows, which supports the various mentioned phases of the schema transformation and integration processes.

The main features of this tool is that it is based on object- oriented methodology. Secondly, The phases of transformation and integration are driven automatically without user intervention, but  user may be interrogated  in the case of semantic comparison. Third, this tool may be expanded to involve future algorithms and techniques because of the reusability property of Object-Orientation. Forth, object correspondences depend on two  measures weighted sum and Assertions. Assertions may be generated due to weighted sum, or user interaction.

Through integration of tens experimental applications using this tool, we found that it is difficult to isolate user interaction during integration process, since we found that two schemes may be integratable, in spite that there are no object correspondences between them. However, we may minimize user interaction by following future work:

1) Object Common Data Model may be augmented by a suitable knowledge-base that support reasoning system of the available assertions.

2) A case-reasoning system can be developed to solve semantic integration problems by utilizing previous solutions.

3) Some of linguistic methods may be adopted to solve serious problems in semantic integration.