000 | 03228cam a2200337 a 4500 | ||
---|---|---|---|
003 | EG-GiCUC | ||
005 | 20250223032440.0 | ||
008 | 191117s2019 ua do f m 000 0 eng d | ||
040 |
_aEG-GiCUC _beng _cEG-GiCUC |
||
041 | 0 | _aeng | |
049 | _aDeposite | ||
097 | _aPh.D | ||
099 | _aCai01.20.04.Ph.D.2019.Al.D | ||
100 | 0 | _aAli Eid Ali Zidane Elqutaany | |
245 | 1 | 0 |
_aData integration framework for multi-objective queries / _cAli Eid Ali Zidane Elqutaany ; Supervised Osman Hegazi , Ali H. Elbastawissy |
246 | 1 | 5 | _aإطار تكامل البيانات للإستعلام متعددة الأهداف |
260 |
_aCairo : _bAli Eid Ali Zidane Elqutaany , _c2019 |
||
300 |
_a161 Leaves : _bcharts , photographs ; _c30cm |
||
502 | _aThesis (Ph.D.) - Cairo University - Faculty of Computers and Artificial Intelligence - Department of Information Systems | ||
520 | _aNowadays, organizations cannot satisfy their information needs from one data source. Moreover, multiple data sources across the organization fuels the need for data integration. Data integration system{u2019}s users pose their queries to the integration system in terms of an integrated schema and expect duplicate-free and complete answers. In order to meet users{u2019} expectations; data integration is not limited to getting the answers from the sources, but it is extended to detect and resolve the data quality problems appeared due to the integration. Three processes: data integration, entity matching and entity resolution are mandatory for an integration framework to provide duplicate free and complete answers for user{u2019}s queries. The existing data integration frameworks are performing their processes independently from each other, where the data is integrated from the sources, then the duplicates are detected regardless how data was integrated, and finally the duplicates are resolved regardless how the other two processes were performed. In this thesis, a new data integration framework is introduced to provide complete and duplicate free answers for user{u2019}s queries, as it performs all its processes with complete interfacing and interleaving. The interfacing and interleaving between the processes provide significant enhancements in the effectiveness and completeness of the provided answers. The most crucial component in any data integration framework is the mappings of the data sources to the integrated schema, hence the first contribution in the proposed framework is a new mapping approach which introduced to map not only the elements of the integrated schema as performed by the existing approaches, but also it maps other elements required in detecting and resolving the duplicates. This approach provides means to facilitate future extensibility of the integration system and provides a linkage between the processes of the framework | ||
530 | _aIssued also as CD | ||
653 | 4 | _aData integration | |
653 | 4 | _aEntity matching | |
653 | 4 | _aVirtual data integration | |
700 | 0 |
_aAli H. Elbastawissy , _eSupervisor |
|
700 | 0 |
_aOsman Hegazi , _eSupervisor |
|
856 | _uhttp://172.23.153.220/th.pdf | ||
905 |
_aNazla _eRevisor |
||
905 |
_aSamia _eCataloger |
||
942 |
_2ddc _cTH |
||
999 |
_c75252 _d75252 |