000 03228cam a2200337 a 4500
003 EG-GiCUC
005 20250223032440.0
008 191117s2019 ua do f m 000 0 eng d
040 _aEG-GiCUC
_beng
_cEG-GiCUC
041 0 _aeng
049 _aDeposite
097 _aPh.D
099 _aCai01.20.04.Ph.D.2019.Al.D
100 0 _aAli Eid Ali Zidane Elqutaany
245 1 0 _aData integration framework for multi-objective queries /
_cAli Eid Ali Zidane Elqutaany ; Supervised Osman Hegazi , Ali H. Elbastawissy
246 1 5 _aإطار تكامل البيانات للإستعلام متعددة الأهداف
260 _aCairo :
_bAli Eid Ali Zidane Elqutaany ,
_c2019
300 _a161 Leaves :
_bcharts , photographs ;
_c30cm
502 _aThesis (Ph.D.) - Cairo University - Faculty of Computers and Artificial Intelligence - Department of Information Systems
520 _aNowadays, organizations cannot satisfy their information needs from one data source. Moreover, multiple data sources across the organization fuels the need for data integration. Data integration system{u2019}s users pose their queries to the integration system in terms of an integrated schema and expect duplicate-free and complete answers. In order to meet users{u2019} expectations; data integration is not limited to getting the answers from the sources, but it is extended to detect and resolve the data quality problems appeared due to the integration. Three processes: data integration, entity matching and entity resolution are mandatory for an integration framework to provide duplicate free and complete answers for user{u2019}s queries. The existing data integration frameworks are performing their processes independently from each other, where the data is integrated from the sources, then the duplicates are detected regardless how data was integrated, and finally the duplicates are resolved regardless how the other two processes were performed. In this thesis, a new data integration framework is introduced to provide complete and duplicate free answers for user{u2019}s queries, as it performs all its processes with complete interfacing and interleaving. The interfacing and interleaving between the processes provide significant enhancements in the effectiveness and completeness of the provided answers. The most crucial component in any data integration framework is the mappings of the data sources to the integrated schema, hence the first contribution in the proposed framework is a new mapping approach which introduced to map not only the elements of the integrated schema as performed by the existing approaches, but also it maps other elements required in detecting and resolving the duplicates. This approach provides means to facilitate future extensibility of the integration system and provides a linkage between the processes of the framework
530 _aIssued also as CD
653 4 _aData integration
653 4 _aEntity matching
653 4 _aVirtual data integration
700 0 _aAli H. Elbastawissy ,
_eSupervisor
700 0 _aOsman Hegazi ,
_eSupervisor
856 _uhttp://172.23.153.220/th.pdf
905 _aNazla
_eRevisor
905 _aSamia
_eCataloger
942 _2ddc
_cTH
999 _c75252
_d75252