An intelligent data clustering model for a real application / Doaa Saleh Ali ; Supervised Mohamed Saleh , Mohamed Rasmy , Ayman Ghoneim

بواسطة:

Doaa Saleh Ali

المساهم:

نوع المادة :

نصاللغة: الإنجليزية تفاصيل النشر: Cairo : Doaa Saleh Ali , 2017الوصف: 195 Leaves : charts ; 30cmعنوان آخر:

نموذج عنقودية البيانات الذكية مع التطبيق الواقعي [عنوان مضاف عنوان الصفحة]

الموضوع:

موارد على الإنترنت:

اضغط هنا للوصول بشكل مباشر

Available additional physical forms:

Issued also as CD

ملاحظة الأطروحة: Thesis (Ph.D.) - Cairo University - Faculty of Computers and Information - Department of Operations Research and Decision Support ملخص: Data Clustering, an important unsupervised technique in data mining, aims to identify interesting distributions and patterns in the underlying data. Cluster validity indices are used to evaluate the performance of clustering models. Some recent research used cluster validity indices as the objective functions in multiobjective framework, in order to improve the clustering performance. Therefore, an interesting research question is how to further improve the clustering performance via cluster validity indices. We address this research question by three main contributions. First, using new combinations of cluster validity indices, we introduce two new multiobjective data clustering models for numerical and categorical data. Based on our literature review, we select a combination of cluster validity indices (i.e. objective functions) for the proposed clustering models. Based on the experimental results, the proposed multiobjective data clustering models prove their efficiency in improving the clustering performance. However, when forming a new combination of the cluster validity indices for any given dataset, there are still open research questions regarding what the best cluster validity indices are to use and what the best size for this combination is. The second contribution of the dissertation addresses these questions by proposing a hybrid meta-heuristic clustering (HMHC) methodology for computing the best combination of the cluster validity indices for any used dataset. The HMHC methodology illustrates its ability to compute a different and better-performing combination of indices for each benchmark dataset. Also, for reducing the complexity of the HMHC methodology, we introduce a way to filter the indices in the pool based on the data features of the dataset under consideration. Finally, we also introduce some recommendations for the practitioners in a data clustering field, by doing some additional analyses on the experimental results by using the concepts of Shapely value and mutual information

وسوم من هذه المكتبة: لا توجد وسوم لهذا العنوان في هذه المكتبة. قم بتسجيل الدخول لإضافة الوسوم.

المقتنيات
نوع المادة	المكتبة الحالية	المكتبة الرئيسية	رقم الاستدعاء	رقم النسخة	حالة	الباركود
Thesis	قاعة الرسائل الجامعية - الدور الاول	المكتبة المركزبة الجديدة - جامعة القاهرة	Cai01.20.02.Ph.D.2017.Do.I (استعراض الرف(يفتح أدناه))		لا تعار	01010110074659000
CD - Rom	مخـــزن الرســائل الجـــامعية - البدروم	المكتبة المركزبة الجديدة - جامعة القاهرة	Cai01.20.02.Ph.D.2017.Do.I (استعراض الرف(يفتح أدناه))	74659.CD	لا تعار	01020110074659000

Thesis (Ph.D.) - Cairo University - Faculty of Computers and Information - Department of Operations Research and Decision Support

Data Clustering, an important unsupervised technique in data mining, aims to identify interesting distributions and patterns in the underlying data. Cluster validity indices are used to evaluate the performance of clustering models. Some recent research used cluster validity indices as the objective functions in multiobjective framework, in order to improve the clustering performance. Therefore, an interesting research question is how to further improve the clustering performance via cluster validity indices. We address this research question by three main contributions. First, using new combinations of cluster validity indices, we introduce two new multiobjective data clustering models for numerical and categorical data. Based on our literature review, we select a combination of cluster validity indices (i.e. objective functions) for the proposed clustering models. Based on the experimental results, the proposed multiobjective data clustering models prove their efficiency in improving the clustering performance. However, when forming a new combination of the cluster validity indices for any given dataset, there are still open research questions regarding what the best cluster validity indices are to use and what the best size for this combination is. The second contribution of the dissertation addresses these questions by proposing a hybrid meta-heuristic clustering (HMHC) methodology for computing the best combination of the cluster validity indices for any used dataset. The HMHC methodology illustrates its ability to compute a different and better-performing combination of indices for each benchmark dataset. Also, for reducing the complexity of the HMHC methodology, we introduce a way to filter the indices in the pool based on the data features of the dataset under consideration. Finally, we also introduce some recommendations for the practitioners in a data clustering field, by doing some additional analyses on the experimental results by using the concepts of Shapely value and mutual information

Issued also as CD

لا توجد تعليقات على هذا العنوان.

لنشر تعليق.

اضغط على الصورة لمشاهدتها في عارض الصور

جامعة القاهرة

المكتبة المركزية الجديدة

مكتبة جامعة القاهرة الأهلية

An intelligent data clustering model for a real application / Doaa Saleh Ali ; Supervised Mohamed Saleh , Mohamed Rasmy , Ayman Ghoneim