Software defect prediction using data categorization and machine learning techniques / Moheb Mofied Ragheb Henein ; Supervised Salwa K. Abdelhafiz , Doaa M. Shawky
Material type:
- التنبؤ بعيوب البرامج عن طريق استخدام طرق تصنيف البيانات وتقنيات التعلم الآلى [Added title page title]
- Issued also as CD
Item type | Current library | Home library | Call number | Copy number | Status | Barcode | |
---|---|---|---|---|---|---|---|
![]() |
قاعة الرسائل الجامعية - الدور الاول | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.13.10.M.Sc.2020.Mo.S (Browse shelf(Opens below)) | Not for loan | 01010110082711000 | ||
![]() |
مخـــزن الرســائل الجـــامعية - البدروم | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.13.10.M.Sc.2020.Mo.S (Browse shelf(Opens below)) | 82711.CD | Not for loan | 01020110082711000 |
Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Mathematics and Physics
In this thesis, two approaches are proposed to overcome two main challenges in SDP; namely the class imbalance and overlap. The first approach is Clustering-based Undersampling Artificial Neural Network (CU-ANN) that tackles the imbalance problem. The second approach is Hybrid sampling Cost- Sensitive Support Vector Machine (HCSVM), which balances the data set by undersampling the majority class instances and oversampling the minority class ones. Moreover, minority samples are categorized based on their severity, where the degree of severity is directly proportional to the number of neighbors belonging to the majority class. Taking into consideration the severity of minority samples in the learning phase alleviates the impact of class overlap. A cost-sensitive approach that assigns high misclassification costs to di cult minority samples considers these samples rather than treating them as outliers. Experiments are conducted on benchmark data sets, NASA MDP, which are the most used datasets in SDP performance evaluation
Issued also as CD
There are no comments on this title.