header
Local cover image
Local cover image
Image from OpenLibrary

Software defect prediction using data categorization and machine learning techniques / Moheb Mofied Ragheb Henein ; Supervised Salwa K. Abdelhafiz , Doaa M. Shawky

By: Contributor(s): Material type: TextTextLanguage: English Publication details: Cairo : Moheb Mofied Ragheb Henein , 2020Description: 81 P . : charts ; 30cmOther title:
  • التنبؤ بعيوب البرامج عن طريق استخدام طرق تصنيف البيانات وتقنيات التعلم الآلى [Added title page title]
Subject(s): Online resources: Available additional physical forms:
  • Issued also as CD
Dissertation note: Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Mathematics and Physics Summary: In this thesis, two approaches are proposed to overcome two main challenges in SDP; namely the class imbalance and overlap. The first approach is Clustering-based Undersampling Artificial Neural Network (CU-ANN) that tackles the imbalance problem. The second approach is Hybrid sampling Cost- Sensitive Support Vector Machine (HCSVM), which balances the data set by undersampling the majority class instances and oversampling the minority class ones. Moreover, minority samples are categorized based on their severity, where the degree of severity is directly proportional to the number of neighbors belonging to the majority class. Taking into consideration the severity of minority samples in the learning phase alleviates the impact of class overlap. A cost-sensitive approach that assigns high misclassification costs to di cult minority samples considers these samples rather than treating them as outliers. Experiments are conducted on benchmark data sets, NASA MDP, which are the most used datasets in SDP performance evaluation
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Home library Call number Copy number Status Barcode
Thesis Thesis قاعة الرسائل الجامعية - الدور الاول المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.13.10.M.Sc.2020.Mo.S (Browse shelf(Opens below)) Not for loan 01010110082711000
CD - Rom CD - Rom مخـــزن الرســائل الجـــامعية - البدروم المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.13.10.M.Sc.2020.Mo.S (Browse shelf(Opens below)) 82711.CD Not for loan 01020110082711000

Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Mathematics and Physics

In this thesis, two approaches are proposed to overcome two main challenges in SDP; namely the class imbalance and overlap. The first approach is Clustering-based Undersampling Artificial Neural Network (CU-ANN) that tackles the imbalance problem. The second approach is Hybrid sampling Cost- Sensitive Support Vector Machine (HCSVM), which balances the data set by undersampling the majority class instances and oversampling the minority class ones. Moreover, minority samples are categorized based on their severity, where the degree of severity is directly proportional to the number of neighbors belonging to the majority class. Taking into consideration the severity of minority samples in the learning phase alleviates the impact of class overlap. A cost-sensitive approach that assigns high misclassification costs to di cult minority samples considers these samples rather than treating them as outliers. Experiments are conducted on benchmark data sets, NASA MDP, which are the most used datasets in SDP performance evaluation

Issued also as CD

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer

Local cover image