header
Local cover image
Local cover image
Image from OpenLibrary

Handling mixed missing data / Mai Ahmed Mohsen Moustafa ; Supervised Amany Mousa Mohamed , Yasmin Mohamed Ibrahim

By: Contributor(s): Material type: TextTextLanguage: English Publication details: Cairo : Mai Ahmed Mohsen Moustafa , 2018Description: 154 Leaves : charts , facsimiles ; 30cmOther title:
  • التعامل مع البيانات المفقودة المختلطة [Added title page title]
Subject(s): Online resources: Available additional physical forms:
  • Issued also as CD
Dissertation note: Thesis (M.Sc.) - Cairo University - Institute of Statistical Studies and Research - Department of Statistics and Econometrics Summary: Incomplete data is often an unavoidable problem faced by most applied researchers as survey results often include some non-response. Various techniques have been developed for dealing with missing values in data sets with homogeneous attributes (their independent attributes are all either continuous or discrete). However, these imputation algorithms cannot be directly applied to many real data sets, as survey data sets in general often consist of large numbers of variables which have mixed data types i.e. different measurement scales. Specific methods and modification in existing methods are found for dealing with such kind of data. This thesis reviews some methods for such kind of data and applies six imputation methods out of them. Assessing the performance of the six imputation methods which are MICE, MICE-CART, MICE-RF, MissForest, MissRanger and KNN is performed using 3 real datasets at 5 different missing rates. Complete datasets have been used and variables were artificially made 2missing at random3and results were assessed using different criteria. Across the imputed datasets MissForest and MissRanger tend to have the best results while MICE-RF and KNN tend to have the worst results
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Home library Call number Copy number Status Barcode
Thesis Thesis قاعة الرسائل الجامعية - الدور الاول المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.18.04.M.Sc.2018.Ma.H (Browse shelf(Opens below)) Not for loan 01010110078244000
CD - Rom CD - Rom مخـــزن الرســائل الجـــامعية - البدروم المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.18.04.M.Sc.2018.Ma.H (Browse shelf(Opens below)) 78244.CD Not for loan 01020110078244000

Thesis (M.Sc.) - Cairo University - Institute of Statistical Studies and Research - Department of Statistics and Econometrics

Incomplete data is often an unavoidable problem faced by most applied researchers as survey results often include some non-response. Various techniques have been developed for dealing with missing values in data sets with homogeneous attributes (their independent attributes are all either continuous or discrete). However, these imputation algorithms cannot be directly applied to many real data sets, as survey data sets in general often consist of large numbers of variables which have mixed data types i.e. different measurement scales. Specific methods and modification in existing methods are found for dealing with such kind of data. This thesis reviews some methods for such kind of data and applies six imputation methods out of them. Assessing the performance of the six imputation methods which are MICE, MICE-CART, MICE-RF, MissForest, MissRanger and KNN is performed using 3 real datasets at 5 different missing rates. Complete datasets have been used and variables were artificially made 2missing at random3and results were assessed using different criteria. Across the imputed datasets MissForest and MissRanger tend to have the best results while MICE-RF and KNN tend to have the worst results

Issued also as CD

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer

Local cover image