Mlrmud : A multi linear regression approach for missing values prediction with unknown dependent variable / Ahmed Karama Mahboab Alhebshi ; Supervised Samir I. Shaheen , Amir F. Atiya , Mona F. Ahmed
Material type: TextLanguage: English Publication details: Cairo : Ahmed Karama Mahboab Alhebshi , 2019Description: 75 P. : charts ; 30cmOther title:- طريقة الانحدار الخطى للتنبؤ بالقيم المفقودة مع المتغير المعتمد المجهول [Added title page title]
- Issued also as CD
Item type | Current library | Home library | Call number | Copy number | Status | Date due | Barcode | |
---|---|---|---|---|---|---|---|---|
Thesis | قاعة الرسائل الجامعية - الدور الاول | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.13.06.M.Sc.2019.Ah.M (Browse shelf(Opens below)) | Not for loan | 01010110079751000 | |||
CD - Rom | مخـــزن الرســائل الجـــامعية - البدروم | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.13.06.M.Sc.2019.Ah.M (Browse shelf(Opens below)) | 79751.CD | Not for loan | 01020110079751000 |
Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Computer Engineering
The missing value problem (MV) is the problem of predicting the missing value in the data set while achieving accurate values. An additional attribute has been imposed on the missing value problem which is an unknown dependent variable. In this work, a new approach, MLRMUD, based on multiple linear regression is used to predict missing values for a data set with an Unknown Dependent variable if complete rows are at least 20%. If they are less than that the mean method is used to fill some rows until the complete rows reach 20%, after that MLRMUD can be applied normally. This approach is composed of three algorithms; splitting algorithm, dependent variable selection algorithm and multi linear regression algorithm. MLRMUD is compared to other counterparts in the literature where it was proved that it outperforms them all in the accuracy of missing values computation determined in terms of the root mean square error (RMSE) and mean standard error (MSE). A method to determine the unknown dependent variable from the training set is proposed. The results show that the proposed method can successfully select the dependent variable with an accuracy of 83% overall the data sets examined
Issued also as CD
There are no comments on this title.