000 03253cam a2200349 a 4500
003 EG-GiCUC
005 20250223032855.0
008 211204s2021 ua dh f m 000 0 eng d
040 _aEG-GiCUC
_beng
_cEG-GiCUC
041 0 _aeng
049 _aDeposite
097 _aM.Sc
099 _aCai01.20.03.M.Sc.2021.Hu.I
100 0 _aHuda Mohammed Touny
245 1 0 _aIntelligent outlier identification and categorization in dynamic big data systems /
_cHuda Mohammed Touny ; Supervised Ahmed Ibrahim Farag , Ahmed Shawky Moussa , Ali S. Hadi
246 1 5 _aآلية ذكية للتعرف وتصنيف القيم المتطرفة فى نظم البيانات الكبيرة الديناميكية
260 _aCairo :
_bHuda Mohammed Touny ,
_c2021
300 _a103 Leaves :
_bcharts , facsimiles ;
_c30cm
502 _aThesis (M.Sc.) - Cairo University - Faculty of Computers and Artificial Intelligence - Department of Computer Science
520 _aOutlier detection has been a critical task of various application domains and has been researched for a while. Outlier detection represents a challenge as it is difficult to accurately define and quantify the notion of outliers. Another challenge lies in the customization of outlier detection to the corresponding domain. Thus, many techniques have been introduced for outlier detection, yet they do suffer drawbacks such as labelling a datum that is close to the separating boundary between normal and outlying behaviour. Hence, depending on a crisp cut-off value to identify outliers is not linguistically meaningful or insightful for reliable decision-making. In this research, five methods of fuzzy treatment for the Blocked Adaptive Computationallyefficient Outlier Nominator (BACON) algorithm are proposed rather than a crisp cutoff threshold. The proposed solutions use Fuzzy Computing to capture the intrinsic uncertainty around the border between the main-stream data and outliers.The experimentations done in this research are mainly divided into two sets.The first set of experiments concerns about fuzzifying the output of the last iteration of BACON. The other set of experiments concerns about the fuzzification of each intermediate iteration of BACON.The aim of conducting the first set of experiments is to analyze the levels of uncertainty of the candidate outliers obtained by BACON and how this may affect the interpretation of outliers. The motive for the other set of experiments is to investigate the possibility of reducing the number of iterations of BACON while still having approximate fuzzy intermediate set of outliers that matches the final set declared by BACON. Four repository datasets have been used in the experimental part of this research. The datasets are different in their characteristics to validate the proposed solutions under various scenarios
530 _aIssued also as CD
653 4 _aFuzzy Numbers
653 4 _aFuzzy Outliers
653 4 _aMultivariate Outliers
700 0 _aAhmed Shawky Moussa ,
_eSupervisor
700 0 _aAli S. Hadi ,
_eSupervisor
700 0 _aIbrahim Farag ,
_eSupervisor
856 _uhttp://172.23.153.220/th.pdf
905 _aNazla
_eRevisor
905 _aShimaa
_eCataloger
942 _2ddc
_cTH
999 _c83388
_d83388