
Intelligent outlier identification and categorization in dynamic big data systems / Huda Mohammed Touny ; supervised by Ahmed Ibrahim Farag, Ahmed Shawky Moussa, Ali S. Hadi

Material type: Text
Language: English
Publication details: Cairo : Huda Mohammed Touny, 2021
Description: 103 leaves : charts, facsimiles ; 30 cm
Other title:
  • آلية ذكية للتعرف وتصنيف القيم المتطرفة فى نظم البيانات الكبيرة الديناميكية (An intelligent mechanism for identifying and classifying outliers in dynamic big data systems) [Added title page title]
Available additional physical forms:
  • Issued also as CD
Dissertation note: Thesis (M.Sc.) - Cairo University - Faculty of Computers and Artificial Intelligence - Department of Computer Science
Summary: Outlier detection is a critical task in many application domains and has long been studied. It is challenging because the notion of an outlier is difficult to define and quantify accurately. Another challenge lies in customizing outlier detection to the domain at hand. Many techniques have therefore been introduced for outlier detection, yet they suffer from drawbacks such as labelling a datum that lies close to the boundary separating normal from outlying behaviour. Hence, relying on a crisp cut-off value to identify outliers is neither linguistically meaningful nor insightful for reliable decision-making. This research proposes five methods of fuzzy treatment for the Blocked Adaptive Computationally Efficient Outlier Nominator (BACON) algorithm in place of a crisp cut-off threshold. The proposed solutions use fuzzy computing to capture the intrinsic uncertainty around the border between the mainstream data and the outliers. The experiments are divided into two sets. The first set fuzzifies the output of the last iteration of BACON; the second fuzzifies each intermediate iteration of BACON. The aim of the first set is to analyze the levels of uncertainty of the candidate outliers obtained by BACON and how this may affect the interpretation of outliers. The motive for the second set is to investigate whether the number of BACON iterations can be reduced while still obtaining an approximate fuzzy intermediate set of outliers that matches the final set declared by BACON. Four repository datasets are used in the experimental part of this research. The datasets differ in their characteristics so that the proposed solutions are validated under various scenarios.
Holdings
  • Item type: Thesis. Current library: Theses Hall - First Floor. Home library: New Central Library - Cairo University. Call number: Cai01.20.03.M.Sc.2021.Hu.I. Status: Not for loan. Barcode: 01010110084891000
  • Item type: CD-ROM. Current library: Theses Storage - Basement. Home library: New Central Library - Cairo University. Call number: Cai01.20.03.M.Sc.2021.Hu.I. Copy number: 84891.CD. Status: Not for loan. Barcode: 01020110084891000

