Intelligent outlier identification and categorization in dynamic big data systems / (Record no. 83388)
[ view plain ]
000 -LEADER | |
---|---|
fixed length control field | 03253cam a2200349 a 4500 |
003 - CONTROL NUMBER IDENTIFIER | |
control field | EG-GiCUC |
005 - DATE AND TIME OF LATEST TRANSACTION | |
control field | 20250223032855.0 |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
fixed length control field | 211204s2021 ua dh f m 000 0 eng d |
040 ## - CATALOGING SOURCE | |
Original cataloging agency | EG-GiCUC |
Language of cataloging | eng |
Transcribing agency | EG-GiCUC |
041 0# - LANGUAGE CODE | |
Language code of text/sound track or separate title | eng |
049 ## - LOCAL HOLDINGS (OCLC) | |
Holding library | Deposite |
097 ## - Thesis Degree | |
Thesis Level | M.Sc |
099 ## - LOCAL FREE-TEXT CALL NUMBER (OCLC) | |
Classification number | Cai01.20.03.M.Sc.2021.Hu.I |
100 0# - MAIN ENTRY--PERSONAL NAME | |
Personal name | Huda Mohammed Touny |
245 10 - TITLE STATEMENT | |
Title | Intelligent outlier identification and categorization in dynamic big data systems / |
Statement of responsibility, etc. | Huda Mohammed Touny ; Supervised Ahmed Ibrahim Farag , Ahmed Shawky Moussa , Ali S. Hadi |
246 15 - VARYING FORM OF TITLE | |
Title proper/short title | آلية ذكية للتعرف وتصنيف القيم المتطرفة فى نظم البيانات الكبيرة الديناميكية |
260 ## - PUBLICATION, DISTRIBUTION, ETC. | |
Place of publication, distribution, etc. | Cairo : |
Name of publisher, distributor, etc. | Huda Mohammed Touny , |
Date of publication, distribution, etc. | 2021 |
300 ## - PHYSICAL DESCRIPTION | |
Extent | 103 Leaves : |
Other physical details | charts , facsimiles ; |
Dimensions | 30cm |
502 ## - DISSERTATION NOTE | |
Dissertation note | Thesis (M.Sc.) - Cairo University - Faculty of Computers and Artificial Intelligence - Department of Computer Science |
520 ## - SUMMARY, ETC. | |
Summary, etc. | Outlier detection has been a critical task of various application domains and has been researched for a while. Outlier detection represents a challenge as it is difficult to accurately define and quantify the notion of outliers. Another challenge lies in the customization of outlier detection to the corresponding domain. Thus, many techniques have been introduced for outlier detection, yet they do suffer drawbacks such as labelling a datum that is close to the separating boundary between normal and outlying behaviour. Hence, depending on a crisp cut-off value to identify outliers is not linguistically meaningful or insightful for reliable decision-making. In this research, five methods of fuzzy treatment for the Blocked Adaptive Computationallyefficient Outlier Nominator (BACON) algorithm are proposed rather than a crisp cutoff threshold. The proposed solutions use Fuzzy Computing to capture the intrinsic uncertainty around the border between the main-stream data and outliers.The experimentations done in this research are mainly divided into two sets.The first set of experiments concerns about fuzzifying the output of the last iteration of BACON. The other set of experiments concerns about the fuzzification of each intermediate iteration of BACON.The aim of conducting the first set of experiments is to analyze the levels of uncertainty of the candidate outliers obtained by BACON and how this may affect the interpretation of outliers. The motive for the other set of experiments is to investigate the possibility of reducing the number of iterations of BACON while still having approximate fuzzy intermediate set of outliers that matches the final set declared by BACON. Four repository datasets have been used in the experimental part of this research. The datasets are different in their characteristics to validate the proposed solutions under various scenarios |
530 ## - ADDITIONAL PHYSICAL FORM AVAILABLE NOTE | |
Additional physical form available note | Issued also as CD |
653 #4 - INDEX TERM--UNCONTROLLED | |
Uncontrolled term | Fuzzy Numbers |
653 #4 - INDEX TERM--UNCONTROLLED | |
Uncontrolled term | Fuzzy Outliers |
653 #4 - INDEX TERM--UNCONTROLLED | |
Uncontrolled term | Multivariate Outliers |
700 0# - ADDED ENTRY--PERSONAL NAME | |
Personal name | Ahmed Shawky Moussa , |
Relator term | |
700 0# - ADDED ENTRY--PERSONAL NAME | |
Personal name | Ali S. Hadi , |
Relator term | |
700 0# - ADDED ENTRY--PERSONAL NAME | |
Personal name | Ibrahim Farag , |
Relator term | |
856 ## - ELECTRONIC LOCATION AND ACCESS | |
Uniform Resource Identifier | <a href="http://172.23.153.220/th.pdf">http://172.23.153.220/th.pdf</a> |
905 ## - LOCAL DATA ELEMENT E, LDE (RLIN) | |
Cataloger | Nazla |
Reviser | Revisor |
905 ## - LOCAL DATA ELEMENT E, LDE (RLIN) | |
Cataloger | Shimaa |
Reviser | Cataloger |
942 ## - ADDED ENTRY ELEMENTS (KOHA) | |
Source of classification or shelving scheme | Dewey Decimal Classification |
Koha item type | Thesis |
Source of classification or shelving scheme | Not for loan | Home library | Current library | Date acquired | Full call number | Barcode | Date last seen | Koha item type | Copy number |
---|---|---|---|---|---|---|---|---|---|
Dewey Decimal Classification | المكتبة المركزبة الجديدة - جامعة القاهرة | قاعة الرسائل الجامعية - الدور الاول | 11.02.2024 | Cai01.20.03.M.Sc.2021.Hu.I | 01010110084891000 | 22.09.2023 | Thesis | ||
Dewey Decimal Classification | المكتبة المركزبة الجديدة - جامعة القاهرة | مخـــزن الرســائل الجـــامعية - البدروم | 11.02.2024 | Cai01.20.03.M.Sc.2021.Hu.I | 01020110084891000 | 22.09.2023 | CD - Rom | 84891.CD |