Initial data reorderering in mapreduce technique for specific data categories / (Record no. 72176)
[ view plain ]
| 000 -LEADER | |
|---|---|
| fixed length control field | 02575cam a2200337 a 4500 |
| 003 - CONTROL NUMBER IDENTIFIER | |
| control field | EG-GiCUC |
| 005 - DATE AND TIME OF LATEST TRANSACTION | |
| control field | 20250223032307.0 |
| 008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
| fixed length control field | 190526s2018 ua dh f m 000 0 eng d |
| 040 ## - CATALOGING SOURCE | |
| Original cataloging agency | EG-GiCUC |
| Language of cataloging | eng |
| Transcribing agency | EG-GiCUC |
| 041 0# - LANGUAGE CODE | |
| Language code of text/sound track or separate title | eng |
| 049 ## - LOCAL HOLDINGS (OCLC) | |
| Holding library | Deposite |
| 097 ## - Thesis Degree | |
| Thesis Level | M.Sc |
| 099 ## - LOCAL FREE-TEXT CALL NUMBER (OCLC) | |
| Classification number | Cai01.20.04.M.Sc.2018.Ah.I |
| 100 0# - MAIN ENTRY--PERSONAL NAME | |
| Personal name | Ahmed Abdelrahim Ali Eldouh |
| 245 10 - TITLE STATEMENT | |
| Title | Initial data reorderering in mapreduce technique for specific data categories / |
| Statement of responsibility, etc. | Ahmed Abdelrahim Ali Eldouh ; Supervised Hatem Elkadi , Mohamed Helmy Khafagy |
| 246 15 - VARYING FORM OF TITLE | |
| Title proper/short title | إعادة ترتيب البيانات الاولية فى تقنية تصغير الخريطة لفئات بيانات محددة |
| 260 ## - PUBLICATION, DISTRIBUTION, ETC. | |
| Place of publication, distribution, etc. | Cairo : |
| Name of publisher, distributor, etc. | Ahmed Abdelrahim Ali Eldouh , |
| Date of publication, distribution, etc. | 2018 |
| 300 ## - PHYSICAL DESCRIPTION | |
| Extent | 87 Leaves : |
| Other physical details | charts , facsimiles ; |
| Dimensions | 30cm |
| 502 ## - DISSERTATION NOTE | |
| Dissertation note | Thesis (M.Sc.) - Cairo University - Faculty of Computers and Information - Department of Information System |
| 520 ## - SUMMARY, ETC. | |
| Summary, etc. | The rapid increase in big data sets presents an urgent need for handling the difficulty in storing and processing of these datasets. MapReduce is a recent programming model which was initiated by Google{u2019}s Team to handle big data sets and storing. Hadoop is an open source software with an implementation of MapReduce presented by Apache. MapReduce requires a shuffling phase to exchange global the intermediate data generated by the mapping phase, but the shuffling phase in MapReduce increases the overhead on performance. In this thesis, we explore the literature on the shuffling subject and discuss previous techniques adopted to enhance the performance of MapReduce. In addition to our focus on an approach to improve the performance of MapReduce through reducing the overhead caused by shuffling phase. Improving the locality of data will lead to eliminating the network overhead in the shuffling phase for the MapReduce. We achieve this by pre-partitioning data based on query-based similarity through the TF {u2013} IDF and Cosine similarity algorithms and grouping the related queries with each other using K-means clustering algorithm. In this regard, we support HDFS with the related data and control where data are stored to collocate the related data files in the same nodes |
| 530 ## - ADDITIONAL PHYSICAL FORM AVAILABLE NOTE | |
| Additional physical form available note | Issued also as CD |
| 653 #4 - INDEX TERM--UNCONTROLLED | |
| Uncontrolled term | Hadoop |
| 653 #4 - INDEX TERM--UNCONTROLLED | |
| Uncontrolled term | Mapreduce |
| 653 #4 - INDEX TERM--UNCONTROLLED | |
| Uncontrolled term | Shuffling |
| 700 0# - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Hatem Elkadi , |
| Relator term | |
| 700 0# - ADDED ENTRY--PERSONAL NAME | |
| Personal name | Mohamed Helmy Khafagy , |
| Relator term | |
| 856 ## - ELECTRONIC LOCATION AND ACCESS | |
| Uniform Resource Identifier | <a href="http://172.23.153.220/th.pdf">http://172.23.153.220/th.pdf</a> |
| 905 ## - LOCAL DATA ELEMENT E, LDE (RLIN) | |
| Cataloger | Asmaa |
| Reviser | Cataloger |
| 905 ## - LOCAL DATA ELEMENT E, LDE (RLIN) | |
| Cataloger | Nazla |
| Reviser | Revisor |
| 942 ## - ADDED ENTRY ELEMENTS (KOHA) | |
| Source of classification or shelving scheme | Dewey Decimal Classification |
| Koha item type | Thesis |
| Source of classification or shelving scheme | Not for loan | Home library | Current library | Date acquired | Full call number | Barcode | Date last seen | Koha item type | Copy number |
|---|---|---|---|---|---|---|---|---|---|
| Dewey Decimal Classification | المكتبة المركزبة الجديدة - جامعة القاهرة | قاعة الرسائل الجامعية - الدور الاول | 11.02.2024 | Cai01.20.04.M.Sc.2018.Ah.I | 01010110078233000 | 22.09.2023 | Thesis | ||
| Dewey Decimal Classification | المكتبة المركزبة الجديدة - جامعة القاهرة | مخـــزن الرســائل الجـــامعية - البدروم | 11.02.2024 | Cai01.20.04.M.Sc.2018.Ah.I | 01020110078233000 | 22.09.2023 | CD - Rom | 78233.CD |