Automatic mapping of documents to multiple domains using ontology and fuzzy sets / (Record no. 74815)
[ view plain ]
000 -LEADER | |
---|---|
fixed length control field | 02770cam a2200337 a 4500 |
003 - CONTROL NUMBER IDENTIFIER | |
control field | EG-GiCUC |
005 - DATE AND TIME OF LATEST TRANSACTION | |
control field | 20250223032426.0 |
008 - FIXED-LENGTH DATA ELEMENTS--GENERAL INFORMATION | |
fixed length control field | 191028s2018 ua d f m 000 0 eng d |
040 ## - CATALOGING SOURCE | |
Original cataloging agency | EG-GiCUC |
Language of cataloging | eng |
Transcribing agency | EG-GiCUC |
041 0# - LANGUAGE CODE | |
Language code of text/sound track or separate title | eng |
049 ## - LOCAL HOLDINGS (OCLC) | |
Holding library | Deposite |
097 ## - Thesis Degree | |
Thesis Level | M.Sc |
099 ## - LOCAL FREE-TEXT CALL NUMBER (OCLC) | |
Classification number | Cai01.18.02.M.Sc.2018.Ab.A |
100 0# - MAIN ENTRY--PERSONAL NAME | |
Personal name | Abdelrahman Mostafa Arab |
245 10 - TITLE STATEMENT | |
Title | Automatic mapping of documents to multiple domains using ontology and fuzzy sets / |
Statement of responsibility, etc. | Abdelrahman Mostafa Arab ; Supervised Ahmad Gadallah , Akram Salah |
246 15 - VARYING FORM OF TITLE | |
Title proper/short title | المطابقة الآلية للوثائق فى مجالات متعددة بإستخدام الأونطولوجى و الفئات الفازية |
260 ## - PUBLICATION, DISTRIBUTION, ETC. | |
Place of publication, distribution, etc. | Cairo : |
Name of publisher, distributor, etc. | Abdelrahman Mostafa Arab , |
Date of publication, distribution, etc. | 2018 |
300 ## - PHYSICAL DESCRIPTION | |
Extent | 226 Leaves : |
Other physical details | charts ; |
Dimensions | 30cm |
502 ## - DISSERTATION NOTE | |
Dissertation note | Thesis (M.Sc.) - Cairo University - Faculty of Graduate Studies for Statistical Research - Department of Computer and Information Science |
520 ## - SUMMARY, ETC. | |
Summary, etc. | Classification is an important technique used in information retrieval. Supervised classification suffers from certain limitations concerning the collection and the labeling of the training dataset. The problem gets more complicated when facing Multi-domain classification where multiple training datasets and classifiers are needed which is typically difficult. This thesis proposes a training-less multi-domain classification approach where each domain is represented by an ontology. A document is mapped on each ontology based on the weights of the mutual tokens between them. A mapping degree for the document with each domain is then determined with the help of fuzzy sets. A Multi-Domain Document Classification information retrieval system (MDDC) is built as an implementation of the proposed approach. A fuzzy matching approach, based on fuzzy triangular numbers, has also been used as another way in determining the mapping degree. The system was tested on a dataset of 180 journal articles of different domains where it succeeded in classifying them with an accuracy of 92.22%. The fuzzy triangular numbers approach succeeded in obtaining comparable results with the original approach. A number of evaluations have also been performed including comparing the system{u2019}s results with those of other algorithms using WEKA and RapidMiner as two of the top machine learning tools nowadays. The evaluation results were highly comparable and promising |
530 ## - ADDITIONAL PHYSICAL FORM AVAILABLE NOTE | |
Additional physical form available note | Issued also as CD |
653 #4 - INDEX TERM--UNCONTROLLED | |
Uncontrolled term | Information retrieval |
653 #4 - INDEX TERM--UNCONTROLLED | |
Uncontrolled term | Machine learning |
653 #4 - INDEX TERM--UNCONTROLLED | |
Uncontrolled term | Ontology |
700 0# - ADDED ENTRY--PERSONAL NAME | |
Personal name | Ahmad Gadallah , |
Relator term | |
700 0# - ADDED ENTRY--PERSONAL NAME | |
Personal name | Akram Salah , |
Relator term | |
856 ## - ELECTRONIC LOCATION AND ACCESS | |
Uniform Resource Identifier | <a href="http://172.23.153.220/th.pdf">http://172.23.153.220/th.pdf</a> |
905 ## - LOCAL DATA ELEMENT E, LDE (RLIN) | |
Cataloger | Nazla |
Reviser | Revisor |
905 ## - LOCAL DATA ELEMENT E, LDE (RLIN) | |
Cataloger | Samia |
Reviser | Cataloger |
942 ## - ADDED ENTRY ELEMENTS (KOHA) | |
Source of classification or shelving scheme | Dewey Decimal Classification |
Koha item type | Thesis |
Source of classification or shelving scheme | Not for loan | Home library | Current library | Date acquired | Full call number | Barcode | Date last seen | Koha item type | Copy number |
---|---|---|---|---|---|---|---|---|---|
Dewey Decimal Classification | المكتبة المركزبة الجديدة - جامعة القاهرة | قاعة الرسائل الجامعية - الدور الاول | 11.02.2024 | Cai01.18.02.M.Sc.2018.Ab.A | 01010110079690000 | 22.09.2023 | Thesis | ||
Dewey Decimal Classification | المكتبة المركزبة الجديدة - جامعة القاهرة | مخـــزن الرســائل الجـــامعية - البدروم | 11.02.2024 | Cai01.18.02.M.Sc.2018.Ab.A | 01020110079690000 | 22.09.2023 | CD - Rom | 79690.CD |