header
Local cover image
Local cover image
Image from OpenLibrary

Automatic mapping of documents to multiple domains using ontology and fuzzy sets / Abdelrahman Mostafa Arab ; Supervised Ahmad Gadallah , Akram Salah

By: Contributor(s): Material type: TextTextLanguage: English Publication details: Cairo : Abdelrahman Mostafa Arab , 2018Description: 226 Leaves : charts ; 30cmOther title:
  • المطابقة الآلية للوثائق فى مجالات متعددة بإستخدام الأونطولوجى و الفئات الفازية [Added title page title]
Subject(s): Online resources: Available additional physical forms:
  • Issued also as CD
Dissertation note: Thesis (M.Sc.) - Cairo University - Faculty of Graduate Studies for Statistical Research - Department of Computer and Information Science Summary: Classification is an important technique used in information retrieval. Supervised classification suffers from certain limitations concerning the collection and the labeling of the training dataset. The problem gets more complicated when facing Multi-domain classification where multiple training datasets and classifiers are needed which is typically difficult. This thesis proposes a training-less multi-domain classification approach where each domain is represented by an ontology. A document is mapped on each ontology based on the weights of the mutual tokens between them. A mapping degree for the document with each domain is then determined with the help of fuzzy sets. A Multi-Domain Document Classification information retrieval system (MDDC) is built as an implementation of the proposed approach. A fuzzy matching approach, based on fuzzy triangular numbers, has also been used as another way in determining the mapping degree. The system was tested on a dataset of 180 journal articles of different domains where it succeeded in classifying them with an accuracy of 92.22%. The fuzzy triangular numbers approach succeeded in obtaining comparable results with the original approach. A number of evaluations have also been performed including comparing the system{u2019}s results with those of other algorithms using WEKA and RapidMiner as two of the top machine learning tools nowadays. The evaluation results were highly comparable and promising
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Home library Call number Copy number Status Barcode
Thesis Thesis قاعة الرسائل الجامعية - الدور الاول المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.18.02.M.Sc.2018.Ab.A (Browse shelf(Opens below)) Not for loan 01010110079690000
CD - Rom CD - Rom مخـــزن الرســائل الجـــامعية - البدروم المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.18.02.M.Sc.2018.Ab.A (Browse shelf(Opens below)) 79690.CD Not for loan 01020110079690000

Thesis (M.Sc.) - Cairo University - Faculty of Graduate Studies for Statistical Research - Department of Computer and Information Science

Classification is an important technique used in information retrieval. Supervised classification suffers from certain limitations concerning the collection and the labeling of the training dataset. The problem gets more complicated when facing Multi-domain classification where multiple training datasets and classifiers are needed which is typically difficult. This thesis proposes a training-less multi-domain classification approach where each domain is represented by an ontology. A document is mapped on each ontology based on the weights of the mutual tokens between them. A mapping degree for the document with each domain is then determined with the help of fuzzy sets. A Multi-Domain Document Classification information retrieval system (MDDC) is built as an implementation of the proposed approach. A fuzzy matching approach, based on fuzzy triangular numbers, has also been used as another way in determining the mapping degree. The system was tested on a dataset of 180 journal articles of different domains where it succeeded in classifying them with an accuracy of 92.22%. The fuzzy triangular numbers approach succeeded in obtaining comparable results with the original approach. A number of evaluations have also been performed including comparing the system{u2019}s results with those of other algorithms using WEKA and RapidMiner as two of the top machine learning tools nowadays. The evaluation results were highly comparable and promising

Issued also as CD

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer

Local cover image
Share
Under the supervision of New Central Library Manager

Implemented and Customized by: Eng.M.Mohamady
Contact:   info@cl.cu.edu.eg

© All rights reserved  New Central Library