header
Image from OpenLibrary

Modeling detection of genetic mutation based on machine learning techniques / Emad Mohamed Fawzy Mashhour ; Supervised Akram Ibrahim Salah , Enas M. Fahmy Elhouby , Khaled Tawfik Wassif

By: Contributor(s): Material type: TextTextLanguage: English Publication details: Cairo : Emad Mohamed Fawzy Mashhour , 2018Description: 123 Leaves : charts ; 30cmOther title:
  • نمذجة الكشف عن التغيرات الوراثيه باستخدام تقنيات تعلم الاله [Added title page title]
Subject(s): Available additional physical forms:
  • Issued also as CD
Dissertation note: Thesis (Ph.D.) - Cairo University - Faculty of Computers and Information - Department of Computer Science Summary: Huge dimensionality in real world datasets is considered an obstacle for data analysis. Unbalanced datasets contain a huge number of features against a few number of samples. An issue may appear when analyzing large datasets, such as noise, irrelevant, and redundant data. This kind of datasets may affect negatively any future decision, and it may lead the classification process to poor performance. Medical datasets are a clear example of huge datasets that need simplification. Microarrays are used to represent samples of genes used to diagnose cancer cases. It works by analyzing a huge number of genes, investigating which genes are activated and responsible for cancer.Gene expression in microarray is the main key for evaluating how much this gene is involved in causing the disease.Feature selection process is considered a solution to overcome the problem of huge dimensionality. It helps in clarifying any kind of dataset. In feature selection, datasets are refined and reduced to a small subset containing the most informative features\genes in the original dataset. Selecting the optimal features\genes can help in improving the performance of the classification process through reducing time and memory storage. Evaluating feature selection process appears through applying a classification process using the produced small subset of data
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Home library Call number Copy number Status Date due Barcode
Thesis Thesis قاعة الرسائل الجامعية - الدور الاول المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.20.03.Ph.D.2018.Em.M (Browse shelf(Opens below)) Not for loan 01010110078749000
CD - Rom CD - Rom مخـــزن الرســائل الجـــامعية - البدروم المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.20.03.Ph.D.2018.Em.M (Browse shelf(Opens below)) 78749.CD Not for loan 01020110078749000

Thesis (Ph.D.) - Cairo University - Faculty of Computers and Information - Department of Computer Science

Huge dimensionality in real world datasets is considered an obstacle for data analysis. Unbalanced datasets contain a huge number of features against a few number of samples. An issue may appear when analyzing large datasets, such as noise, irrelevant, and redundant data. This kind of datasets may affect negatively any future decision, and it may lead the classification process to poor performance. Medical datasets are a clear example of huge datasets that need simplification. Microarrays are used to represent samples of genes used to diagnose cancer cases. It works by analyzing a huge number of genes, investigating which genes are activated and responsible for cancer.Gene expression in microarray is the main key for evaluating how much this gene is involved in causing the disease.Feature selection process is considered a solution to overcome the problem of huge dimensionality. It helps in clarifying any kind of dataset. In feature selection, datasets are refined and reduced to a small subset containing the most informative features\genes in the original dataset. Selecting the optimal features\genes can help in improving the performance of the classification process through reducing time and memory storage. Evaluating feature selection process appears through applying a classification process using the produced small subset of data

Issued also as CD

There are no comments on this title.

to post a comment.