TY  - THES
AU  - Ghada Mohammed Mansour Eissa
AU  - Shahira Shaaban Azab
AU  - Hesham Ahmed Hefny
TI  - Breast cancer prediction using machine learning
U1  - 006.31 
PY  - 2025///
KW  - Machine Learning
KW  - EMGD dataset
KW  - Breast Cancer Prediction
KW  - Deep Learning Models
KW  - Voting Ensemble
N1  - Thesis (M.Sc.)-Cairo University, 2025; Bibliography: pages 89-99; Issued also as CD
N2  - In the past few decades, breast cancer has become a critical global health concern and one 
of the leading causes of death among women worldwide, a trend linked to the 
development of modern human civilization. Early detection and prognosis of breast cancer 
have led to higher cure and survival rates, providing an ideal opportunity for effective 
treatment. Convolutional neural network (CNN) models have proven effective in 
classifying and predicting breast cancer tumors from mammography images. Multiclass 
classification is used to categorize tumors according to the BI-RADS classification system. 
This research aims to compare breast cancer prediction results obtained with deep learning 
(DL) models and to apply voting ensemble techniques to improve accuracy. This 
study is based on a dataset from the Nasser Institute for Research and Treatment in Egypt. 
For the first time, DL models are used on an Egyptian breast cancer x-ray dataset of 1,000 
female cases, based on various genetic, geographic, and environmental factors, to predict 
breast cancer. Investigating such an Egyptian dataset can give a deeper understanding of 
the impact of genetic, geographical, and environmental factors on breast cancer 
development in Egypt. The dataset was collected from the Mammography Unit at the 
Oncology Center of the Nasser Institute for Research and Treatment in Egypt, using the 
HOLOGIC Selenia Dimensions Breast Imaging System. All mammograms were classified 
according to the BI-RADS classification, with four mammogram views (RCC, 
RMLO, LCC, LMLO). A comparative experimental study was conducted on AlexNet, 
GoogleNet-V3, VGG16, ResNet50, and MobileNet models. The results indicate that 
ResNet50 and VGG16 achieved the highest accuracies, 80% and 79.58%, respectively, while 
AlexNet, GoogleNet-V3, and MobileNet achieved accuracies of 73%, 78%, and 79.02%, 
respectively. The voting ensemble over all models scored 91.6% on the same test dataset.
ER  -