TY - BOOK
AU - Omnia Ismail Mohammad Ismail
AU - Hatem Mohamed Moharram
AU - Laila Fahmy Abdelal
AU - Nasser Hassan Sweilam
TI - Fault tolerance scheme for some mathematical models
PY - 2015///
CY - Cairo
PB - Omnia Ismail Mohammad Ismail
KW - Algorithm based fault tolerance
KW - Diskless checkpointing
KW - Fault tolerance
N1 - Thesis (M.Sc.) - Cairo University - Faculty of Science - Department of Mathematics; Issued also as CD
N2 - This thesis has two purposes. The first is to study the numerical solution of fractional-order differential equations on computer cluster machines and to measure the efficiency of the solution algorithm when it is run on such machines using a parallel programming model. The second is to detect and handle faults that may occur while the solution algorithm is running. A parallel Crank-Nicolson finite difference method (P-CN-FDM) is presented for solving a time-fractional parabolic equation on distributed memory systems. The resulting large sparse system of equations is solved using a parallel preconditioned conjugate gradient algorithm (PPCG) implemented with a two-level parallel programming model. A series of tests was carried out on a Linux PC cluster using different problem sizes and different numbers of processes and nodes. The proposed algorithm shows a substantial improvement in total execution time and memory utilization compared with previously proposed techniques. An online algorithm based fault tolerance technique (online ABFT) for detecting soft errors in Krylov [...]osed technique is explained using the preconditioned conjugate gradient method (PCG). Experimental results showed a good improvement in execution time compared with a disk-based checkpointing technique
UR - http://172.23.153.220/th.pdf
ER -