000 01977cam a2200313 a 4500
003 EG-GiCUC
008 151229s2015 ua e f m 000 0 eng d
040 _aEG-GiCUC
_beng
_cEG-GiCUC
041 0 _aeng
049 _aDeposite
097 _aM.Sc
099 _aCai01.13.06.M.Sc.2015.Mo.N
100 0 _aMohamed Abdelrahman Zahran Mohamed
245 1 0 _aNew trends for building arabic language resources /
_cMohamed Abdelrahman Zahran Mohamed ; Supervised Amir Atyia , Mohsen Rashwan
246 1 5 _aاتجاهات جديدة لبناء موارد اللغة العربية
260 _aCairo :
_bMohamed Abdelrahman Zahran Mohamed ,
_c2015
300 _a87 P. :
_bplans ;
_c30cm
502 _aThesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Computer Engineering
520 _aLanguage resources are important factor in any natural language processing application. However, the language resource support for Arabic is not mature because the existing Arabic language resources are either scattered, inconsistent or even incomplete. To solve this problem, first, we automatically bootstrap a rich Arabic language resource leveraging the existing resources. Next, we build the largest statistical Arabic language resource, and then introduce a new technique to map this statistical Arabic resource to the English counterpart outperforming standard techniques in this task. Finally, using the new statistical methods we present a novel hoto here technique to link conventional Arabic language resources to English using cross lingual lexical substitution outperforming the state of the art system in this problem
530 _aIssued also as CD
653 4 _aArabic language resources integration
653 4 _aNatural language processing
653 4 _aText similarity
700 0 _aAmir Atyia ,
_eSupervisor
700 0 _aMohsen Rashwan ,
_eSupervisor
905 _aNazla
_eRevisor
905 _aSamia
_eCataloger
942 _2ddc
_cTH
999 _c54175
_d54175