Zero-shot deep learning for media mining : person spotting and face clustering in video big data / Mohamed Sameer Mohamed Abdallah ; Supervised Amr G. Wassal , Elsayed E. Hemayed , Mohammad E. Ragab
Material type: TextLanguage: English Publication details: Cairo : Mohamed Sameer Mohamed Abdallah , 2020Description: 106 P. : charts , facimiles , photoghrphs ; 30cmOther title:- التعلم العميق للتنقيب فى وسائل الإعلام : اكتشاف الأشخاص وتجميع الوجوه فى بيانات الفيديو الكبيرة [Added title page title]
- Issued also as CD
Item type | Current library | Home library | Call number | Copy number | Status | Date due | Barcode | |
---|---|---|---|---|---|---|---|---|
Thesis | قاعة الرسائل الجامعية - الدور الاول | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.13.06.Ph.D.2020.Mo.Z (Browse shelf(Opens below)) | Not for loan | 01010110081393000 | |||
CD - Rom | مخـــزن الرســائل الجـــامعية - البدروم | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.13.06.Ph.D.2020.Mo.Z (Browse shelf(Opens below)) | 81393.CD | Not for loan | 01020110081393000 |
Thesis (Ph.D.) - Cairo University - Faculty of Engineering - Department of Computer Engineering
In this thesis, we propose a TV media mining system that is based on a deep convolutional neural network approach, which has been trained with a triplet loss minimization method. The main function of the proposed system is the indexing and clustering of video data for achieving an effective media production analysis of individuals in talk show videos and rapidly identifying a specific individual in video data in real-time processing.Our system uses several face datasets from Labeled Faces in the Wild (LFW), which is a collection of unlabeled web face images, as well as YouTube Faces and talk show faces datasets. In the recognition (person spotting) task, our system achieves an F-measure of 0.996 for the collection of unlabeled web face images dataset and an F-measure of 0.972 for the talk show faces dataset. In the clustering task, our system achieves an F-measure of 0.764 and 0.935 for the YouTube Faces database and the LFW dataset, respectively, while achieving an F-measure of 0.832 for the talk show faces dataset, an improvement of 5.4%, 6.5%, and 8.2% over the previous methods
Issued also as CD
There are no comments on this title.