Advanced techniques in speaker diarization for arabic TV brpadcast / Mohamed Salem Mohamed Elhady ; Supervised Mohsen Abdelrazeq Rashwan , Sehrif Mahdy Abdou
Material type:
- تقنيات متقدمة في فصل المتحدثين في البث التلفزيوني العربي [Added title page title]
- Issued also as CD
Item type | Current library | Home library | Call number | Copy number | Status | Barcode | |
---|---|---|---|---|---|---|---|
![]() |
قاعة الرسائل الجامعية - الدور الاول | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.13.08.M.Sc.2017.Mo.A (Browse shelf(Opens below)) | Not for loan | 01010110074259000 | ||
![]() |
مخـــزن الرســائل الجـــامعية - البدروم | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.13.08.M.Sc.2017.Mo.A (Browse shelf(Opens below)) | 74259.CD | Not for loan | 01020110074259000 |
Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Electronics and Communications
Speaker Diarization is known as the task that answers the question, who spoke, when in an audio {uFB01}le or a set of audio {uFB01}les that contain unknown number of speakers. The determination of speaker segments is done in an unsupervised manner. Our Speaker Diarization system composed of two main blocks; Speech Activity Detector and Speaker Clustering. In speech activity detection we propose several solutions including; Phoneme Recognition system, SVMHMM system and i-vector based system. In speaker clustering area we propose an enhancement over state of the art techniques as cosine based Hierarchal Agglomerative Clustering. Such enhancement including enhancing clustering by classi{uFB01}cation methods as SVM, DNN and Random Forrest. Finally we investigated enhancing the i-vector representation via extracting them from a DNN based background model
Issued also as CD
There are no comments on this title.