Advanced techniques in speaker diarization for arabic TV brpadcast / Mohamed Salem Mohamed Elhady ; Supervised Mohsen Abdelrazeq Rashwan , Sehrif Mahdy Abdou

By:

Mohamed Salem Mohamed Elhady

Contributor(s):

Material type: Text

TextLanguage: English Publication details: Cairo : Mohamed Salem Mohamed Elhady , 2017Description: 79 P. : charts , facsimiles ; 30cmOther title:

تقنيات متقدمة في فصل المتحدثين في البث التلفزيوني العربي [Added title page title]

Subject(s):

Online resources:

Click here to access online

Available additional physical forms:

Issued also as CD

Dissertation note: Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Electronics and Communications Summary: Speaker Diarization is known as the task that answers the question, who spoke, when in an audio {uFB01}le or a set of audio {uFB01}les that contain unknown number of speakers. The determination of speaker segments is done in an unsupervised manner. Our Speaker Diarization system composed of two main blocks; Speech Activity Detector and Speaker Clustering. In speech activity detection we propose several solutions including; Phoneme Recognition system, SVMHMM system and i-vector based system. In speaker clustering area we propose an enhancement over state of the art techniques as cosine based Hierarchal Agglomerative Clustering. Such enhancement including enhancing clustering by classi{uFB01}cation methods as SVM, DNN and Random Forrest. Finally we investigated enhancing the i-vector representation via extracting them from a DNN based background model

Tags from this library: No tags from this library for this title. Log in to add tags.

Average rating: 0.0 (0 votes)

Holdings
Item type	Current library	Home library	Call number	Copy number	Status	Barcode
Thesis	قاعة الرسائل الجامعية - الدور الاول	المكتبة المركزبة الجديدة - جامعة القاهرة	Cai01.13.08.M.Sc.2017.Mo.A (Browse shelf(Opens below))		Not for loan	01010110074259000
CD - Rom	مخـــزن الرســائل الجـــامعية - البدروم	المكتبة المركزبة الجديدة - جامعة القاهرة	Cai01.13.08.M.Sc.2017.Mo.A (Browse shelf(Opens below))	74259.CD	Not for loan	01020110074259000

Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Electronics and Communications

Speaker Diarization is known as the task that answers the question, who spoke, when in an audio {uFB01}le or a set of audio {uFB01}les that contain unknown number of speakers. The determination of speaker segments is done in an unsupervised manner. Our Speaker Diarization system composed of two main blocks; Speech Activity Detector and Speaker Clustering. In speech activity detection we propose several solutions including; Phoneme Recognition system, SVMHMM system and i-vector based system. In speaker clustering area we propose an enhancement over state of the art techniques as cosine based Hierarchal Agglomerative Clustering. Such enhancement including enhancing clustering by classi{uFB01}cation methods as SVM, DNN and Random Forrest. Finally we investigated enhancing the i-vector representation via extracting them from a DNN based background model

Issued also as CD

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer