header
Local cover image
Local cover image
Image from OpenLibrary

Advanced techniques in speaker diarization for arabic TV brpadcast / Mohamed Salem Mohamed Elhady ; Supervised Mohsen Abdelrazeq Rashwan , Sehrif Mahdy Abdou

By: Contributor(s): Material type: TextTextLanguage: English Publication details: Cairo : Mohamed Salem Mohamed Elhady , 2017Description: 79 P. : charts , facsimiles ; 30cmOther title:
  • تقنيات متقدمة في فصل المتحدثين في البث التلفزيوني العربي [Added title page title]
Subject(s): Online resources: Available additional physical forms:
  • Issued also as CD
Dissertation note: Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Electronics and Communications Summary: Speaker Diarization is known as the task that answers the question, who spoke, when in an audio {uFB01}le or a set of audio {uFB01}les that contain unknown number of speakers. The determination of speaker segments is done in an unsupervised manner. Our Speaker Diarization system composed of two main blocks; Speech Activity Detector and Speaker Clustering. In speech activity detection we propose several solutions including; Phoneme Recognition system, SVMHMM system and i-vector based system. In speaker clustering area we propose an enhancement over state of the art techniques as cosine based Hierarchal Agglomerative Clustering. Such enhancement including enhancing clustering by classi{uFB01}cation methods as SVM, DNN and Random Forrest. Finally we investigated enhancing the i-vector representation via extracting them from a DNN based background model
Tags from this library: No tags from this library for this title. Log in to add tags.
Star ratings
    Average rating: 0.0 (0 votes)
Holdings
Item type Current library Home library Call number Copy number Status Barcode
Thesis Thesis قاعة الرسائل الجامعية - الدور الاول المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.13.08.M.Sc.2017.Mo.A (Browse shelf(Opens below)) Not for loan 01010110074259000
CD - Rom CD - Rom مخـــزن الرســائل الجـــامعية - البدروم المكتبة المركزبة الجديدة - جامعة القاهرة Cai01.13.08.M.Sc.2017.Mo.A (Browse shelf(Opens below)) 74259.CD Not for loan 01020110074259000

Thesis (M.Sc.) - Cairo University - Faculty of Engineering - Department of Electronics and Communications

Speaker Diarization is known as the task that answers the question, who spoke, when in an audio {uFB01}le or a set of audio {uFB01}les that contain unknown number of speakers. The determination of speaker segments is done in an unsupervised manner. Our Speaker Diarization system composed of two main blocks; Speech Activity Detector and Speaker Clustering. In speech activity detection we propose several solutions including; Phoneme Recognition system, SVMHMM system and i-vector based system. In speaker clustering area we propose an enhancement over state of the art techniques as cosine based Hierarchal Agglomerative Clustering. Such enhancement including enhancing clustering by classi{uFB01}cation methods as SVM, DNN and Random Forrest. Finally we investigated enhancing the i-vector representation via extracting them from a DNN based background model

Issued also as CD

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer

Local cover image
Share
Under the supervision of New Central Library Manager

Implemented and Customized by: Eng.M.Mohamady
Contact:   info@cl.cu.edu.eg

© All rights reserved  New Central Library