Audio-visual matching assisted in sources speech separation / By Ghada Dahy Fathy Kamel; Supervisors Prof. Reda Abd Elwahab Ahmed El-Khoribi, Prof. Mahmoud Ahmed Ismail Ibrahim Shoman, Dr. Mohamed Ahmed Ahmed Refaey.

By:

Ghada Dahy Fathy Kamel [preparation.]

Material type: Text

Language: English

Summary language: English, Arabic

Producer: 2023

Description: 86 leaves : illustrations ; 30 cm. + CD

Content type: text

Media type: unmediated

Carrier type: volume

Other title:

المطابفة الصوتية و المرئية للمساعدة في فصل مصادر الكلام [Added title page title]

DDC classification:

006.5

Available additional physical forms:

Issued also as CD.

Dissertation note: Thesis (Ph.D)-Cairo University, 2023.

Summary: The proposed speech separation model can be applied to speech separation, automatic speech recognition (ASR) systems, and the creation of single-speaker speech databases. Speech separation is a difficult problem when only audio information is used, so visual and auditory signals are combined to complete the separation process. The model consists of four modules: two for the audio signal, one for visual features, and a final module that concatenates the features produced by the previous three to generate the separated signals. Speech enhancement is the process of improving audio quality with respect to the target speaker.

Summary (translated from Arabic): Speech separation is one of the most complex problems when only audio information is used, so visual and auditory signals are combined to complete the separation successfully. The speech separation model consists of four modules: two for audio signals, one for visual features, and a final one that links the features produced by the previous three. Speech enhancement improves the audio quality of the target speaker while reducing the effect of other sounds. It can be used in many applications, such as speech recognition, mobile phones, and aids for people with hearing impairment, as well as for improving the audio files produced by speech separation models.

Holdings
Item type	Current library	Home library	Call number	Status	Barcode
Thesis	Theses Hall - First Floor	New Central Library - Cairo University	Cai01.20.01.Ph.D.2023.Gh.A.	Not for loan	01010110089877000

Bibliography: pages 78-86.

Text in English; abstract in English and Arabic.
