Enhanced Approach for Group Activity Recognition Using Deep Learning / by Aly Fathy Aly El Said Radwan ; Supervised by Prof. Ammar Mohammed, Dr. Ahmed Hamza

By:

Aly Fathy Aly El Said Radwan [preparation.]

Contributor(s):

Material type: Text

TextLanguage: English Summary language: English, Arabic Producer: 2023Description: 111 leaves : illustrations ; 30 cm. + CDContent type:

text

Media type:

Unmediated

Carrier type:

volume

Other title:

أسلوب محسن للتعرف علي نشاط مجموعه من الأفراد بإستخدام التعلم العميق [Added title page title]

Subject(s):

DDC classification:

Available additional physical forms:

Issued also as CD

Dissertation note: Thesis (M.Sc.)-Cairo University, 2023. Summary: Human action recognition (HAR) is the process of interpreting human action using computer and machine vision technology. Research in this area has advanced significantly recently in different application domains. However, the absence of suitable, sizeable, trained datasets still limits HAR in several areas of domains. Many factors impact the availability of suitable HAR data in specific domains. One of those interesting industrial domains is garment sewing. Many world reports were published to illustrate the progress of the worldwide garment sewing industry. To our knowledge, HAR data for the garment sewing industry is limited. Thus, in this Thesis, we review recent works in HAR approaches and propose a new HAR dataset, called VARSew, specifically designed for recognizing human sewing actions during production time to overcome this limitation. VARSew is designed to capture multiple research challenges in HAR. We provide two types of video classification tasks: binary and multi-class. The Data contains 3, 121 video segments of 49, 936 frames. We conduct extensive benchmark experiments with 32 state-of-the-art (SOTA) HAR models and compare their performance. The results on both binary and multi-class reveal that the best accuracy scores are 80% and 95% for binary and multi-class classification, respectively. As a result, deep learning vision transformer (Vit) models outperformed deep learning convolutional and ResNet models on both the VARSew multi-class and VARSew binary-class datasets.Summary: التعرف علي الإجراءات البشريه هو عملية تفسير للعمل البشري باستخدام تكنولوجيا الحاسوب والرؤية الآلية. وقد أحرزت البحوث في هذا المجال تقدما كبيرا مؤخرا في مجالات تطبيق مختلفة. ومع ذلك، فإن عدم وجود مجموعات بيانات مناسبة وكبيرة مدربة لا يزال يحد من تقييم التعرف علي الإجراءات البشريه في عدة مجالات. وتؤثر عوامل كثيرة على توافر بيانات مناسبة للتعرف علي الإجراءات البشريه في بعض المجالات. وأحد هذه المجالات الصناعية المثيرة للاهتمام هو خياطة الملابس. وقد نُشرت العديد من التقارير العالمية لتوضيح التقدم المحرز في صناعة خياطة الملابس على نطاق العالم. وعلى حد علمنا، فإن البيانات المتعلقة بصناعة خياطة الملابس محدودة. وبالتالي، في هذه الرساله، نستعرض الأعمال الحديثه في نُ التعرف علي الإجراءات البشريه ونقترح مجموعة بيانات جديدة من هذا النوع، ُهج البحث والتطوير في مجال ، مصممة خصيصا للتعرف علي الإجراءات البشريه في مجال صناعة خياطة الملابس أثناء فترة ً تسمى VARSew الإنتاج للتغلب على هذا التقييد. وقد ُصِّممت مجموعه بيانات VARSew لتتيح العديد من التحديات البحثية المتعلقه بمجال التعرف علي الإجراءات البشريه. حيث أنها توفر نوعين من مهام تصنيف الفيديو: ثنائي ومتعدد التصنيف. تتضمن البيانات أجزاء فيديو تبلغ عددها ١٢١ ٣, فيديو من أطرات عمل تبلغ ,٩٣٦ ٤٩ أطار. وتم أجراء تجارب قياسية واسعة النطاق مع نماذج أحدث طرازات التعرف علي الإجراءات البشريه التي تبلغ ٣٢ طراز، ونقارن أدائها. وتكشف النتائج المتعلقة بكل من الأجرأت الثنائية والمتعددة وحصلنا علي أن أفضل درجات الدقة هي ٨٠ بالمائة و٩٥ بالمائةللتصنيف الثنائي والتصنيف المتعدد الإجراءات ، على التوالي ونتيجة لذلك، تفوقت نماذج محِّول رؤية التعلم العميق (Vit)(علي التعلم العميق المتطور للشبكات العصبية التلافيفية في كل من نموذجي المتعدد التصنيف والثنائي التصنيفين لمجوعه بيانات. VARSew

Tags from this library: No tags from this library for this title. Log in to add tags.

Average rating: 0.0 (0 votes)

Holdings
Item type	Current library	Home library	Call number	Status	Barcode
Thesis	قاعة الرسائل الجامعية - الدور الاول	المكتبة المركزبة الجديدة - جامعة القاهرة	Cai01.18.02.M.Sc.2023.Al.E (Browse shelf(Opens below))	Not for loan	01010110089800000

Browsing المكتبة المركزبة الجديدة - جامعة القاهرة shelves Close shelf browser (Hides shelf browser)

Previous	No cover image available	No cover image available	No cover image available	No cover image available	No cover image available	No cover image available	No cover image available	Next
Previous	Cai01.18.02.M.Sc.2022.Is.A An Approach for Expanding theStorage Capacity and Accelerating the Response of QR-Code /	Cai01.18.02.M.Sc.2022.Mo.S Sign Language Translation using Deep Nets /	Cai01.18.02.M.Sc.2022.No.P A Proposed Fuzzy Approach for Reducing Energy Consumption in IoT Temperature Monitoring System /	Cai01.18.02.M.Sc.2023.Al.E Enhanced Approach for Group Activity Recognition Using Deep Learning /	Cai01.18.02.M.Sc.2023.Ge.E An Enhanced Approach For Classification Of Common Thorax Diseases Using Deep Learning/	Cai01.18.02.M.Sc.2023.Ma.E An Enhanced approach for breast cancer detection in histopathology images using deep learning /	Cai01.18.02.M.Sc.2023.Mo.P A proposed approach for uncertain decision making using interval based probabilistic automata /	Next

Thesis (M.Sc.)-Cairo University, 2023.

Bibliography: pages 84-111.

Human action recognition (HAR) is the process of interpreting human action using computer and machine vision technology. Research in this area has advanced significantly recently in different application domains. However, the absence of suitable, sizeable, trained datasets still limits HAR in several areas of domains. Many factors impact the availability of suitable HAR data in specific domains. One of those interesting industrial domains is garment sewing. Many world reports were published to illustrate the progress of the worldwide garment sewing industry. To our knowledge, HAR data for the garment sewing industry is limited. Thus, in this Thesis, we review recent works in HAR approaches and propose a new HAR dataset, called VARSew, specifically designed for recognizing human sewing actions during production time to overcome this limitation. VARSew is designed to capture multiple research challenges in HAR. We provide two types of video classification tasks: binary and multi-class. The Data contains 3, 121 video segments of 49, 936 frames. We conduct extensive benchmark experiments with 32 state-of-the-art (SOTA) HAR models and compare their performance. The results on both binary and multi-class reveal that the best accuracy scores are 80% and 95% for binary and multi-class classification, respectively. As a result, deep learning vision transformer (Vit) models outperformed deep learning convolutional and ResNet models on both the VARSew multi-class and VARSew binary-class datasets.

التعرف علي الإجراءات البشريه هو عملية تفسير للعمل البشري باستخدام تكنولوجيا الحاسوب والرؤية الآلية. وقد أحرزت البحوث في هذا المجال تقدما كبيرا مؤخرا في مجالات تطبيق مختلفة. ومع ذلك، فإن عدم وجود مجموعات بيانات مناسبة وكبيرة مدربة لا يزال يحد من تقييم التعرف علي الإجراءات البشريه في عدة مجالات. وتؤثر عوامل كثيرة على توافر بيانات مناسبة للتعرف علي الإجراءات البشريه في بعض المجالات. وأحد هذه المجالات الصناعية المثيرة للاهتمام هو خياطة الملابس.
وقد نُشرت العديد من التقارير العالمية لتوضيح التقدم المحرز في صناعة خياطة الملابس على نطاق العالم. وعلى حد علمنا، فإن البيانات المتعلقة بصناعة خياطة الملابس محدودة. وبالتالي، في هذه الرساله، نستعرض الأعمال الحديثه في نُ التعرف علي الإجراءات البشريه ونقترح مجموعة بيانات جديدة من هذا النوع، ُهج البحث والتطوير في مجال ، مصممة خصيصا للتعرف علي الإجراءات البشريه في مجال صناعة خياطة الملابس أثناء فترة ً تسمى VARSew الإنتاج للتغلب على هذا التقييد.
وقد ُصِّممت مجموعه بيانات VARSew لتتيح العديد من التحديات البحثية المتعلقه بمجال التعرف علي الإجراءات البشريه. حيث أنها توفر نوعين من مهام تصنيف الفيديو: ثنائي ومتعدد التصنيف. تتضمن البيانات أجزاء فيديو تبلغ عددها ١٢١ ٣, فيديو من أطرات عمل تبلغ ,٩٣٦ ٤٩ أطار. وتم أجراء تجارب قياسية واسعة النطاق مع نماذج أحدث طرازات التعرف علي الإجراءات البشريه التي تبلغ ٣٢ طراز، ونقارن أدائها. وتكشف النتائج المتعلقة بكل من الأجرأت الثنائية والمتعددة وحصلنا علي أن أفضل درجات الدقة هي ٨٠ بالمائة و٩٥ بالمائةللتصنيف الثنائي والتصنيف المتعدد الإجراءات ، على التوالي
ونتيجة لذلك، تفوقت نماذج محِّول رؤية التعلم العميق (Vit)(علي التعلم العميق المتطور للشبكات العصبية التلافيفية في كل من نموذجي المتعدد التصنيف والثنائي التصنيفين لمجوعه بيانات. VARSew

Issued also as CD

Text in English and abstract in Arabic & English.

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer