News video clustering and annotation / Ibrahim Ali Zedan Swelam ; Supervised Khaled Mostafa Elsayed , Eid Mohamed Emary

By:

Ibrahim Ali Zedan Swelam

Contributor(s):

Material type: Text

TextLanguage: English Publication details: Cairo : Ibrahim Ali Zedan Swelam , 2016Description: 79 Leaves : photographs ; 230cmOther title:

تقسيم و عنونة فيديو الاخبار [Added title page title]

Subject(s):

Online resources:

Click here to access online

Available additional physical forms:

Issued also as CD

Dissertation note: Thesis (M.Sc.) - Cairo University - Faculty of Computers and Information - Department of Information Technology Summary: In order to enable the users of news videos to gather the maximum amount of information contained in the news video in minimum time, we cluster the news video frames into shots using abrupt cut detection, summarize news video shots, and extract captions contained in the news video as annotation clues. A method to detect and localize all caption types in Arabic news videos is proposed. Moreover, different types of captions are considered including static, horizontal scrolling and vertical scrolling captions. Our method is able to deal with different patterns of appearance and disappearance of captions in news video. Also it can deal with news videos with multiple captions. The proposed method is based on edge feature and multiple frames integration. Canny edge map is computed for each frame. Horizontal lines detection is applied and frames are categorized into clusters. Finally, caption types are recognized from each cluster by observing the normalized inter-frame edge map difference. A new representation of images is proposed. We called that representation as 2dominant colors3. The dissimilarity of two images is defined as a vector contains the difference in order of each dominant color between the two images representations. The new image representation and dissimilarity measure are utilized to detect the abrupt cuts in news videos. A neural network is trained with the new dissimilarity measure to differentiate between two classes of news videos frames: cut frames, and non-cut frames. In addition, a key frame extraction method is proposed. The proposed method takes a confidence level as input from the user to satisfy the different needs

Tags from this library: No tags from this library for this title. Log in to add tags.

Average rating: 0.0 (0 votes)

Holdings
Item type	Current library	Home library	Call number	Copy number	Status	Barcode
Thesis	قاعة الرسائل الجامعية - الدور الاول	المكتبة المركزبة الجديدة - جامعة القاهرة	Cai01.20.01.M.Sc.2016.Ib.N (Browse shelf(Opens below))		Not for loan	01010110071714000
CD - Rom	مخـــزن الرســائل الجـــامعية - البدروم	المكتبة المركزبة الجديدة - جامعة القاهرة	Cai01.20.01.M.Sc.2016.Ib.N (Browse shelf(Opens below))	71714.CD	Not for loan	01020110071714000

Thesis (M.Sc.) - Cairo University - Faculty of Computers and Information - Department of Information Technology

In order to enable the users of news videos to gather the maximum amount of information contained in the news video in minimum time, we cluster the news video frames into shots using abrupt cut detection, summarize news video shots, and extract captions contained in the news video as annotation clues. A method to detect and localize all caption types in Arabic news videos is proposed. Moreover, different types of captions are considered including static, horizontal scrolling and vertical scrolling captions. Our method is able to deal with different patterns of appearance and disappearance of captions in news video. Also it can deal with news videos with multiple captions. The proposed method is based on edge feature and multiple frames integration. Canny edge map is computed for each frame. Horizontal lines detection is applied and frames are categorized into clusters. Finally, caption types are recognized from each cluster by observing the normalized inter-frame edge map difference. A new representation of images is proposed. We called that representation as 2dominant colors3. The dissimilarity of two images is defined as a vector contains the difference in order of each dominant color between the two images representations. The new image representation and dissimilarity measure are utilized to detect the abrupt cuts in news videos. A neural network is trained with the new dissimilarity measure to differentiate between two classes of news videos frames: cut frames, and non-cut frames. In addition, a key frame extraction method is proposed. The proposed method takes a confidence level as input from the user to satisfy the different needs

Issued also as CD

There are no comments on this title.

to post a comment.

Click on an image to view it in the image viewer