News video clustering and annotation / Ibrahim Ali Zedan Swelam ; Supervised Khaled Mostafa Elsayed , Eid Mohamed Emary
Material type: TextLanguage: English Publication details: Cairo : Ibrahim Ali Zedan Swelam , 2016Description: 79 Leaves : photographs ; 230cmOther title:- تقسيم و عنونة فيديو الاخبار [Added title page title]
- Issued also as CD
Item type | Current library | Home library | Call number | Copy number | Status | Date due | Barcode | |
---|---|---|---|---|---|---|---|---|
Thesis | قاعة الرسائل الجامعية - الدور الاول | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.20.01.M.Sc.2016.Ib.N (Browse shelf(Opens below)) | Not for loan | 01010110071714000 | |||
CD - Rom | مخـــزن الرســائل الجـــامعية - البدروم | المكتبة المركزبة الجديدة - جامعة القاهرة | Cai01.20.01.M.Sc.2016.Ib.N (Browse shelf(Opens below)) | 71714.CD | Not for loan | 01020110071714000 |
Thesis (M.Sc.) - Cairo University - Faculty of Computers and Information - Department of Information Technology
In order to enable the users of news videos to gather the maximum amount of information contained in the news video in minimum time, we cluster the news video frames into shots using abrupt cut detection, summarize news video shots, and extract captions contained in the news video as annotation clues. A method to detect and localize all caption types in Arabic news videos is proposed. Moreover, different types of captions are considered including static, horizontal scrolling and vertical scrolling captions. Our method is able to deal with different patterns of appearance and disappearance of captions in news video. Also it can deal with news videos with multiple captions. The proposed method is based on edge feature and multiple frames integration. Canny edge map is computed for each frame. Horizontal lines detection is applied and frames are categorized into clusters. Finally, caption types are recognized from each cluster by observing the normalized inter-frame edge map difference. A new representation of images is proposed. We called that representation as 2dominant colors3. The dissimilarity of two images is defined as a vector contains the difference in order of each dominant color between the two images representations. The new image representation and dissimilarity measure are utilized to detect the abrupt cuts in news videos. A neural network is trained with the new dissimilarity measure to differentiate between two classes of news videos frames: cut frames, and non-cut frames. In addition, a key frame extraction method is proposed. The proposed method takes a confidence level as input from the user to satisfy the different needs
Issued also as CD
There are no comments on this title.