N Gonuguntla
Enhanced Deep Video Summarization Network
Gonuguntla, N; Mandal, B; Puhan, NB
Abstract
Video summarization is understanding video which aims to get an abstract view of the original video sequence by the concatenation of keyframes representing the highlights of the video. In this work, we propose an enhanced deep summarization network (EDSN) to summarize videos. We implement a reinforcement learning based framework to train our EDSN, where we design a novel reward function which considers the spatial and temporal features of the original video to be included in the summary. The reward function is formulated using the spatial and temporal scores obtained for each frame of the video using the temporal segment networks. During training, the reward function seeks to generate a summary by including the frames with high temporal and spatial scores, while the EDSN strives for earning higher rewards by learning to produce more diverse summaries. The method is completely unsupervised since no labels are required during training. Extensive experiments on two benchmark datasets show that the proposed approach achieves state-of-the-art performance.
Citation
Gonuguntla, N., Mandal, B., & Puhan, N. (2019, September). Enhanced Deep Video Summarization Network. Paper presented at 30th British Machine Vision Conference, Cardiff
Presentation Conference Type | Conference Paper (unpublished) |
---|---|
Conference Name | 30th British Machine Vision Conference |
Conference Location | Cardiff |
Start Date | Sep 9, 2019 |
End Date | Sep 12, 2019 |
Acceptance Date | Aug 5, 2019 |
Publication Date | Aug 5, 2019 |
Series Title | British Machine Vision Conference 2019 (BMVC) |
Related Public URLs | https://bmvc2019.org/programme/detailed-programme/ https://researchr.org/publication/bmvc-2019 |
Files
0007Downloaded.pdf
(801 Kb)
PDF
You might also like
Deep Neural Network Based Attention Model for Structural Component Recognition
(2023)
Conference Proceeding
StructureNet: Deep Context Attention Learning for Structural Component Recognition
(2022)
Conference Proceeding
Variance-guided attention-based twin deep network for cross-spectral periocular recognition
(2020)
Journal Article
Multi-Deformation Aware Attention Learning for Concrete Structural Defect Classification
(2020)
Journal Article
Downloadable Citations
About Keele Repository
Administrator e-mail: research.openaccess@keele.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2024
Advanced Search