S Sarangi
Deep Neural Network Based Attention Model for Structural Component Recognition
Sarangi, S; Mandal, B
Abstract
The recognition of structural components from images/videos is a highly complex task because of the appearance of huge components and their extended existence alongside, which are relatively small components. The latter is frequently overestimated or overlooked by existing methodologies. For the purpose of automating bridge visual inspection efficiently, this research examines and aids vision-based automated bridge component recognition. In this work, we propose a novel deep neural network-based attention model (DNNAM) architecture, which comprises synchronous dual attention modules (SDAM) and residual modules to recognise structural components. These modules help us to extract local discriminative features from structural component images and classify different categories of bridge components. These innovative modules are constructed at the contextual level of information encoding across spatial and channel dimensions.
Citation
Sarangi, S., & Mandal, B. (2023, February). Deep Neural Network Based Attention Model for Structural Component Recognition. Presented at 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications VISIGRAPP, Lisbon, Portugal
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications VISIGRAPP |
Start Date | Feb 19, 2023 |
End Date | Feb 21, 2023 |
Acceptance Date | Jan 1, 2023 |
Publication Date | Jan 1, 2023 |
Series Title | 18th International Conference on Computer Vision Theory and Applications |
Series ISSN | 2184-4321 |
ISBN | 978-989-758-634-7 |
DOI | https://doi.org/10.5220/0011688400003417 |
Keywords | Synchronous Attention, Dual Attention Network, Structural Component Recognition |
Public URL | https://keele-repository.worktribe.com/output/425949 |
Publisher URL | https://www.scitepress.org/Link.aspx?doi=10.5220/0011688400003417 |
Related Public URLs | https://visapp.scitevents.org/?y=2023 |
Files
VISAPP_2023_143_CR.pdf
(816 Kb)
PDF
Publisher Licence URL
https://creativecommons.org/licenses/by-nc-nd/4.0/
You might also like
Interpretative Attention Networks for Structural Component Recognition
(2024)
Presentation / Conference Contribution
Grid LSTM based Attention Modelling for Traffic Flow Prediction
(2024)
Presentation / Conference Contribution
Towards Quantification of Eye Contacts Between Trainee Doctors and Simulated Patients in Consultation Videos
(2024)
Presentation / Conference Contribution
Unified Deep Ensemble Architecture for Multiple Classification Tasks
(2024)
Presentation / Conference Contribution
Visual Attention Assisted Games
(2023)
Presentation / Conference Contribution
Downloadable Citations
About Keele Repository
Administrator e-mail: research.openaccess@keele.ac.uk
This application uses the following open-source libraries:
SheetJS Community Edition
Apache License Version 2.0 (http://www.apache.org/licenses/)
PDF.js
Apache License Version 2.0 (http://www.apache.org/licenses/)
Font Awesome
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search