Enhanced Deep Video Summarization Network

Gonuguntla, N; Mandal, B; Puhan, NB

Interpretative Attention Networks for Structural Component Recognition (2024)
Presentation / Conference Contribution
Uniyal, A., Mandal, B., Puhan, N. B., & Bera, P. Interpretative Attention Networks for Structural Component Recognition. Presented at 27th International Conference on Pattern Recognition, Kolkata, India

Bridges are essential for enabling movement during environmental disasters and serve as crucial links for rescue and aid delivery. Effective bridge inspection and maintenance are more critical than ever due to increasing severity and frequency of env...

Grid LSTM based Attention Modelling for Traffic Flow Prediction (2024)
Presentation / Conference Contribution
Biju, R., Goparaju, S. U., Gangadharan, D., & Mandal, B. (2024, June). Grid LSTM based Attention Modelling for Traffic Flow Prediction. Presented at 2024 IEEE 99th Vehicular Technology Conference (VTC2024-Spring), Singapore

Traffic flow prediction is an important task that can directly impact the control of traffic flow positively and improve the overall traffic throughput. Although a large number of studies have been performed to improve traffic flow prediction, there...

Towards Quantification of Eye Contacts Between Trainee Doctors and Simulated Patients in Consultation Videos (2024)
Presentation / Conference Contribution
Deshmukh, Y., Mandal, B., Yeates, P., & Watson, J. (2024, September). Towards Quantification of Eye Contacts Between Trainee Doctors and Simulated Patients in Consultation Videos. Presented at First International Conference, AIiH 2024, Swansea, UK

Unified Deep Ensemble Architecture for Multiple Classification Tasks (2024)
Presentation / Conference Contribution
Mistry, K. A. J., & Mandal, B. (2024, August). Unified Deep Ensemble Architecture for Multiple Classification Tasks. Presented at 2024 Intelligent Systems Conference (IntelliSys), Amsterdam, The Netherlands

Banks face regular challenges in making decisions for ever increasing need for bank loans. Most banks use applicant’s financial situations, their past history, affordability checks, credit score and risk assessment, which are time consuming, challeng...

Visual Attention Assisted Games (2023)
Presentation / Conference Contribution
Mandal, B., Puhan, N. B., & Homi Anil, V. (2023, August). Visual Attention Assisted Games. Presented at IEEE Symposium on Computational Intelligence and Games, CIG, Boston, MA, USA

In this work, we propose a committee of attention models developed for improving the deep reinforcement learning frequently used for games. The game environment is manifested with spatial and temporal attention mechanisms so as to focus on important...

Optimization and Performance Evaluation of Hybrid Deep Learning Models for Traffic Flow Prediction (2023)
Presentation / Conference Contribution
Goparaju, S. U., Biju, R., M, P., MC, B., Gangadharan, D., Mandal, B., & C, P. (2023, June). Optimization and Performance Evaluation of Hybrid Deep Learning Models for Traffic Flow Prediction. Presented at 2023 IEEE 97th Vehicular Technology Conference (VTC2023-Spring), Florence, Italy

Traffic flow prediction has been regarded as a critical problem in intelligent transportation systems. An accurate prediction can help mitigate congestion and other societal problems while facilitating safer, cost and time-efficient travel. However,...

Deep Neural Network Based Attention Model for Structural Component Recognition (2023)
Presentation / Conference Contribution
Sarangi, S., & Mandal, B. (2023, February). Deep Neural Network Based Attention Model for Structural Component Recognition. Presented at 18th International Joint Conference on Computer Vision, Imaging and Computer Graphics Theory and Applications VISIGRAPP, Lisbon, Portugal

The recognition of structural components from images/videos is a highly complex task because of the appearance of huge components and their extended existence alongside, which are relatively small components. The latter is frequently overestimated or...

Proceedings of the 35th International BCS Human Computer Interaction Conference (HCI 2022) - Index (2022)
Presentation / Conference Contribution
de Quincey, E., Woolley, S. I., Ortolani, M., Misirli, G., Mandal, B., Kanwal, N., Mitchell, J., & Rooney, J. (2022, July). Proceedings of the 35th International BCS Human Computer Interaction Conference (HCI 2022) - Index. Presented at 35th International BCS Human-Computer Interaction Conference (HCI2022), Keele, Staffordshire, England, UK

Kernelized dynamic convolution routing in spatial and channel interaction for attentive concrete defect recognition (2022)
Journal Article
Mandal, B. (2022). Kernelized dynamic convolution routing in spatial and channel interaction for attentive concrete defect recognition. Signal Processing: Image Communication, 116818 - 116818. https://doi.org/10.1016/j.image.2022.116818

Image/video based defect recognition is a crucial task in automating visual inspection of concrete structures. Although some progress has been made to automatically recognize defects in concrete structural images, significant challenges still exist....

MacularNet: Towards Fully Automated Attention-Based Deep CNN for Macular Disease Classification (2022)
Journal Article
Mandal, B. (2022). MacularNet: Towards Fully Automated Attention-Based Deep CNN for Macular Disease Classification. https://doi.org/10.1007/s42979-022-01024-0

In this work, we propose an attention-based deep convolutional neural network (CNN) model as an assistive computer-aided tool to classify common types of macular diseases: age-related macular degeneration, dia...

StructureNet: Deep Context Attention Learning for Structural Component Recognition (2022)
Presentation / Conference Contribution
Kaothalkar, A., Mandal, B., & Puhan, N. (2022, February). StructureNet: Deep Context Attention Learning for Structural Component Recognition. Presented at 17th International Conference on Computer Vision Theory and Applications, Virtual

Structural component recognition using images is a very challenging task due to the appearance of large components and their long continuation, existing jointly with very small components, the latter are often outcasted/missed by the existing methodo...

Perturbed Composite Attention Model for Macular Optical Coherence Tomography Image Classification (2021)
Journal Article
Mishra, S. S., Mandal, B., & Puhan, N. B. (2021). Perturbed Composite Attention Model for Macular Optical Coherence Tomography Image Classification. IEEE Transactions on Artificial Intelligence, 3(4), 625-635. https://doi.org/10.1109/tai.2021.3135797

In this article, we propose a deep architecture stemming from a perturbed composite attention mechanism with the following two novel attention modules: Multilevel perturbed spatial attention (MPSA) and multidimension attention (MDA) for macular optic...

Stand-Alone Composite Attention Network for Concrete Structural Defect Classification (2021)
Journal Article
Bhattacharya, G., Puhan, N. B., & Mandal, B. (2021). Stand-Alone Composite Attention Network for Concrete Structural Defect Classification. IEEE Transactions on Artificial Intelligence, 3(2), 265-274. https://doi.org/10.1109/tai.2021.3114385

Automation in structural health monitoring involves a critical step of automatic classification of concrete defect images/videos. Although interdisciplinary research community in AI has responded with some progress, immense challenges are still invol...

Interleaved Deep Artifacts-Aware Attention Mechanism for Concrete Structural Defect Classification (2021)
Journal Article
Mandal, B. (2021). Interleaved Deep Artifacts-Aware Attention Mechanism for Concrete Structural Defect Classification. IEEE Transactions on Image Processing, 6957-6969. https://doi.org/10.1109/TIP.2021.3100556

Automatic machine classification of concrete structural defects in images poses significant challenges because of multitude of problems arising from the surface texture, such as presence of stains, holes, colors, poster remains, graffiti, marking and...

Deep Regularized Discriminative Network (2021)
Journal Article
Mandal, B. (2021). Deep Regularized Discriminative Network. https://doi.org/10.1007/s42979-021-00647-z

Traditional linear discriminant analysis (LDA) approach discards the eigenvalues which are very small or equivalent to zero, but quite often eigenvectors corresponding to zero eigenvalues are the important dimensions for discriminant analysis. We pro...

GlaucoNet: Patch-Based Residual Deep Learning Network for Optic Disc and Cup Segmentation Towards Glaucoma Assessment (2021)
Journal Article
Mandal, B. (2021). GlaucoNet: Patch-Based Residual Deep Learning Network for Optic Disc and Cup Segmentation Towards Glaucoma Assessment. https://doi.org/10.1007/s42979-021-00491-1

Glaucoma is a chronic eye condition causing irreversible vision damage and presently stands as the second leading cause of blindness worldwide. Damaged optic disc and optic cup assessment in color fundus image has been shown to be a promising method...

Twin Deep Convolutional Neural Network-based Cross-spectral Periocular Recognition (2020)
Presentation / Conference Contribution
Behera, S. S., Mandal, B., & Puhan, N. B. (2020, February). Twin Deep Convolutional Neural Network-based Cross-spectral Periocular Recognition. Presented at 2020 National Conference on Communications (NCC), Kharagpur, India

Multi-level Dual-attention Based CNN for Macular Optical Coherence Tomography Classification (2019)
Journal Article
Mandal, B. (2019). Multi-level Dual-attention Based CNN for Macular Optical Coherence Tomography Classification. IEEE Signal Processing Letters, 1793-1797. https://doi.org/10.1109/LSP.2019.2949388

In this letter, we propose a multi-level dual-attention model to classify two common macular diseases, age-related macular degeneration (AMD) and diabetic macular edema (DME) from normal macular eye conditions using optical coherence tomography (OCT)...

Improved Lifelog Ego-centric Video Summarization Using Ensemble of Deep Learned Object Features (2019)
Presentation / Conference Contribution
Mandal, B., & Mainwaring, P. (2019, September). Improved Lifelog Ego-centric Video Summarization Using Ensemble of Deep Learned Object Features. Presented at 30th British Machine Vision Conference, Cardiff

The ImageCLEF 2017 lifelog summarization challenge [10, 12] was established to develop a benchmark for summarizing egocentric lifelogging videos based on our daily activities, such as ‘commute to work’ or ‘cooking at home’. In this paper, we propose...

Enhanced Deep Video Summarization Network (2019)
Presentation / Conference Contribution
Gonuguntla, N., Mandal, B., & Puhan, N. (2019, September). Enhanced Deep Video Summarization Network. Paper presented at 30th British Machine Vision Conference, Cardiff

Video summarization is understanding video which aims to get an abstract view of the original video sequence by the concatenation of keyframes representing the highlights of the video. In this work, we propose an enhanced deep summarization network (...
