Skip to main content

Research Repository

Advanced Search

All Outputs (47)

Visual Attention Assisted Games (2023)
Conference Proceeding
Mandal, B., Puhan, N. B., & Homi Anil, V. (2023). Visual Attention Assisted Games. In 2023 IEEE Conference on Games (CoG). https://doi.org/10.1109/cog57401.2023.10333186

In this work, we propose a committee of attention models developed for improving the deep reinforcement learning frequently used for games. The game environment is manifested with spatial and temporal attention mechanisms so as to focus on important... Read More about Visual Attention Assisted Games.

Optimization and Performance Evaluation of Hybrid Deep Learning Models for Traffic Flow Prediction (2023)
Conference Proceeding
Goparaju, S. U., Biju, R., M, P., MC, B., Gangadharan, D., Mandal, B., & C, P. (2023). Optimization and Performance Evaluation of Hybrid Deep Learning Models for Traffic Flow Prediction. . https://doi.org/10.1109/vtc2023-spring57618.2023.10200600

Traffic flow prediction has been regarded as a critical problem in intelligent transportation systems. An accurate prediction can help mitigate congestion and other societal problems while facilitating safer, cost and time-efficient travel. However,... Read More about Optimization and Performance Evaluation of Hybrid Deep Learning Models for Traffic Flow Prediction.

Deep Neural Network Based Attention Model for Structural Component Recognition (2023)
Conference Proceeding
Sarangi, S., & Mandal, B. (2023). Deep Neural Network Based Attention Model for Structural Component Recognition. . https://doi.org/10.5220/0011688400003417

The recognition of structural components from images/videos is a highly complex task because of the appearance of huge components and their extended existence alongside, which are relatively small components. The latter is frequently overestimated or... Read More about Deep Neural Network Based Attention Model for Structural Component Recognition.

Kernelized dynamic convolution routing in spatial and channel interaction for attentive concrete defect recognition (2022)
Journal Article
Mandal, B. (2022). Kernelized dynamic convolution routing in spatial and channel interaction for attentive concrete defect recognition. Signal Processing: Image Communication, 116818 - 116818. https://doi.org/10.1016/j.image.2022.116818

Image/video based defect recognition is a crucial task in automating visual inspection of concrete structures. Although some progress has been made to automatically recognize defects in concrete structural images, significant challenges still exist.... Read More about Kernelized dynamic convolution routing in spatial and channel interaction for attentive concrete defect recognition.

MacularNet: Towards Fully Automated Attention-Based Deep CNN for Macular Disease Classification (2022)
Journal Article
Mandal, B. (2022). MacularNet: Towards Fully Automated Attention-Based Deep CNN for Macular Disease Classification. https://doi.org/10.1007/s42979-022-01024-0

AbstractIn this work, we propose an attention-based deep convolutional neural network (CNN) model as an assistive computer-aided tool to classify common types of macular diseases: age-related macular degeneration, diabetic macular edema, diabetic ret... Read More about MacularNet: Towards Fully Automated Attention-Based Deep CNN for Macular Disease Classification.

StructureNet: Deep Context Attention Learning for Structural Component Recognition (2022)
Conference Proceeding
Kaothalkar, A., Mandal, B., & Puhan, N. (2022). StructureNet: Deep Context Attention Learning for Structural Component Recognition. . https://doi.org/10.5220/0010872800003124

Structural component recognition using images is a very challenging task due to the appearance of large components and their long continuation, existing jointly with very small components, the latter are often outcasted/missed by the existing methodo... Read More about StructureNet: Deep Context Attention Learning for Structural Component Recognition.

Perturbed Composite Attention Model for Macular Optical Coherence Tomography Image Classification (2021)
Journal Article
Mishra, S. S., Mandal, B., & Puhan, N. B. (2022). Perturbed Composite Attention Model for Macular Optical Coherence Tomography Image Classification. IEEE Transactions on Artificial Intelligence, 3(4), 625-635. https://doi.org/10.1109/tai.2021.3135797

In this article, we propose a deep architecture stemming from a perturbed composite attention mechanism with the following two novel attention modules: Multilevel perturbed spatial attention (MPSA) and multidimension attention (MDA) for macular optic... Read More about Perturbed Composite Attention Model for Macular Optical Coherence Tomography Image Classification.

Interleaved Deep Artifacts-Aware Attention Mechanism for Concrete Structural Defect Classification. (2021)
Journal Article
Mandal, B. (2021). Interleaved Deep Artifacts-Aware Attention Mechanism for Concrete Structural Defect Classification. IEEE Transactions on Image Processing, 6957 - 6969. https://doi.org/10.1109/TIP.2021.3100556

Automatic machine classification of concrete structural defects in images poses significant challenges because of multitude of problems arising from the surface texture, such as presence of stains, holes, colors, poster remains, graffiti, marking and... Read More about Interleaved Deep Artifacts-Aware Attention Mechanism for Concrete Structural Defect Classification..

Multi-level Dual-attention Based CNN for Macular Optical Coherence Tomography Classification (2019)
Journal Article
Mandal, B. (2019). Multi-level Dual-attention Based CNN for Macular Optical Coherence Tomography Classification. IEEE Signal Processing Letters, 1793-1797. https://doi.org/10.1109/LSP.2019.2949388

In this letter, we propose a multi-level dual-attention model to classify two common macular diseases, age-related macular degeneration (AMD) and diabetic macular edema (DME) from normal macular eye conditions using optical coherence tomography (OCT)... Read More about Multi-level Dual-attention Based CNN for Macular Optical Coherence Tomography Classification.

Improved Lifelog Ego-centric Video Summarization Using Ensemble of Deep Learned Object Features (2019)
Presentation / Conference
Mandal, B., & Mainwaring, P. (2019, September). Improved Lifelog Ego-centric Video Summarization Using Ensemble of Deep Learned Object Features. Presented at 30th British Machine Vision Conference, Cardiff

The ImageCLEF 2017 lifelog summarization challenge [10, 12] was established to develop a benchmark for summarizing egocentric lifelogging videos based on our daily activities, such as ‘commute to work’ or ‘cooking at home’. In this paper, we propose... Read More about Improved Lifelog Ego-centric Video Summarization Using Ensemble of Deep Learned Object Features.

Enhanced Deep Video Summarization Network (2019)
Presentation / Conference
Gonuguntla, N., Mandal, B., & Puhan, N. (2019, September). Enhanced Deep Video Summarization Network. Paper presented at 30th British Machine Vision Conference, Cardiff

Video summarization is understanding video which aims to get an abstract view of the original video sequence by the concatenation of keyframes representing the highlights of the video. In this work, we propose an enhanced deep summarization network (... Read More about Enhanced Deep Video Summarization Network.

Cross-spectral Periocular Recognition: a Survey (2019)
Conference Proceeding
Behera, S., Mandal, B., & Puhan, N. (2019). Cross-spectral Periocular Recognition: a Survey. In Emerging Research in Electronics, Computer Science and Technology (731–741). https://doi.org/10.1007/978-981-13-5802-9_64

Among many biometrics such as face, iris, fingerprint and others, periocular region has the advantages over other biometrics because it is non-intrusive and serves as a balance between iris or eye region (very stringent, small area) and the whole fac... Read More about Cross-spectral Periocular Recognition: a Survey.

Deep Convolutional Generative Adversarial Network-Based Food Recognition Using Partially Labeled Data (2019)
Journal Article
Mandal, B., Puhan, N. B., & Verma, A. (2019). Deep Convolutional Generative Adversarial Network-Based Food Recognition Using Partially Labeled Data. IEEE Sensors Letters, 3(2), Article ARTN 7000104. https://doi.org/10.1109/LSENS.2018.2886427

Traditional machine learning algorithms using hand-crafted feature extraction techniques (such as local binary pattern) have limited accuracy because of high variation in images of the same class (or intraclass variation) for food recognition tasks.... Read More about Deep Convolutional Generative Adversarial Network-Based Food Recognition Using Partially Labeled Data.

DeepPCA Based Objective Function for Melanoma Detection (2018)
Conference Proceeding
Sultana, N. N., Puhan, N. B., & Mandal, B. (2018). DeepPCA Based Objective Function for Melanoma Detection. In 2018 International Conference on Information Technology (ICIT). https://doi.org/10.1109/icit.2018.00025

In this paper, we propose an objective function for the convolutional neural network to acquire the variation separability as opposed to the categorical cross entropy which maximizes according to the target labels. This approach is an unsupervised le... Read More about DeepPCA Based Objective Function for Melanoma Detection.

Deep Residual Network with Regularized Fisher Framework for Detection of Melanoma (2018)
Journal Article
Mandal, B. (2018). Deep Residual Network with Regularized Fisher Framework for Detection of Melanoma. IET Computer Vision, 1096-1104. https://doi.org/10.1049/iet-cvi.2018.5238

Of all the skin cancer that is prevalent, melanoma has the highest mortality rates. Melanoma becomes life threatening when it penetrates deep into the dermis layer unless detected at an early stage, it becomes fatal since it has a tendency to migrate... Read More about Deep Residual Network with Regularized Fisher Framework for Detection of Melanoma.