Review Paper on Enhanced Image Captioning with Deep Learning: Encoder-Decoder and Attention Mechanism

Main Article Content

Vikash Kumar Singh, Ankita Gandhi, Brijesh Vala

Abstract

Image captioning involves the generation of textual descriptions that describe the content within an image. This process finds extensive utility in diverse applications, including the analysis of large, unlabelled image datasets, uncovering concealed patterns to facilitate machine learning applications, guiding self-driving vehicles, and developing software solutions to aid visually impaired individuals. The implementation of image captioning relies heavily on deep learning models, a technological frontier that has simplified the task of generating captions for images. This paper focuses on the utilisation of encoder-decoder model with attention mechanism for image captioning. In classic image captioning model, the words usually describe only a part of the image, however with attention mechanism special attention is given to the low level and high level features of the image. Object detection using attention mechanism has shown to have increased the CIDEr score by 15%. With the use of stable dataset of MSCOCO through keras datasets, it is possible to score more on caption generation and accurate description of image.

Article Details

How to Cite
Vikash Kumar Singh, et al. (2023). Review Paper on Enhanced Image Captioning with Deep Learning: Encoder-Decoder and Attention Mechanism. International Journal on Recent and Innovation Trends in Computing and Communication, 11(9), 733–738. https://doi.org/10.17762/ijritcc.v11i9.8866
Section
Articles
Author Biography

Vikash Kumar Singh, Ankita Gandhi, Brijesh Vala

Vikash Kumar Singh1, Ankita Gandhi2, Brijesh Vala3

1Department of Computer Engineering

Parul University

Vadodara, Gujarat

vik439@gmail.com

2Department of Computer Engineering

Parul University

Vadodara, Gujarat

ankita.gandhi@paruluniversity.ac.in

3Department of Computer Engineering

Parul University

Vadodara, Gujarat

brijesh.vala@paruluniversity.ac.in