Image Recognition Using Text and Audio Translation for the Visually Challenged


Rishita Khurana, Preeti Manani, Nripendra Narayan Das, Manika, Madhulika, Ashish Grover, Richa Adlakha

Abstract

According to the WHO, 253 million people worldwide are visually impaired, and many of them find it difficult to carry out their everyday lives. It is important to take meaningful steps with current technology so that they can experience the world around them without difficulty. To support visually impaired people, this project proposes a system that identifies an image, translates the image's description into text, and then produces audio. This can help a person read any text, recognize an image, and receive the result in spoken form. Motivated by recent work in machine translation and object recognition, a CNN-RNN based attention model is presented in this project. In the proposed framework, an image is first converted into a text description; then, using a basic text-to-speech API, the extracted caption is converted into speech, which further helps visually impaired users understand the image or visuals in front of them. The central part of the work is therefore building the captioning model, while the second part, converting text to speech, is relatively simple with a text-to-speech API. Once the model is built, it is deployed on a local system as a Flask application that produces an audio caption for any image fed to the model.
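The pipeline described in the abstract (image → CNN-RNN caption → text-to-speech) can be sketched as follows. This is a minimal illustrative skeleton, not the authors' implementation: the function names (`extract_features`, `decode_caption`, `caption_to_speech`, `describe_image`) are hypothetical, and both model stages are stand-ins for the real CNN encoder, RNN attention decoder, and TTS API.

```python
def extract_features(image_path: str) -> list[float]:
    """Stand-in for the CNN encoder; a real system would run a
    pretrained convolutional network over the image."""
    return [0.0] * 4  # placeholder feature vector

def decode_caption(features: list[float]) -> str:
    """Stand-in for the RNN attention decoder; a real decoder would
    generate the caption word by word from the features."""
    return "a person riding a bicycle on a street"

def caption_to_speech(caption: str, out_path: str) -> str:
    """Stand-in for the text-to-speech step; a real system might call
    a TTS library here, e.g. gTTS(caption).save(out_path)."""
    return f"spoken caption for '{caption}' saved to {out_path}"

def describe_image(image_path: str) -> str:
    """End-to-end pipeline: image -> features -> caption -> speech."""
    features = extract_features(image_path)
    caption = decode_caption(features)
    return caption_to_speech(caption, "caption.mp3")
```

In the deployed system, `describe_image` would sit behind a Flask route that accepts an uploaded image and returns the generated audio file.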

Article Details

How to Cite
Rishita Khurana, et al. (2023). Image Recognition Using Text and Audio Translation for the Visually Challenged. International Journal on Recent and Innovation Trends in Computing and Communication, 11(10), 2164–2181. https://doi.org/10.17762/ijritcc.v11i10.8904
Section
Articles
Author Biography

Rishita Khurana, Preeti Manani, Nripendra Narayan Das, Manika, Madhulika, Ashish Grover, Richa Adlakha

Rishita Khurana1, Preeti Manani2, Nripendra Narayan Das3, Manika4, Madhulika5, Ashish Grover6, Richa Adlakha7

1Department of Computer Science and Engineering, Amity University, Noida, India

rishitaakhurana14@gmail.com

2Faculty of Education, Dayalbagh Educational Institute (Deemed to be University), Agra

preetimanani.1708@gmail.com


3Corresponding Author, Department of Information Technology, Manipal University Jaipur, Rajasthan, India

nripendradas@gmail.com

4Department of Computer Science and Engineering, Amity University, Noida, India

manikachoudhary58@gmail.com

5Department of Computer Science and Engineering, Amity University, Noida, India

drmadhulikabhatia@gmail.com

6Department of Electrical and Electronics Engineering, MRIIRS, Faridabad

Ashi.21s@gmail.com

7Department of Electrical and Electronics Engineering, MRIIRS, Faridabad

Richaadlakaha.fet@mriu.edu.in