Integration of MFCC Extraction and LSTM Algorithm on PYNQ-Z2 for Enhanced Audio Analysis

et al. Sheetal U. Bhandari

doi:10.17762/ijritcc.v11i10.8659

PDF

Published: Nov 2, 2023

DOI: https://doi.org/10.17762/ijritcc.v11i10.8659

Keywords:

Speech, Emotion, Feature Extraction, Deep Learning, MFCC, LSTM

Sheetal U. Bhandari, Deepti Khurge, Rajani PK, Varsha Bendre, Ashwini S. Shinde

Abstract

The need for Speech Emotion Recognition (SER) is growing since researchers have found it difficult to interpret human emotions from speech data. SER is very interesting yet very challenging task of human-computer interaction (HCI). The SER application can be benefitted depending on the type of feature extraction technique and model used for classification. Deep Learning has made a great impact in the field of audio, image, video, EEG and ECG classification. The speech signal characteristics and classification model affect how well the SER application performs. The paper briefs about deploying Deep Learning Algorithm on FPGA based board i.e., PYNQ-Z2. MFCC feature extraction technique and LSTM model used for classification of human emotion is implemented on the board. Emotion can be predicted using led buttons on the board.

How to Cite

Sheetal U. Bhandari, et al. (2023). Integration of MFCC Extraction and LSTM Algorithm on PYNQ-Z2 for Enhanced Audio Analysis. International Journal on Recent and Innovation Trends in Computing and Communication, 11(10), 1177–1185. https://doi.org/10.17762/ijritcc.v11i10.8659

Issue

Vol. 11 No. 10 (2023)

Section

Articles

Author Biography

Sheetal U. Bhandari, Deepti Khurge, Rajani PK, Varsha Bendre, Ashwini S. Shinde

Sheetal U. Bhandari¹, Deepti Khurge¹, Rajani PK¹, Varsha Bendre¹, Ashwini S. Shinde¹

¹Department of Electronics and Telecommunication Engineering, Pimpri Chinchwad College of Engineering, Pune, India.

sheetal.bhandari@pccoepune.org, dipti.khurge@pccoepune.org, rajani.pk@pccoepune.org, varsha.bendre@pccoepune.org, ashwinik09@gmail.com

Citation Indices	All	Since 2018
Citation	5854	3996
h-index	28	23
i10-index	119	72

Year	Rate
2019	12.6%
2018	18.3%
2017	16.9%
2016	18.8%
2015	22.9%
2014	28.9%
2013	26.1%

Integration of MFCC Extraction and LSTM Algorithm on PYNQ-Z2 for Enhanced Audio Analysis

Abstract

Sheetal U. Bhandari, Deepti Khurge, Rajani PK, Varsha Bendre, Ashwini S. Shinde

Contact Us:

Auricle Global Society of Education and Research

Y-18-A, Near Sanskar Play School, Sudarshana Nagar,

Bikaner, Rajasthan (India). Pin 334003

: editor@ijritcc.org

Quick Links:

Article Sidebar

Main Article Content

Abstract

Article Details

Sheetal U. Bhandari, Deepti Khurge, Rajani PK, Varsha Bendre, Ashwini S. Shinde

Contact Us:

Auricle Global Society of Education and Research

Y-18-A, Near Sanskar Play School, Sudarshana Nagar,

Bikaner, Rajasthan (India). Pin 334003

: editor@ijritcc.org

Quick Links: