Automated Video and Audio based Stress Detection using Deep Learning Techniques

Main Article Content

Deepali Godse
Nilofar Mulla
Rohini Jadhav
Milind Gayakwad
Rahul Joshi
Kalyani Kadam
Jayashree Jadhav

Abstract

In today's world, stress has become an undoubtedly severe problem that affects people's health. Stress can modify a person's behavior, ideas, and feelings in addition to having an impact on mental health. Unchecked stress can contribute to chronic illnesses including high blood pressure, diabetes, and obesity. Early stress detection promotes a healthy lifestyle in society. This work demonstrates a deep learning-based method for identifying stress from facial expressions and speech signals.An image dataset formed by collecting images from the web is used to construct and train the model Convolution Neural Network (CNN), which then divides the images into two categories: stressed and normal. Recurrent Neural Network (RNN), which is used to categorize speech signals into stressed and normal categories based on the features extracted by the MFCC (Mel Frequency Cepstral Coefficient), is thought to perform better on sequential data since it maintains the past results to determine the final output.

Article Details

How to Cite
Godse, D. ., Mulla, N. ., Jadhav, R. ., Gayakwad, M. ., Joshi, R. ., Kadam, K. ., & Jadhav, J. . (2023). Automated Video and Audio based Stress Detection using Deep Learning Techniques. International Journal on Recent and Innovation Trends in Computing and Communication, 11(11s), 487–492. https://doi.org/10.17762/ijritcc.v11i11s.8178
Section
Articles

References

G. W. Evans and J. M. McCoy, ‘‘When buildings don’t work: The role of architecture in human health,’’ J. Environ. Psychol., vol. 18, no. 1, pp. 85–94, 1998. [ 10.1006/jevp.1998.0089 ]

A. Liebl, J. Haller, B. Jödicke, H. Baumgartner, S. Schlittmeier, and J. Hellbrück, ‘‘Combined effects of acoustic and visual distraction on cognitive performance and well-being,’’ Appl. Ergonom., vol. 43, no. 2, pp. 424–434, 2012. [ 10.1016/j.apergo.2011.06.017 ]

Sara Aritizabal, Kunjoon Byun and Nadia Wood, “The Feasibility of Wearable and Self-Report Stress Detection Measures in a Semi-Controlled Lab Environment,” IEEE Access, vol. 9, pp. 102053-102068, 2021. [ 10.1109/ACCESS.2021.3097038 ]

SadilChamishka, Ishara Madhavi and RashmikaNawaratne, “A voice-based real-time emotion detection technique using recurrent neural network empowered feature modelling,” Multimedia Tools and Applications, vol. 81, pp. 35173–35194, 2022. [ https://doi.org/10.1007/s11042-022-13363-4 ]

Huijun Zhang and Ling Feng, “Video-Based Stress Detection through Deep Learning,” Sensors, vol. 20, 2020. [ https://doi.org/10.3390/s20195552 ]

Ciabattoni L., Ferracuti F., Longhi S., Pepa L., Romeo L. and Verdini F., “Real-time mental stress detection based on smartwatch,” in Proc. 2017 IEEE International Conference on Consumer Electronics (ICCE), pp. 110–111, 2017. [ 10.1109/ICCE.2017.7889247 ]

Mike Thelwall, “TensiStrength : Stress and relaxation magnitude detection for social media texts,” Elsevier Information Processing and Management, vol. 53, pp. 106-121, 2016. [10.1016/j.ipm.2016.06.009 ]

Russell Li and Zhandong Liu, “Stress Detection using Deep Neural Networks,” in Proc. The International Conference on Intelligent Biology and Medicine (ICIBM), vol. 20, 2020. [ 10.1186/s12911-020-01299-4]

Rikiya Yamashita, Mizuho Nishio, Richard Kinh Gian Do and Kaori Togashi, “Convolutional neural networks: an overview and application in radiology,” Insights into Imaging, vol. 9, pp. 611–629, 2018. https://insightsimaging.springeropen.com/articles/10.1007/s13244-018-0639-9 ]

Nadia Jmour, SehlaZayen and AfefAbdelkrim, “Convolutional neural networks for image classification,” in Proc. International Conference on Advanced Systems and Electric Technologies (IC_ASET), pp. 397-402, 2018. [ 10.1109/ASET.2018.8379889 ]

Alex Graves, Abdel-rahman Mohamed and Geoffrey Hinton, “Speech Recognition with Deep Recurrent Neural Networks,” in Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 6645-6649, 2013. [ 10.1109/ICASSP.2013.6638947 ]

Aditya Amberkar, Parikshit Awasarmol, Gaurav Deshmukh and Piyush Dave, “Speech Recognition using Recurrent Neural Networks,” in Proc. IEEE International Conference on Current Trends toward Converging Technologies, 2018 [ 10.1109/ICCTCT.2018.8551185 ]

Harshawardhan S. Kumbhar and Sheetal U. Bhandari, “Speech Emotion Recognition using MFCC features and LSTM network,” in Proc. 5th International Conference On Computing, Communication, Control And Automation (ICCUBEA), 2020. [ 10.3390/app11219897 ]

Sudharsan.R, Hands-On Deep Learning Algorithms with Python, 1st ed. ,Packt Publishing, 2019. [ https://www.packtpub.com/product/hands-on-deep-learning-algorithms-with-python/9781789344158 ]

Introduction to Supervised Deep Learning Algorithms, 2021. [Online]. Available: https://www.analyticsvidhya.com/blog/2021/05/introduction-to-supervised-deep-learning-algorithms/

People feeling more stressed in India, 2022. [Online]. Available: https://www.statista.com/statistics/1320246/india-people-feeling-more-stressed-by-age/