PEMO: A New Validated Dataset for Punjabi Speech Emotion Detection

Chaitanya Singla
Sukhdev Singh


This work presents a new validated dataset for Punjabi, the Punjabi Emotional Speech Database (PEMO), developed to assess how well both computers and humans recognize emotions in speech. PEMO includes speech samples from about 60 speakers aged between 20 and 45 years, covering four fundamental emotions: anger, sadness, happiness and neutral. To create the data, Punjabi films were retrieved from multimedia websites such as YouTube and segmented into utterances using the PRAAT software. The database contains 22,000 natural utterances, equivalent to 12 hours and 35 minutes of speech taken from online Punjabi movies and web series. Three annotators, each with a thorough knowledge of the Punjabi language, categorized the emotional content of the utterances. The label assigned in common by all annotators becomes the final label for the utterance. The data is intended for studying the expression of emotions in Punjabi speech.
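The labelling rule described above (a label is final only when all annotators agree on it) can be sketched as follows. This is a minimal illustration, not the authors' tooling: the function name `unanimous_label`, the utterance IDs, and the sample annotations are hypothetical, and returning `None` for utterances without full agreement is an assumption, since the abstract does not say how disagreements were handled.

```python
from typing import Optional, Sequence

# The four emotion categories named in the abstract.
EMOTIONS = {"anger", "sad", "happy", "neutral"}


def unanimous_label(labels: Sequence[str]) -> Optional[str]:
    """Return the label shared by every annotator, or None if they disagree.

    Returning None on disagreement is an assumption made for this sketch.
    """
    if not labels:
        return None
    unknown = set(labels) - EMOTIONS
    if unknown:
        raise ValueError(f"unexpected labels: {sorted(unknown)}")
    first = labels[0]
    return first if all(label == first for label in labels) else None


# Hypothetical annotations from three annotators per utterance.
annotations = {
    "utt_0001": ["anger", "anger", "anger"],
    "utt_0002": ["sad", "neutral", "sad"],
}

final_labels = {utt: unanimous_label(labs) for utt, labs in annotations.items()}
```

Here `final_labels` maps `utt_0001` to `"anger"`, while `utt_0002` has no final label under this sketch because one annotator disagreed.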

Chaitanya Singla, and Sukhdev Singh. “PEMO: A New Validated Dataset for Punjabi Speech Emotion Detection”. International Journal on Recent and Innovation Trends in Computing and Communication, vol. 10, no. 10, Oct. 2022, pp. 52-58, doi:10.17762/ijritcc.v10i10.5734.


