End-to-End Speech Emotion Recognition With Gender Information

Project Code :TCMAPY298

Objective

In this paper, we propose a novel emotion recognition algorithm that does not rely on any speech acoustic features and combines speaker gender information. The proposed algorithm combines a Residual Convolutional Neural Network (R-CNN) and a gender information block. Here, we aim to benefit from the rich information from speech raw data, without any artificial intervention.

Abstract

Many works have focused on speech emotion recognition algorithms. However, most rely on the proper selection of speech acoustic features. This study, propose a novel emotion recognition algorithm that does not rely on any speech acoustic features and combines speaker gender information. We aim to benefit from the rich information from speech raw data, without any artificial intervention. In general, speech emotion recognition systems require manual selection of appropriate traditional acoustic features as classifier input for emotion recognition. Utilizing deep learning algorithms, and the network automatically select important information from raw speech signal for the classification layer to accomplish emotion recognition. It can prevent the omission of emotion information that cannot be direct mathematically modeled as a speech acoustic characteristic. We also add speaker gender information to the proposed algorithm to further improve recognition accuracy. The proposed algorithm combines a Residual Convolutional Neural Network (R-CNN) and a gender information block. The raw speech data is sent to these two blocks simultaneously. The R-CNN network obtains the necessary emotional information from the speech data and classifies the emotional category. The proposed algorithm is evaluated on three public databases with different language systems.

Keywords: Affective Computing, Speech Emotion Recognition, Gender Classifier, Deep-Learning, Interpretability Of Deep-Learning.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

SOFTWARE SPECIFICATIONS:

  • Technology: Deep Learning.
  • Libraries: Pandas, Numpy, TensorFlow.
  • Version: Python 3.6+
  • Server-side scripts: HTML, CSS, JS
  • Frame works: Flask
  • IDE: Pycharm

HARDWARE SPECIFICATIONS:

  • RAM: 8GB, 64-bit os.
  • Processor: I3/Intel processor
  • Hard Disk Capacity: 128 GB +

Learning Outcomes

  • Scope of Real Time Application Scenarios
  • What is speech recognition system and emotions system
  • How Internet Works
  • What is a search engine and how browser can work
  • What type of technology versions are used
  • Use of HTML , CSS on UI Designs
  • Data Parsing Front-End to Back-End
  • Working Procedure
  • Introduction to basic technologies used for
  • How project works.
  • Input and Output modules
  • Frame work use
  • Datsets properties
  • Deep learning algorithms.
  • Data preprocessing methods
  • What is R-CNN model.
  • Benefits of RCNN over CNN.
  • Project Development Skills:
    • Problem analyzing skills.
    • Problem solving skills.
    • Creativity and imaginary skills.
    • Programming skills.
    • Deployment.
    • Testing skills.
    • Debugging skills.
    • Project presentation skills.
    • Thesis writing skills.

Demo Video

mail-banner
call-banner
contact-banner
Request Video