Multimodal Emotion recognition

Project Code :TCMAPY2329

Objective

This project aims to develop a multimodal emotion recognition system that integrates audio and video data for accurate emotion classification. By leveraging CNN, Conformer, LSTM, and SVC models, the system will classify emotions while addressing challenges like background noise and lighting variations. The system will be evaluated based on accuracy, precision, recall, and F1-score, ensuring robust performance across emotional categories. A user-friendly web application will be built using Flask to allow users to upload audio and video files for emotion recognition. This scalable system holds potential for applications in healthcare, entertainment, and customer service.

Abstract

Emotion recognition is essential in enhancing human-computer interaction across various applications. This project aims to develop a multimodal emotion recognition system that processes both audio and video data to classify emotions into six categories: Happy, Sad, Angry, Fearful, Disgusted, and Surprised. The dataset used for this project is sourced from Kaggle, containing audio and video recordings of individuals exhibiting different emotional expressions. The system employs three advanced algorithms: Convolutional Neural Networks (CNN) combined with Conformer for video data, Long Short-Term Memory (LSTM) networks for sequential audio data processing, and Support Vector Classifier (SVC) for emotion classification. A Flask-based web application offers an interface that allows users to upload their media for emotion classification. The goal is to achieve high accuracy in emotion detection by leveraging the complementary features of both audio and video. The system is evaluated based on accuracy, precision, recall, and F1-score metrics. This approach highlights the potential of multimodal models in improving emotion recognition capabilities.


Keywords: Emotion Recognition, Multimodal Learning, Audio Classification, Video Classification, CNN, Conformer, LSTM, SVC, Flask, Deep Learning.

NOTE: Without the concern of our team, please don't submit to the college. This Abstract varies based on student requirements.

Block Diagram

Specifications

HARDWARE REQUIREMENTS

β€’       Processor                                 - I5/Intel Processor

β€’       RAM                                       - 8GB (min)

β€’       Hard Disk                                - 160 GB

β€’       Key Board                               - Standard Windows Keyboard

β€’       Mouse                                      - Two or Three Button Mouse

β€’       Monitor                                    - Any

SOFTWARE REQUIREMENS

β€’       Operating System                   :  Windows 7/8/10

β€’       Server side Script                    :  HTML, CSS, & JS

β€’       Programming Language         :  Python

β€’       Libraries                                  :  Flask, Pandas, MySQL. connector, Os, NumPy, tensorflow, keras, Scikit- learn, sklearn, Preprocessor

β€’        IDE/Workbench                    :  VS-Code

β€’       Technology                             :  Python 3.10+,

β€’       Database                                 :  SqlLite

 

 

Demo Video

mail-banner
call-banner
contact-banner
Request Video