This is a university project to build a speech emotion recognition system with multiple modalities. Following the UniMSE paper, we use the following datasets:
- MOSEI
- MOSI
- IEMOCAP
The dataset folders can be found under `UniMSE/Simcse/dataset`.
`Unimse_Submission.ipynb` contains a walkthrough of all the installation steps. At the end of the notebook, `main.py` is tested on three different datasets (MOSEI, MOSI, IEMOCAP).
- First, every dataset (MOSI, MOSEI, IEMOCAP, MELD) is preprocessed by running `preprocess.py`. For every dataset, a train, test, and validation `.pkl` file is created and saved to the corresponding folder (see the sanity-check sketch after this list).
- Under `UniMSE/src`, the paths to the created `.pkl` files must be set correctly for every dataset. The training dataset and the number of epochs can be changed in `config.py`.
- Please also add `t5-base` to the `./src` folder (see the download sketch after this list).
- Lastly, the model is trained by running `main.py`.
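As a quick sanity check after preprocessing, the generated `.pkl` splits can be opened with `pickle`. This is a minimal sketch; the folder `UniMSE/Simcse/dataset/MOSI` and the split file names `train.pkl`/`valid.pkl`/`test.pkl` are assumptions, so adjust the paths to wherever `preprocess.py` wrote the files in your setup:

```python
import pickle
from pathlib import Path

# Assumed layout: preprocess.py writes one .pkl per split into the dataset folder.
dataset_dir = Path("UniMSE/Simcse/dataset/MOSI")  # hypothetical path, adjust as needed

for split in ("train", "valid", "test"):  # split names are assumed
    with open(dataset_dir / f"{split}.pkl", "rb") as f:
        data = pickle.load(f)
    size = len(data) if hasattr(data, "__len__") else "unknown"
    print(f"{split}: {type(data).__name__} with {size} entries")
```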
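If `t5-base` is not already available locally, one way to place it under `./src` is to download it with the Hugging Face `transformers` library. This is only a sketch; the target folder name `./src/t5-base` is an assumption about what the training code expects, so match it to the path configured in `config.py`/`main.py`:

```python
# Sketch: fetch t5-base once and save it locally so training can load it
# without network access. The destination "./src/t5-base" is an assumption.
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("t5-base")

tokenizer.save_pretrained("./src/t5-base")
model.save_pretrained("./src/t5-base")
```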
Results:

| Dataset | Train Loss | Valid Loss | Test Loss |
| ------- | ---------- | ---------- | --------- |
| MOSEI   | 0.0070     | 0.3053     | 0.3024    |
| MOSI    | 0.0121     | 0.4356     | 0.3878    |
Next steps:
- Run on IEMOCAP
- Fine-tune on Emodb