Textualized and Feature-based Models for Compound Multimodal Emotion Recognition in the Wild, ABAW 7th - Challenge - Compound Expression (CE) Recognition Challenge

Python 3 Updated Sep 20, 2024

zehuiwu / VoiceERC

Python 4 1 Updated Aug 11, 2024

Dreamyao516 / DialogueLLM

Python 4 Updated Jan 18, 2024

LIN-SHANG / InstructERC

The official implementation of InstructERC

Python 121 7 Updated Jul 16, 2024

yingjie7 / BiosERC

Python 7 Updated Sep 22, 2024

nicolas-richet / feature-vs-text-compound-emotion

Python 3 2 Updated Aug 30, 2024

roudimit / whisper-flamingo

[Interspeech 2024] Whisper-Flamingo: Integrating Visual Features into Whisper for Audio-Visual Speech Recognition and Translation

Jupyter Notebook 70 3 Updated Aug 22, 2024

nickjw0205 / Improving-ASR-with-LLM-Description

Python 10 Updated Sep 2, 2024

modelscope / FunASR

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 6,205 659 Updated Sep 30, 2024

mtkresearch / generative-fusion-decoding

Generative Fusion Decoding (GFD) is a novel framework for integrating Large Language Models (LLMs) into multi-modal text recognition systems like ASR and OCR, improving performance and efficiency b…

Python 63 8 Updated Sep 2, 2024

kniter1 / TAILOR

PyTorch implementation of Tailor Versatile Multi-modal Learning for Multi-label Emotion Recognition

Python 53 13 Updated Nov 16, 2022

hingston / japanese

This repo contains a list of the 44,998 most common Japanese words in order of frequency, as determined by the University of Leeds Corpus.

66 11 Updated Sep 13, 2018

Priberam / Enhance-CB-Whisper

Python 1 Updated Jul 22, 2024

DCDmllm / Momentor

Python 43 1 Updated Jun 27, 2024

kiva12138 / CubeMLP

The implementation of CubeMLP

Python 40 5 Updated May 8, 2023

JiajunHe1025

Stars

tango4j / llm_speaker_tagging

kjw11 / CSEnet-ASR

Sreyan88 / LipGER

GeorgeEfstathiadis / LLM-Diarize-ASR-Agnostic

PoloWlg / Joint-Multimodal-Transformer-6th-ABAW

DresvyanskiyDenis / ABAW_2024

Hypotheses-Paradise / Hypo2Trans

rithiksachdev / PostASR-Correction-SLT2024

MooreThreads / MooER

ASolitaryMan / Foal-Net

gmftbyGMFTBY / Copyisallyouneed

zsLin177 / SpeechNER

zsLin177 / CSNER

tzyll / ChineseHP

chuyq / MESC

sbelharbi / feature-vs-text-compound-emotion