Skip to content
View WarmCongee's full-sized avatar
🍦
🍦

Organizations

@V5Hub @NPU-Java-Web

Block or report WarmCongee

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

Official implementation of paper "Knowledge Distillation from A Stronger Teacher", NeurIPS 2022

Python 132 18 Updated Dec 28, 2022

第一个支持中英文双语语音-文本多模态对话的开源可商用对话模型。便捷的语音输入将大幅改善以文本为输入的大模型的使用体验,同时避免了基于 ASR 解决方案的繁琐流程以及可能引入的错误。

Python 505 53 Updated Sep 11, 2023

Speech, Language, Audio, Music Processing with Large Language Model

Python 448 35 Updated Aug 20, 2024

PyTorch implementation of "Supervised Contrastive Learning" (and SimCLR incidentally)

Python 3,032 524 Updated Dec 26, 2023

Papers and codes collection for customized, personalized and editable generative models

22 Updated Aug 20, 2024

🔨AI 方向好用的科研工具

2,237 338 Updated Jun 10, 2024

Mamba SSM architecture

Python 12,249 1,029 Updated Aug 15, 2024

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 557 40 Updated Aug 21, 2024

Deformable Speech Transformer (DST)

Python 25 2 Updated Aug 8, 2024

A collection of datasets for the purpose of emotion recognition/detection in speech.

HTML 276 38 Updated Jun 23, 2024

[Information Fusion 2024] HiCMAE: Hierarchical Contrastive Masked Autoencoder for Self-Supervised Audio-Visual Emotion Recognition

Python 82 7 Updated Aug 16, 2024

Reading list for research topics in multimodal machine learning

5,803 840 Updated Aug 20, 2024

Open-source KVM software

C 27,035 1,492 Updated Jun 22, 2024
Jupyter Notebook 4 Updated Aug 14, 2024

vits2 backbone with multilingual-bert

Python 7,692 1,092 Updated Aug 19, 2024

Make Acoustic and Visual Cues Matter: CH-SIMS v2.0 Dataset and AV-Mixup Consistent Module

Python 51 8 Updated Sep 8, 2022

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python 30,051 6,351 Updated Jul 26, 2024

Pytorch implementation for codes in Noise Imitation Based Adversarial Training for Robust Multimodal Sentiment Analysis (Accepted by IEEE Transactions on Multimedia).

Python 8 2 Updated Feb 2, 2024
Python 5 2 Updated Feb 8, 2022

✨✨Latest Advances on Multimodal Large Language Models

11,316 739 Updated Aug 22, 2024

Code for the InterSpeech 2023 paper: MMER: Multimodal Multi-task learning for Speech Emotion Recognition

Python 59 14 Updated Mar 12, 2024

AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in the SUPERB Benchmark. Interspeech 2023

Python 10 Updated Feb 23, 2024

2023年推免工作由线上转线下,夏令营数量众多易冲突,本仓库用于高效观察时间重合问题,方便制定策略

Python 37 6 Updated Jun 29, 2023

发布23年计算机保研夏令营和预推免通知,往年的保研经验帖;需要带保研或计算机保研资料联系qq:1585601434

311 5 Updated Mar 11, 2024

Zero-shot multimodal punctuation insertion and truecasing using Whisper

Python 95 5 Updated Feb 4, 2023

A Unified Library for Parameter-Efficient and Modular Transfer Learning

Jupyter Notebook 2,506 336 Updated Aug 22, 2024

注意力机制实践

Jupyter Notebook 378 47 Updated Jul 11, 2022

Toolkits for Multimodal Emotion Recognition

Python 144 12 Updated May 26, 2024

The code repository for NAACL 2021 paper "Multimodal End-to-End Sparse Model for Emotion Recognition".

Python 92 14 Updated Feb 9, 2023
HTML 30 Updated Aug 22, 2024
Next