Stars
ChatGLM3 series: Open Bilingual Chat LLMs (open-source bilingual dialogue language models)
A comprehensive overview of affective computing research in the era of large language models (LLMs).
Mental health large model, LLM, The Big Model of Mental Health, Finetune, InternLM2, InternLM2.5, Qwen, ChatGLM, Baichuan, DeepSeek, Mixtral, LLama3, GLM4, Qwen2, LLama3.1
This is the repository of our ACL 2024 paper "ESCoT: Towards Interpretable Emotional Support Dialogue Systems".
Official repo of the paper "MMWorld: Towards Multi-discipline Multi-faceted World Model Evaluation in Videos"
The code for the paper "ECR-Chain: Advancing Generative Language Models to Better Emotion Cause Reasoners through Reasoning Chains" (IJCAI-2024).
🔊 Text-Prompted Generative Audio Model
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
A Python wrapper for the high-quality vocoder "World"
A generative speech model for daily dialogue.
(TPAMI 2024) A Survey on Open Vocabulary Learning
Making large AI models cheaper, faster and more accessible
[CVPR 2024] OneLLM: One Framework to Align All Modalities with Language
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)
Official code for Goldfish model for long video understanding and MiniGPT4-video for short video understanding
[EMNLP 2024🔥] Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Explainable Multimodal Emotion Reasoning (EMER) and AffectGPT
Toolkits for Multimodal Emotion Recognition
A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.
The official repo of Qwen-Audio (通义千问-Audio) chat & pretrained large audio language model proposed by Alibaba Cloud.
mPLUG-Owl: The Powerful Multi-modal Large Language Model Family
Fast and memory-efficient exact attention
Code for "AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling"
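text2vec, text to vector. A text vector representation tool that converts text into vector matrices and implements text representation and text similarity models such as Word2Vec, RankBM25, Sentence-BERT, and CoSENT, ready to use out of the box.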