Starred repositories
A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
Official PyTorch implementation for "Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels"
[ICLR 2024] SemiReward: A General Reward Model for Semi-supervised Learning
😎 An up-to-date & curated list of awesome semi-supervised learning papers, methods & resources.
Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents
This is the official code repository for "MedMamba: Vision Mamba for Medical Image Classification"
The official implementation of ECCV2024 paper "Facial Affective Behavior Analysis with Instruction Tuning"
A feature-rich command-line audio/video downloader
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Official PyTorch code for training and inference pipeline for DepMamba: Progressive Fusion Mamba for Multimodal Depression Detection
ViLa-MIL: Dual-scale Vision-Language Multiple Instance Learning for Whole Slide Image Classification (CVPR 2024)
State-of-the-art 2D and 3D Face Analysis Project
Toolkits for Multimodal Emotion Recognition
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
dimensional emotion classifier based bio-signal(ECG)
Two-stage Temporal Modelling Framework for Video-based Depression Recognition using Graph Representation
cube studio开源云原生一站式机器学习/深度学习/大模型AI平台,支持sso登录,多租户,大数据平台对接,notebook在线开发,拖拉拽任务流pipeline编排,多机多卡分布式训练,超参搜索,推理服务VGPU,边缘计算,serverless,标注平台,自动化标注,数据集管理,大模型微调,vllm大模型推理,llmops,私有知识库,AI模型应用商店,支持模型一键开发/推理/微调,…
Command-line program to download videos from YouTube.com and other video sites
Official source code for the paper: "Reading Between the Frames Multi-Modal Non-Verbal Depression Detection in Videos"
✨✨Latest Advances on Multimodal Large Language Models
RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference,…
[WACV 2024] LibreFace: An Open-Source Toolkit for Deep Facial Expression Analysis
心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1
[CVPR 2024] Official implementation of "ViTamin: Designing Scalable Vision Models in the Vision-language Era"
A high-throughput and memory-efficient inference and serving engine for LLMs