Stars
SEED-Story: Multimodal Long Story Generation with Large Language Model
Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
GLM-4 series: Open Multilingual Multimodal Chat LMs
First-place solution for Track 2 (CGED-8) of the CCL2022 Chinese Learner Text Correction evaluation task
Hunyuan-DiT: A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.
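VirtualWife is a virtual digital human project that supports Bilibili live streaming as well as OpenAI and Ollama
👩🏿💻👨🏾💻👩🏼💻👨🏽💻👩🏻💻 List of projects by Chinese independent developers -- sharing what everyone is working on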
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …
Open-source LLM implementation of the paper "RecurrentGPT: Interactive Generation of (Arbitrarily) Long Text". AI novel writing, AI writing
Reaching LLaMA2 Performance with 0.1M Dollars
Evaluation tools for Retrieval-augmented Generation (RAG) methods.
StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text
Generative Models by Stability AI
Chinese character glyph/pinyin/semantic similarity (single character; can be used for data augmentation and for CSC misspelled-character detection and recognition tasks (building confusion sets))
near-synonym, an LLM-based toolkit for Chinese antonyms/synonyms.
State-of-the-art bilingual open-sourced Math reasoning LLMs.
Qwen2 is the large language model series developed by Qwen team, Alibaba Cloud.