Stars
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Repository for Meta Chameleon, a mixed-modal early-fusion foundation model from FAIR.
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
A generative speech model for daily dialogue.
Schedule-Free Optimization in PyTorch
An open-source toolbox for fast sampling of diffusion models. Official implementations for our [CVPR-2024, ICML-2024] papers
A collection of resources on controllable generation with text-to-image diffusion models.
A Generalizable World Model for Autonomous Driving
OpenBA-V2: a 3B LLM (Large Language Model) with T5 architecture, built using model pruning and continued pretraining from OpenBA-15B.
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
MuLan: Adapting Multilingual Diffusion Models for 110+ Languages (adds multilingual support to any diffusion model without additional training)
Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'
Lumina-T2X is a unified framework for Text to Any Modality Generation
Gaussian Opacity Fields: Efficient and Compact Surface Reconstruction in Unbounded Scenes
[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …
Awesome Pretrained Chinese NLP Models: a collection of high-quality Chinese pretrained models, large models, multimodal models, and large language models
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
It's an open-source LLM based on an MoE structure.
Mixture-of-Experts for Large Vision-Language Models
[arXiv] The official code for "UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation".