Skip to content
View RayeRen's full-sized avatar
🎯
Focusing
🎯
Focusing

Organizations

@msra-alumni @MLNLP-World @NATSpeech

Block or report RayeRen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

High-resolution models for human tasks.

Python 3,260 168 Updated Aug 31, 2024

Text-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 6,480 581 Updated Aug 30, 2024

Official inference repo for FLUX.1 models

Python 12,351 850 Updated Aug 29, 2024

Bring portraits to life!

Python 11,094 1,115 Updated Aug 29, 2024

[ECCV 2024] Official implementation of the paper "X-Pose: Detecting Any Keypoints"

Python 397 17 Updated Aug 16, 2024

Stable Video Diffusion Training Code and Extensions.

Python 533 47 Updated Jul 25, 2024
Python 438 38 Updated Jun 7, 2024

Official repo for "Mini-Gemini: Mining the Potential of Multi-modality Vision Language Models"

Python 3,153 277 Updated May 4, 2024

One-click Face Swapper and Restoration powered by insightface 🔥

Python 479 71 Updated Apr 16, 2024

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

Python 565 40 Updated Aug 28, 2024

The official GitHub page for the survey paper "A Survey of Large Language Models".

Python 9,900 778 Updated Aug 20, 2024

We unified the interfaces of instruction-tuning data (e.g., CoT data), multiple LLMs and parameter-efficient methods (e.g., lora, p-tuning) together for easy use. We welcome open-source enthusiasts…

Jupyter Notebook 2,551 241 Updated Dec 12, 2023

Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance

Python 3,584 423 Updated Jul 10, 2024

Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)

Python 29,842 3,678 Updated Aug 30, 2024

PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation

Python 1,570 76 Updated Aug 5, 2024

✨✨Latest Advances on Multimodal Large Language Models

11,447 744 Updated Aug 30, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,066 187 Updated Aug 31, 2024

Foundational model for human-like, expressive TTS

Python 3,679 644 Updated Jul 30, 2024

GPT-style network for phonemization with durations of text

Jupyter Notebook 61 9 Updated Mar 21, 2024

リアルタイムボイスチェンジャー Realtime Voice Changer

Python 15,838 1,707 Updated Aug 27, 2024

[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation

Python 6,693 514 Updated Jul 17, 2024

Generative models for conditional audio generation

Python 2,469 230 Updated Jul 15, 2024

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 5,932 524 Updated May 31, 2024

Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis; ICLR 2024 Spotlight; Official code

Python 863 97 Updated Jul 4, 2024
Python 248 17 Updated Jun 8, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 31,836 3,668 Updated Aug 28, 2024

Flexible Python configuration system. The last one you will ever need.

Python 1,923 104 Updated May 30, 2024

Zero-shot multimodal punctuation insertion and truecasing using Whisper

Python 95 5 Updated Feb 4, 2023

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audi…

Python 4,419 373 Updated Aug 29, 2024

A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.

Python 2,181 170 Updated Aug 13, 2024
Next