Open-Sora: Democratizing Efficient Video Production for All

Python 21,484 2,059 Updated Aug 9, 2024

A one-stop data processing system to make data higher-quality, juicier, and more digestible for (multimodal) LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 2,446 150 Updated Sep 2, 2024

LlamaIndex is a data framework for your LLM applications

Python 35,126 4,927 Updated Sep 2, 2024

Animate Anyone: Consistent and Controllable Image-to-Video Synthesis for Character Animation

14,239 954 Updated Jul 26, 2024

Video-LLaVA: Learning United Visual Representation by Alignment Before Projection

Python 2,819 202 Updated Jul 27, 2024

Open-source code for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)

Python 25,278 2,899 Updated Sep 2, 2024

Generative Models by Stability AI

Python 23,950 2,663 Updated Aug 21, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 66,932 7,886 Updated Aug 19, 2024

Consistency Distilled Diff VAE

Python 2,122 74 Updated Nov 7, 2023
Python 3 Updated Oct 18, 2023

An easy-to-use framework for modular RAG

Python 275 38 Updated Sep 2, 2024

[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.

Python 19,040 2,085 Updated Aug 12, 2024

The official PyTorch implementation of the paper "Human Motion Diffusion Model"

Python 3,038 326 Updated Aug 4, 2024

The official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.

Python 626 87 Updated Aug 30, 2024

MagicAvatar: Multimodal Avatar Generation and Animation

616 31 Updated Aug 29, 2023

📷 EasyPhoto | Your Smart AI Photo Generator.

Python 4,887 387 Updated Jul 10, 2024

A PyTorch-based Speech Toolkit

Python 8,492 1,354 Updated Sep 2, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 10,711 1,041 Updated Aug 15, 2024

GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

Python 35,441 5,870 Updated Jul 26, 2024

Generative Agents: Interactive Simulacra of Human Behavior

16,154 2,062 Updated Aug 5, 2024

[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Jupyter Notebook 1,519 95 Updated Apr 22, 2024

SoftVC VITS Singing Voice Conversion

Python 25,234 4,742 Updated Nov 11, 2023

A 13B large language model developed by Baichuan Intelligent Technology

Python 2,978 237 Updated Sep 6, 2023

SAM-PT: Extending SAM to zero-shot video segmentation with point-based tracking.

Python 950 60 Updated Jan 27, 2024
Python 297 27 Updated Apr 2, 2024

ChatLaw: A Powerful LLM Tailored for the Chinese Legal Domain (Chinese legal large language model)

6,794 534 Updated Jun 4, 2024

Fast Segment Anything

Python 7,303 683 Updated Jul 30, 2024

ChatGLM2-6B: An Open Bilingual Chat LLM

Python 15,678 1,849 Updated Jun 27, 2024

Specify what you want it to build, the AI asks for clarification, and then builds it. Completely separate team and codebase from the AI Web App builder https://gptengineer.app

Python 51,866 6,753 Updated Aug 10, 2024

A curated list of awesome research papers, projects, code, dataset, workshops etc. related to virtual try-on.

2,091 262 Updated Jul 27, 2024