Skip to content
View yushuiwx's full-sized avatar
Block or Report

Block or report yushuiwx

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation

Python 1,091 39 Updated Jul 14, 2024

Official Pytorch Implementation of Our CVPR2023 Paper: "Towards Accurate Image Coding: Improved Autoregressive Image Generation with Dynamic Vector Quantization"

Python 125 2 Updated Jul 23, 2023

🔥🔥🔥 A curated list of papers on LLMs-based multimodal generation (image, video, 3D and audio).

HTML 231 16 Updated Jul 8, 2024

Awesome-LLM-3D: a curated list of Multi-modal Large Language Model in 3D world Resources

874 59 Updated Jul 4, 2024

🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.

1,039 56 Updated Jul 30, 2024

A repository for research on medium sized language models.

Python 434 58 Updated Jul 31, 2024

Pytorch implementation of the models RT-1-X and RT-2-X from the paper: "Open X-Embodiment: Robotic Learning Datasets and RT-X Models"

Python 141 14 Updated Jul 27, 2024
32 Updated Apr 14, 2024

Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models" (ICLR 2024)

Jupyter Notebook 2,751 243 Updated May 3, 2024

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 14,350 1,321 Updated Jul 16, 2024
Jupyter Notebook 4 Updated Jun 16, 2023

Implementation of Natural Speech 2, Zero-shot Speech and Singing Synthesizer, in Pytorch

Python 1,245 99 Updated Sep 24, 2023

StreamingT2V: Consistent, Dynamic, and Extendable Long Video Generation from Text

Python 1,132 105 Updated Apr 10, 2024

Code of the paper: Finetuning Text-to-Image Diffusion Models for Fairness

Python 35 Updated Apr 26, 2024

Mora: More like Sora for Generalist Video Generation

Python 1,454 91 Updated Jun 21, 2024

Open-Sora: Democratizing Efficient Video Production for All

Python 20,989 1,992 Updated Jul 25, 2024

An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images

Python 350 18 Updated Dec 15, 2023

📚 A collection of resources and papers on Vector Quantized Variational Autoencoder (VQ-VAE) and its application

181 5 Updated Mar 27, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,114 195 Updated Jul 21, 2024
Python 7,036 544 Updated Jul 25, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

2,922 178 Updated Jul 25, 2024

[Arxiv] A Survey on Video Diffusion Models

1,575 78 Updated Jul 25, 2024

A comprehensive list of papers using large language/multi-modal models for Robotics/RL, including papers, codes, and related websites

2,586 206 Updated Jul 29, 2024

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,335 250 Updated Jan 27, 2024

Analysis of evidential models

Python 10 Updated Jun 22, 2023

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

Python 324 27 Updated Jan 25, 2024

Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 2,333 247 Updated Jul 27, 2024

Awesome-LLM-Robustness: a curated list of Uncertainty, Reliability and Robustness in Large Language Models

587 42 Updated Jun 18, 2024

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,450 2,204 Updated Jul 29, 2024