Skip to content
View zhifengkong's full-sized avatar

Block or report zhifengkong

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

💿 Free software that works great, and also happens to be open-source Python.

Jupyter Notebook 16,683 2,677 Updated Jun 30, 2024

[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…

Python 4,066 304 Updated Oct 6, 2024

An Audio Language model for Audio Tasks

Python 283 15 Updated Apr 19, 2024

This repository is an implementation of this article: https://arxiv.org/pdf/2107.03312.pdf

Python 347 51 Updated Apr 21, 2022

Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable…

Python 20,763 2,117 Updated Jul 18, 2024

This is the repository for the distill web framework

JavaScript 799 131 Updated Dec 5, 2022

A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model

Python 491 30 Updated Oct 11, 2024

Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

Python 3,142 254 Updated Sep 6, 2023

🔊 Text-Prompted Generative Audio Model

Jupyter Notebook 35,683 4,196 Updated Aug 19, 2024
Python 7,664 497 Updated Apr 14, 2024

Paper List for a new paradigm of NLP: Interactive NLP (https://arxiv.org/abs/2305.13246) 🔥

Python 208 13 Updated Jun 23, 2023

Audio generation using diffusion models, in PyTorch.

Python 1,936 167 Updated Jun 12, 2023

[ICML'23] StyleGAN-T: Unlocking the Power of GANs for Fast Large-Scale Text-to-Image Synthesis

Python 1,148 53 Updated Apr 7, 2023

DALL·E Mini - Generate images from a text prompt

Python 14,744 1,207 Updated Nov 9, 2023

Versatile Diffusion: Text, Images and Variations All in One Diffusion Model, arXiv 2022 / ICCV 2023

Python 1,314 82 Updated Aug 10, 2023

Official repository of the paper "Solving Audio Inverse Problems with a Diffusion Model", submitted to ICASSP 23

Jupyter Notebook 106 11 Updated Mar 14, 2023

Score-based Generative Models (Diffusion Models) for Speech Enhancement and Dereverberation

Python 473 72 Updated Oct 10, 2024

[NeurIPS 2022] Denoising Diffusion Restoration Models -- Official Code Repository

Python 568 53 Updated Oct 10, 2022

Implementation of AudioLM, a SOTA Language Modeling Approach to Audio Generation out of Google Research, in Pytorch

Python 2,406 256 Updated Oct 11, 2024

95.47% on CIFAR10 with PyTorch

Python 5,947 2,138 Updated Feb 24, 2023

MIDI Piano synthesizer using DDSP.

Python 73 3 Updated May 24, 2024
Jupyter Notebook 381 27 Updated May 21, 2024

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Python 1,966 484 Updated Dec 19, 2023
Python 34 5 Updated Jun 13, 2023

Official PyTorch Implementation of CleanUNet (ICASSP 2022)

Python 289 50 Updated Oct 11, 2023

A collection of resources and papers on Diffusion Models

HTML 10,891 937 Updated Aug 1, 2024

PyTorch implementation of Glow

Python 508 97 Updated Nov 20, 2021

Implementation of Glow in PyTorch

Python 79 20 Updated Jan 18, 2021

Simple, extendable, easy to understand Glow implementation in PyTorch

Python 374 63 Updated Jul 16, 2022
Next