Skip to content
View zjlww's full-sized avatar

Highlights

  • Pro

Block or report zjlww

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.

Starred repositories

Showing results

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 25,556 3,704 Updated Aug 28, 2024

Kolors Team

Python 3,234 199 Updated Aug 6, 2024

Efficient Triton Kernels for LLM Training

Python 2,461 103 Updated Aug 28, 2024

An open source implementation of CLIP.

Python 9,625 946 Updated Aug 19, 2024

Modern columnar data format for ML and LLMs implemented in Rust. Convert from parquet in 2 lines of code for 100x faster random access, vector index, and data versioning. Compatible with Pandas, Du…

Rust 3,740 204 Updated Aug 27, 2024

UTokyo-SaruLab MOS Prediction System

Python 44 5 Updated Jul 28, 2024

Repository for Show-o, One Single Transformer to Unify Multimodal Understanding and Generation.

Python 514 24 Updated Aug 28, 2024

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Python 49 4 Updated Aug 24, 2024

A list of publicly available room impulse response datasets and scripts to download them.

Shell 378 28 Updated Apr 23, 2024

Evaluation Protocol for Large-Scale Zero-Shot TTS Literature

Python 38 2 Updated Aug 19, 2024

A sequence-to-sequence voice conversion toolkit.

Python 79 9 Updated Jul 5, 2024
Python 35 5 Updated Aug 23, 2024

PyTorch reimplementation of the DiracGAN proposed in the paper "Which Training Methods for GANs do actually Converge?" [ICML 2018].

Python 18 5 Updated Jul 12, 2021

Enjoy the magic of Diffusion models!

Python 6,247 552 Updated Aug 27, 2024
Python 406 23 Updated Jul 10, 2024

The open source code for SimpleSpeech series

Python 66 4 Updated Aug 19, 2024

Reading list for research topics in Sound AI

159 8 Updated Aug 8, 2024

Source for the Interspeech 2024 Paper "Scaling up masked audio encoder learning for general audio classification"

Python 33 2 Updated Aug 13, 2024

A paper collection of recent diffusion models for text-image generation tasks, e,g., visual text generation, font generation, text removal, text image super resolution, text editing, handwritten ge…

178 4 Updated Aug 2, 2024

PyTorch implementation of MAR+DiffLoss https://arxiv.org/abs/2406.11838

Python 651 29 Updated Aug 20, 2024

The official Implementation of PeriodWave and PeriodWave-Turbo

99 7 Updated Aug 19, 2024

A curated list of reinforcement learning with human feedback resources (continually updated)

3,187 201 Updated Jul 21, 2024

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

Python 3,824 380 Updated Aug 22, 2024

📖 A curated list of resources dedicated to talking face.

1,236 104 Updated Aug 20, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 942 44 Updated Aug 13, 2024

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Jupyter Notebook 1,335 121 Updated Aug 15, 2024

Automatically Update Text-to-speech (TTS) Papers Daily using Github Actions (Update Every 12th hours)

Python 189 17 Updated Aug 28, 2024
Python 20 Updated Aug 27, 2024

real time face swap and one-click video deepfake with only a single image

Python 31,355 4,348 Updated Aug 27, 2024

JS tokenizer for LLaMA 3 and LLaMA 3.1

JavaScript 76 6 Updated Aug 11, 2024
Next