Zhe Niu enhuiz

207 followers · 52 following

Hong Kong

Stars

VisionRush / DeepFakeDefenders

Image forgery recognition algorithm

Python 466 64 Updated Sep 9, 2024

harlanhong / awesome-talking-head-generation

1,391 109 Updated Aug 20, 2024

gpt-omni / mini-omni

Open-source multimodal large language model that can hear and talk while thinking, featuring real-time end-to-end speech input and streaming audio output for conversation.

Python 2,236 229 Updated Sep 13, 2024

microsoft / playwright-python

Python version of the Playwright testing and automation library.

Python 11,497 875 Updated Sep 12, 2024

idootop / mi-gpt

🏠 Connect your Xiao Ai (XiaoAI) speaker to ChatGPT and Doubao, and turn it into your own personal voice assistant.

TypeScript 7,124 657 Updated Aug 26, 2024

lucidrains / transfusion-pytorch

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI

Python 503 17 Updated Sep 11, 2024

jndean / LossRider

A plotting tool that outputs Line Rider maps, so you can watch a man on a sled scoot down your loss curves. 🎿

Python 292 5 Updated Aug 23, 2024

qiuk2 / AAR

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Python 52 5 Updated Aug 24, 2024

unslothai / unsloth

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 15,499 1,039 Updated Sep 9, 2024

meta-llama / llama3

The official Meta Llama 3 GitHub site

Python 26,073 2,921 Updated Aug 12, 2024

EurekaLabsAI / mlp

The Multilayer Perceptron Language Model

Python 502 44 Updated Aug 9, 2024

brucemiller / LaTeXML

LaTeXML: a TeX and LaTeX to XML/HTML/ePub/MathML translator.

Perl 915 96 Updated Sep 3, 2024

lucidrains / ema-pytorch

A simple way to keep track of an Exponential Moving Average (EMA) version of your PyTorch model

Python 473 29 Updated Aug 27, 2024

NVlabs / edm2

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Python 479 19 Updated May 30, 2024

jishengpeng / Languagecodec

Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models

Python 207 16 Updated Aug 29, 2024

innnky / descript-audio-vae

VAE modified from Descript Audio Codec, which replaces the RVQ with VAE

Python 42 5 Updated Apr 2, 2024

FoundationVision / VAR

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,993 302 Updated Jul 16, 2024