Skip to content
View enhuiz's full-sized avatar
💭
💭
  • Hong Kong
  • 15:05 (UTC +08:00)

Highlights

  • Pro

Block or report enhuiz

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Image forgery recognition algorithm

Python 466 64 Updated Sep 9, 2024

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 2,236 229 Updated Sep 13, 2024

Python version of the Playwright testing and automation library.

Python 11,497 875 Updated Sep 12, 2024

🏠 将小爱音箱接入 ChatGPT 和豆包,改造成你的专属语音助手。

TypeScript 7,124 657 Updated Aug 26, 2024

Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 503 17 Updated Sep 11, 2024

A plotting tool that outputs Line Rider maps, so you can watch a man on a sled scoot down your loss curves. 🎿

Python 292 5 Updated Aug 23, 2024

[Official Implementation] Acoustic Autoregressive Modeling 🔥

Python 52 5 Updated Aug 24, 2024

Finetune Llama 3.1, Mistral, Phi & Gemma LLMs 2-5x faster with 80% less memory

Python 15,499 1,039 Updated Sep 9, 2024

The official Meta Llama 3 GitHub site

Python 26,073 2,921 Updated Aug 12, 2024

The Multilayer Perceptron Language Model

Python 502 44 Updated Aug 9, 2024

LaTeXML: a TeX and LaTeX to XML/HTML/ePub/MathML translator.

Perl 915 96 Updated Sep 3, 2024

A simple way to keep track of an Exponential Moving Average (EMA) version of your pytorch model

Python 473 29 Updated Aug 27, 2024

Analyzing and Improving the Training Dynamics of Diffusion Models (EDM2)

Python 479 19 Updated May 30, 2024

Language-Codec: Reducing the Gaps Between Discrete Codec Representation and Speech Language Models

Python 207 16 Updated Aug 29, 2024

VAE modified from Descript Audio Codec, which replaces the RVQ with VAE

Python 42 5 Updated Apr 2, 2024

[GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly …

Python 3,993 302 Updated Jul 16, 2024

A family of diffusion models for text-to-audio generation.

Python 982 77 Updated Jul 3, 2024

PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.

Python 1,113 176 Updated Jul 17, 2024

A library for efficient similarity search and clustering of dense vectors.

C++ 30,455 3,556 Updated Sep 12, 2024

Multi-level network clustering based on the Map Equation

C++ 426 88 Updated Sep 2, 2024

SimPO: Simple Preference Optimization with a Reference-Free Reward

Python 636 37 Updated Aug 22, 2024

Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis

Python 760 86 Updated Aug 7, 2024

A generative speech model for daily dialogue.

Python 30,572 3,321 Updated Sep 4, 2024

Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"

Python 10,312 658 Updated Aug 14, 2024

High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

C++ 7,867 403 Updated Sep 6, 2024

Mamba SSM architecture

Python 12,520 1,053 Updated Aug 15, 2024

Instant voice cloning by MIT and MyShell.

Python 28,356 2,773 Updated Aug 21, 2024

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Python 4,208 279 Updated Jun 21, 2024

PyTorch implementation of normalizing flow models

Python 680 104 Updated Aug 25, 2024
Next