Skip to content
View mazzzystar's full-sized avatar
☂️
Focusing
☂️
Focusing

Block or report mazzzystar

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Interface for OuteTTS models.

Python 244 11 Updated Nov 6, 2024

InstantIR: Blind Image Restoration with Instant Generative Reference 🔥

Python 137 6 Updated Nov 7, 2024

Official repository of In-Context LoRA for Diffusion Transformers

325 11 Updated Nov 7, 2024

Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

Python 163 12 Updated Nov 5, 2024

Text-To-Speech for NotebookLM

14 Updated Oct 31, 2024

Inference script for Oasis 500M

Python 1,101 80 Updated Nov 7, 2024
Python 23 Updated Oct 30, 2024

O1 Replication Journey: A Strategic Progress Report – Part I

1,234 32 Updated Oct 28, 2024

[NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Python 675 34 Updated Oct 25, 2024
Python 16 Updated Oct 30, 2024
Python 20 1 Updated Oct 29, 2024

A general fine-tuning kit geared toward diffusion models.

Python 1,760 166 Updated Nov 7, 2024
Python 1,072 71 Updated Oct 30, 2024
Jupyter Notebook 7 Updated Oct 29, 2024

一个超轻量级、可以在移动端实时运行的数字人模型

Python 797 134 Updated Nov 4, 2024

A Streamlined Multimodal Agent Framework for Smart Hardware and More

Python 1,111 92 Updated Nov 7, 2024

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 45 2 Updated Nov 5, 2024

Converts text to speech in realtime

Python 1,977 200 Updated Nov 1, 2024

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 301 31 Updated Sep 29, 2024

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,052 85 Updated Nov 5, 2024

GLM-4-Voice | 端到端中英语音对话模型

Python 2,111 169 Updated Oct 31, 2024

Making the community's best AI chat models available to everyone.

Swift 1,521 57 Updated Oct 24, 2024
TypeScript 2,908 259 Updated Oct 24, 2024

A Swift client library for interacting with Ollama

Swift 113 6 Updated Oct 28, 2024

PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.

Python 1,128 179 Updated Oct 25, 2024

BELLE: Be Everyone's Large Language model Engine(开源中文对话大模型)

HTML 7,907 758 Updated Oct 16, 2024

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

Python 156 10 Updated Jul 12, 2024

Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI

Python 234 12 Updated Nov 3, 2024

Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

Python 749 47 Updated Oct 28, 2024

Official inference framework for 1-bit LLMs

C++ 10,850 733 Updated Nov 6, 2024
Next