mazzzystar

☂️

Focusing

Ke Fang mazzzystar

☂️

Focusing

Computer Vision & Generative AI. "We create the world we live in."

723 followers · 440 following

Lists (3)

Sort

Beta Lists are currently in beta. Share feedback and report bugs.

Stars

edwko / OuteTTS

Interface for OuteTTS models.

Python 244 11 Updated Nov 6, 2024

instantX-research / InstantIR

InstantIR: Blind Image Restoration with Instant Generative Reference 🔥

Python 137 6 Updated Nov 7, 2024

ali-vilab / In-Context-LoRA

Official repository of In-Context LoRA for Diffusion Transformers

325 11 Updated Nov 7, 2024

Haiyang-W / TokenFormer

Official Implementation of TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters

Python 163 12 Updated Nov 5, 2024

lifeiteng / NotebookTTS

Text-To-Speech for NotebookLM

14 Updated Oct 31, 2024

etched-ai / open-oasis

Inference script for Oasis 500M

Python 1,101 80 Updated Nov 7, 2024

google / sequence-layers

Python 23 Updated Oct 30, 2024

GAIR-NLP / O1-Journey

O1 Replication Journey: A Strategic Progress Report – Part I

1,234 32 Updated Oct 28, 2024

shallowdream204 / DreamClear

[NeurIPS 2024🔥] DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation

Python 675 34 Updated Oct 25, 2024

bytedance / Hybrid-SD

Python 16 Updated Oct 30, 2024

Sakshi113 / MMAU

Python 20 1 Updated Oct 29, 2024

bghira / SimpleTuner

A general fine-tuning kit geared toward diffusion models.

Python 1,760 166 Updated Nov 7, 2024

XLabs-AI / x-flux-comfyui

Python 1,072 71 Updated Oct 30, 2024

OpenPipe / best-hn

Jupyter Notebook 7 Updated Oct 29, 2024

anliyuan / Ultralight-Digital-Human

一个超轻量级、可以在移动端实时运行的数字人模型

Python 797 134 Updated Nov 4, 2024

om-ai-lab / OmAgent

A Streamlined Multimodal Agent Framework for Smart Hardware and More

Python 1,111 92 Updated Nov 7, 2024

JishengBai / AudioSetCaps

A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline

Python 45 2 Updated Nov 5, 2024

KoljaB / RealtimeTTS

Converts text to speech in realtime

Python 1,977 200 Updated Nov 1, 2024

haoheliu / audioldm_eval

This toolbox aims to unify audio generation model evaluation for easier comparison.

Python 301 31 Updated Sep 29, 2024

usefulsensors / moonshine

Fast and accurate automatic speech recognition (ASR) for edge devices

Python 2,052 85 Updated Nov 5, 2024

THUDM / GLM-4-Voice

GLM-4-Voice | 端到端中英语音对话模型

Python 2,111 169 Updated Oct 31, 2024

huggingface / chat-macOS

Making the community's best AI chat models available to everyone.

Swift 1,521 57 Updated Oct 24, 2024

corbt / agent.exe

TypeScript 2,908 259 Updated Oct 24, 2024

mattt / ollama-swift

A Swift client library for interacting with Ollama

Swift 113 6 Updated Oct 28, 2024

Text-to-Audio / AudioLCM

PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.

Python 1,128 179 Updated Oct 25, 2024

LianjiaTech / BELLE

BELLE: Be Everyone's Large Language model Engine（开源中文对话大模型）

HTML 7,907 758 Updated Oct 16, 2024

mct10 / RepCodec

Models and code for RepCodec: A Speech Representation Codec for Speech Tokenization

Python 156 10 Updated Jul 12, 2024

lucidrains / nGPT-pytorch

Quick implementation of nGPT, learning entirely on the hypersphere, from NvidiaAI

Python 234 12 Updated Nov 3, 2024

facebookresearch / spiritlm

Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".

Python 749 47 Updated Oct 28, 2024

microsoft / BitNet

Official inference framework for 1-bit LLMs

C++ 10,850 733 Updated Nov 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly