Skip to content
View chaosnytez's full-sized avatar

Block or report chaosnytez

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Beta Lists are currently in beta. Share feedback and report bugs.
Showing results

Audio Editor

C 12,537 2,266 Updated Nov 6, 2024

Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"

Python 6,655 772 Updated Nov 5, 2024

The best OSS video generation models

Python 1,768 175 Updated Nov 4, 2024

MINT-1T: A one trillion token multimodal interleaved dataset.

768 20 Updated Jul 31, 2024

The Blazor WebAssembly app that inspired the Microsoft //Build 2023 demo app.

C# 96 20 Updated Jan 19, 2024

TEN Agent is the world’s first real-time multimodal agent integrated with the OpenAI Realtime API, RTC, and features weather checks, web search, vision, and RAG capabilities.

Python 1,372 154 Updated Nov 6, 2024

We write your reusable computer vision tools. 💜

Python 24,025 1,793 Updated Nov 7, 2024

[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"

Jupyter Notebook 1,435 133 Updated Aug 15, 2024

Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

Python 9,445 1,296 Updated Sep 14, 2024

Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning

Python 2,858 336 Updated Aug 15, 2024

JoyHallo: Digital human model for Mandarin

Python 275 28 Updated Oct 8, 2024

Bring portraits to life!

Python 12,758 1,352 Updated Oct 20, 2024

Local Vector Database coded in c# supports Cosine Similarity, Jaccard Dissimilarity as well as Euclidean , Manhattan, ChebyShev and Canberra distances

C# 10 2 Updated Sep 26, 2024

A tool to download whole playlists, channels or single videos from youtube and also optionally convert them to almost any format you would like

C# 1,660 262 Updated Oct 21, 2024

The Open Toolkit library is a fast, low-level C# wrapper for OpenGL, OpenAL & OpenCL. It also includes windowing, mouse, keyboard and joystick input and a robust and fast math library, giving you e…

C# 3,239 632 Updated Sep 25, 2024

Examples using MLX Swift

Swift 1,000 107 Updated Nov 6, 2024
Python 353 26 Updated Jun 6, 2024

This is a ComfyUi-windows implementation for the image animation project -> UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation

Python 54 6 Updated Oct 10, 2024

This node was designed to help AI image creators to generate prompts for human portraits.

Python 902 185 Updated Sep 21, 2024

Stable Diffusion and Flux in pure C/C++

C++ 3,451 294 Updated Oct 24, 2024

Whisper.net. Speech to text made simple using Whisper Models

C# 570 87 Updated Nov 5, 2024

C# Wrapper for StableDiffusion.cpp

C# 60 10 Updated Oct 24, 2024

LLM inference in C/C++

C++ 67,381 9,672 Updated Nov 6, 2024

Official implementation of OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Python 243 13 Updated Sep 15, 2024

CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters totally), 2) Parameter-Efficient Training (49.57M parameters trainable) and 3) Simpl…

Python 887 104 Updated Oct 21, 2024

InstantDrag: Improving Interactivity in Drag-based Image Editing

Python 191 17 Updated Oct 14, 2024

⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。

Python 12,258 1,241 Updated Sep 28, 2024

3DTopia-XL: High-Quality 3D PBR Asset Generation via Primitive Diffusion

Python 974 24 Updated Oct 17, 2024
Next