Skip to content
View xuanhan863's full-sized avatar
👀
In machine learning...
👀
In machine learning...
  • Los Angeles, USA
Block or Report

Block or report xuanhan863

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Stars

Showing results

A lightning-fast workflow builder, it supports multimodal interaction, highly customizable extensions, and is intuitive to use even without any coding knowledge.

Go 221 26 Updated Jul 26, 2024

Agentic components of the Llama Stack APIs

Python 2,226 207 Updated Jul 27, 2024

This repository is an implementation that recreates the SketchGuidance feature of "ToonCrafter".

Python 52 Updated Jul 13, 2024

[ACL 2024] This is the Pytorch code for our paper "StyleDubber: Towards Multi-Scale Style Learning for Movie Dubbing"

Python 28 2 Updated Jul 24, 2024

Multitrack music mixing style transfer given a reference song using differentiable mixing console.

Jupyter Notebook 14 1 Updated Jul 11, 2024
Rust 45 Updated Jul 4, 2024

Generative models for conditional audio generation

Python 69 3 Updated Jul 25, 2024

Prompty makes it easy to create, manage, debug, and evaluate LLM prompts for your AI applications. Prompty is an asset class and format for LLM prompts designed to enhance observability, understand…

Python 230 16 Updated Jul 26, 2024

A big_vision inspired repo that implements a generic Auto-Encoder class capable in representation learning and generative modeling.

Python 26 1 Updated Jun 26, 2024

Learn How Transformers work in Generative AI with Interactive Visualization

JavaScript 127 17 Updated Jul 20, 2024

A novel framework manipulating CLIP embeddings via projection to remove objects using Stable Diffusion prior.

Python 32 Updated Jun 24, 2024

Implementation of "Disentangled Motion Modeling for Video Frame Interpolation"

Python 57 Updated Jul 2, 2024

code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"

Python 367 11 Updated Jul 26, 2024

Efficient Multi-modal Models via Stage-wise Visual Context Compression

Python 28 2 Updated Jul 3, 2024
Python 80 9 Updated Jul 2, 2024

Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls

Python 68 7 Updated Jul 16, 2024

Gemma 2 optimized for your local machine.

Python 268 19 Updated Jul 25, 2024

OpenAI Triton backend for Intel® GPUs

MLIR 115 31 Updated Jul 26, 2024

AuraSR: GAN-based Super-Resolution for real-world

Python 309 17 Updated Jul 23, 2024

A project that optimizes Whisper for low latency inference using NVIDIA TensorRT

Python 38 4 Updated Jul 3, 2024

Cambrian-1 is a family of multimodal LLMs with a vision-centric design.

Python 1,602 98 Updated Jul 26, 2024

libsoni: A Python Toolbox for Sonifying Music Annotations and Feature Representations

Jupyter Notebook 16 3 Updated Jun 11, 2024

Plug and Play XAI: Explain Your AI Models with Ease

Python 36 1 Updated Jul 24, 2024

Official Implementation for "The Devil is in the Details: StyleFeatureEditor for Detail-Rich StyleGAN Inversion and High Quality Image Editing"

Jupyter Notebook 83 7 Updated Jul 26, 2024

Fourier123: One Image to High-Quality 3D Object Generation with Hybrid Fourier Score Distillation

Python 66 1 Updated Jun 16, 2024

Official implementation of "AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising"

Python 123 5 Updated Jul 18, 2024

Orchestrate zero-shot computer vision models

HTML 346 6 Updated Jul 25, 2024
Next