- Toronto
- https://ryankelln.com/
Lists (10)
Sort Name ascending (A-Z)
Stars
A Godot GDExtension designed to run Python code in real time.
Open Source framework for voice and multimodal conversational AI
Code for "Director3D: Real-world Camera Trajectory and 3D Scene Generation from Text" (NeurIPS 2024).
Official code for the paper "StreamMultiDiffusion: Real-Time Interactive Generation with Region-Based Semantic Control."
A collection of some of my basic ComfyUI workflows. These are meant to act as building block to construct larger workflows of your own.
Interpolate and Upscale easily on Linux/MacOS/Windows.
Pythonic tool for orchestrating machine-learning/high performance/quantum-computing workflows in heterogeneous compute environments.
React UI + elegant infrastructure for AI Copilots, in-app AI agents, AI chatbots, and AI-powered Textareas 🪁
This Godot addon provides two simple APIs for using normal Input Actions, but spread out across a Keyboard player and up to 8 Joypad players.
Stable Diffusion WebUI extension for GPT4V-Image-Captioner
Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder
🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement for OpenAI, running on consumer-grade hardware. No GPU required. Runs gguf, transf…
Code and data for "AnyV2V: A Tuning-Free Framework For Any Video-to-Video Editing Tasks"
[Neurips 2024] 💫CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching
WebUI extension for ControlNet
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
[CVPR 2024] Depth Anything: Unleashing the Power of Large-Scale Unlabeled Data. Foundation Model for Monocular Depth Estimation
The fast Continuous Wavelet Transform (fCWT) is a library for fast calculation of CWT.
Distribute and run LLMs with a single file.
ENFUGUE is an open-source web app for making studio-grade images and video using generative AI.
Back In Time - An easy-to-use backup tool for GNU Linux using rsync in the back
This is the Mov2mov plugin for Automatic1111/stable-diffusion-webui.
An Implementation of Ebsynth for video stylization, and the original ebsynth for image stylization as an importable python library!
SUGAN:A Stable U-Net based Generative Adversarial Network
A software tool for creating musical gesture datasets and real-time recognition of musical gestures based on live audio input.
The ffmpegcv is a ffmpeg backbone for open-cv like Video Reader and Writer