Starred repositories
Companion repository which facilitates the creation of Gradio endpoints which are accessible from within Digital Audio Workstations (DAWs) through HARP.
WIP - Allows you to create DSPy pipelines using ComfyUI
Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs
Automatically find issues in image datasets and practice data-centric computer vision.
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
Awesome-LLM-3D: a curated list of resources on Multi-modal Large Language Models in the 3D world
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
Open, Multi-modal Catalog for Data & AI
Perplexica is an AI-powered search engine. It is an Open source alternative to Perplexity AI
Simplifying reinforcement learning for complex game environments
Code for the paper "Neural MMO: A Massively Multiagent Game Environment for Training and Evaluating Intelligent Agents"
Visualize streams of multimodal data. Fast, easy to use, and simple to integrate. Built in Rust using egui.
SakanaAI / DiscoPOP (forked from luchris429/DiscoPOP): Code for Discovering Preference Optimization Algorithms with and for Large Language Models
YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/
mPLUG-Owl & mPLUG-Owl2: Modularized Multimodal Large Language Model
Code for Husky, an open-source language agent that solves complex, multi-step reasoning tasks. Husky v1 addresses numerical, tabular and knowledge-based reasoning tasks.
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
Official repository for "Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing"
sd3 dreambooth lora training book, adapted from the diffusers doc... wip
Code for Paper "UniAnimate: Taming Unified Video Diffusion Models for Consistent Human Image Animation".
On-device Inference of Diffusion Models for Apple Silicon
Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3?"
A flexible and efficient codebase for training visually-conditioned language models (VLMs)
Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation