Lists (1)
Sort Name ascending (A-Z)
Stars
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.
A high-throughput and memory-efficient inference and serving engine for LLMs
An OAI compatible exllamav2 API that's both lightweight and fast
SGLang is a fast serving framework for large language models and vision language models.
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-spee…
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Inference and training library for high-quality TTS models.
GoodbyeDPI — Deep Packet Inspection circumvention utility (for Windows)
Drag & drop UI to build your customized LLM flow
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Anole: An Open, Autoregressive and Native Multimodal Models for Interleaved Image-Text Generation
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
FOSS Image background remover with 10 open source rmbg models
Multilingual Voice Understanding Model
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.
RSDuck / duckstation
Forked from stenzek/duckstationFast PlayStation 1 emulator for x86-64/AArch32/AArch64
DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, …
Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models
Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation
MARS5 speech model (TTS) from CAMB.AI
Upgraded repo includes more capabilities, converted the cmd .py scripts to function more intuitively, added 147 different depth output colour map methods, introduced batch image as well as video pr…