LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

Python 703 48 Updated Sep 12, 2024

Brand new TTS solution

Python 9,475 742 Updated Sep 13, 2024

A high-throughput and memory-efficient inference and serving engine for LLMs

Python 26,655 3,905 Updated Sep 14, 2024

An OAI-compatible exllamav2 API that's both lightweight and fast

Python 447 64 Updated Sep 11, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,094 356 Updated Sep 14, 2024

Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition (ASR), and text-to-spee…

Python 281 41 Updated Sep 14, 2024
Python 758 54 Updated Sep 6, 2024

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 638 31 Updated Sep 6, 2024

Inference and training library for high-quality TTS models.

Python 4,179 410 Updated Aug 19, 2024

GoodbyeDPI — Deep Packet Inspection circumvention utility (for Windows)

C 22,851 1,664 Updated Sep 6, 2024

Blazingly fast LLM inference.

Rust 3,396 244 Updated Sep 14, 2024

Scrape Reddit posts into a single Markdown file

Python 11 Updated Jul 28, 2024

Drag & drop UI to build your customized LLM flow

TypeScript 29,791 15,344 Updated Sep 14, 2024

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 166,462 44,058 Updated Sep 14, 2024

Anole: An Open, Autoregressive, and Native Multimodal Model for Interleaved Image-Text Generation

Python 641 36 Updated Aug 5, 2024

Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation

Python 3,286 258 Updated Aug 14, 2024

FOSS Image background remover with 10 open source rmbg models

JavaScript 209 26 Updated Jul 12, 2024

Multilingual Voice Understanding Model

Python 2,602 246 Updated Sep 2, 2024

Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

Python 4,719 474 Updated Sep 6, 2024

A CLI tool to convert your codebase into a single LLM prompt with source tree, prompt templating, and token counting.

Rust 1,519 83 Updated Sep 13, 2024
Python 35 4 Updated Jun 6, 2024

Fast PlayStation 1 emulator for x86-64/AArch32/AArch64

C++ 50 2 Updated Jun 5, 2024

DeepFuze is a state-of-the-art deep learning tool that seamlessly integrates with ComfyUI to revolutionize facial transformations, lipsyncing, Face Swapping, Lipsync Translation, video generation, …

Python 282 32 Updated Aug 1, 2024

Together Mixture-Of-Agents (MoA) – 65.1% on AlpacaEval with OSS models

Python 2,526 348 Updated Aug 10, 2024

Custom ComfyUI nodes for Vision Language Models, Large Language Models, Image to Music, Text to Music, Consistent and Random Creative Prompt Generation

Python 360 29 Updated Jul 31, 2024

Your image is almost there!

Python 7,199 416 Updated Jul 26, 2024

MARS5 speech model (TTS) from CAMB.AI

Jupyter Notebook 2,439 196 Updated Aug 1, 2024

Upgraded repo includes more capabilities, converted the cmd .py scripts to function more intuitively, added 147 different depth output colour map methods, introduced batch image as well as video pr…

Python 65 4 Updated Jul 24, 2024
122 4 Updated Sep 2, 2024