Skip to content
View UranusSeven's full-sized avatar
🎯
Focusing
🎯
Focusing
Block or Report

Block or report UranusSeven

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

Local GPT🤖

26 repositories

A Blazing Fast AI Gateway. Route to 200+ LLMs with 1 fast & friendly API.

TypeScript 5,237 359 Updated Jul 10, 2024

AI chat for every model.

TypeScript 27,377 7,597 Updated Jul 8, 2024

Enhanced ChatGPT Clone: Features OpenAI, Assistants API, Azure, Groq, GPT-4 Vision, Mistral, Bing, Anthropic, OpenRouter, Vertex AI, Gemini, AI model switching, message search, langchain, DALL-E-3,…

TypeScript 15,332 2,553 Updated Jul 10, 2024

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 3,626 301 Updated Jul 10, 2024

The official repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 12,538 1,015 Updated Jun 27, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 3,261 290 Updated Jul 10, 2024

Inference code for CodeLlama models

Python 15,440 1,789 Updated May 21, 2024

SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

Python 2,811 180 Updated Jul 10, 2024

Official Implementation of EAGLE-1 and EAGLE-2

Python 649 63 Updated Jul 1, 2024

MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.

Python 1,760 164 Updated Jul 1, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 64,339 7,487 Updated Jul 2, 2024

Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Baichuan, Mixtral, Gemma, Phi, etc.) on Intel CPU and GPU (e.g., local PC with iGPU, discrete GPU such as Arc, Flex and…

Python 6,279 1,226 Updated Jul 10, 2024

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…

TypeScript 36,883 5,001 Updated Jul 10, 2024

Self-hosted AI coding assistant

Rust 18,361 775 Updated Jul 10, 2024

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

Python 128,858 25,564 Updated Jul 10, 2024

Stable Diffusion web UI

Python 135,721 25,903 Updated Jul 9, 2024

🦜🔗 Build context-aware reasoning applications

Python 88,867 13,982 Updated Jul 10, 2024

Examples and guides for using the OpenAI API

MDX 57,563 9,082 Updated Jul 10, 2024

LLM inference in C/C++

C++ 61,433 8,783 Updated Jul 10, 2024

Inference code for Llama models

Python 54,206 9,325 Updated May 15, 2024

A natural language interface for computers

Python 50,760 4,430 Updated Jul 9, 2024

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

Python 35,670 4,384 Updated Jul 9, 2024

LlamaIndex is a data framework for your LLM applications

Python 33,444 4,685 Updated Jul 10, 2024

Port of OpenAI's Whisper model in C/C++

C++ 33,038 3,304 Updated Jul 9, 2024

Structured Text Generation

Python 7,083 365 Updated Jul 9, 2024