Stars
GLM-4 series: Open Multilingual Multimodal Chat LMs
An alternative for the World Wide Web - browse websites such as buss:https://yippie.rizz made in HTML, CSS and Lua. Custom web browser, custom HTML rendering engine, custom search engine, and more.
Open source alternative to Perplexity AI with ability to run locally
🔮 TinyChat is a lightweight Desktop client for modern Language Models designed for straightforward comprehension. Supports OpenAI, Anthropic, Meta, Mistral, Google and Cohere APIs.
dawidpotocki / llama.cpp
Forked from ggerganov/llama.cpp
LLM inference in C/C++
Type-safe YAML integration tests. Tests that write your docs. Tests that rewrite themselves.
Open-Sora: Democratizing Efficient Video Production for All
LocalChat is a ChatGPT-like chat that runs on your computer
Tensor parallelism is all you need. Run LLMs on an AI cluster at home using any device. Distribute the workload, divide RAM usage, and increase inference speed.
iohub / collama
Forked from sourcegraph/cody
VSCode AI coding assistant powered by self-hosted llama.cpp endpoint.
Llama.cpp-qt is a Python-based GUI wrapper for the LLama.cpp server, providing a user-friendly interface for configuring and running the server. LLama.cpp is a lightweight implementation of GPT-lik…
High-speed Large Language Model Serving on PCs with Consumer-grade GPUs
Insomnium is a fast local API testing tool that is privacy-focused and 100% local. For testing GraphQL, REST, WebSockets and gRPC. This is a fork of Kong/insomnia
OpenAI API client library for Rust (unofficial)
An unofficial API for Node.js based off of the Perplexity.AI website and mobile app
Scripts for fine-tuning Meta Llama3 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization and Q&A. Supportin…
Landmark Attention: Random-Access Infinite Context Length for Transformers
Let ChatGPT teach your own chatbot in hours with a single GPU!