-
ETH Zurich
- Zurich, Switzerland
Stars
The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
pyright fork with various type checking improvements, improved vscode support and pylance features built into the language server
An efficient implementation of a rate limiter for asyncio.
A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
SGLang is a fast serving framework for large language models and vision language models.
Adding guardrails to large language models.
This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient LLM GPU selections and cost-effective AI models. LLM provide…
VS Code extension that provides type checking and analysis for Python code using mypy.
An elegant HTTP Cache implementation for HTTPX and HTTP Core.
Create partial models from pydantic models
High-performance retrieval engine for unstructured data
Agentic components of the Llama Stack APIs
A framework for few-shot evaluation of language models.
SpotServe: Serving Generative Large Language Models on Preemptible Instances
FlexFlow Serve: Low-Latency, High-Performance LLM Serving
Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.
The all-in-one solution for RAG. Build, scale, and deploy state of the art Retrieval-Augmented Generation applications
BAML is a language that helps you get structured data from LLMs, with the best DX possible. Works with all languages. Check out the promptfiddle.com playground
A self-organizing file system with llama 3