roberthoenig
  • ETH Zurich
  • Zurich, Switzerland

Organizations

@FluxML

Get your documents ready for gen AI

Python 7,705 368 Updated Nov 9, 2024

A Django plugin for pytest.

Python 1,394 344 Updated Nov 1, 2024

The pytest framework makes it easy to write small tests, yet scales to support complex functional testing

Python 12,102 2,678 Updated Nov 10, 2024

Structured generation in Rust

Python 114 5 Updated Nov 8, 2024

pyright fork with various type checking improvements, improved vscode support and pylance features built into the language server

TypeScript 1,159 21 Updated Nov 10, 2024

An efficient implementation of a rate limiter for asyncio.

Python 515 22 Updated Oct 23, 2024

A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.

Python 13,810 1,032 Updated Nov 10, 2024

SGLang is a fast serving framework for large language models and vision language models.

Python 5,951 490 Updated Nov 10, 2024

Next Generation Vue UI Component Library

Vue 10,514 1,228 Updated Nov 9, 2024

First ever client for Territorial.io!

JavaScript 22 20 Updated Oct 28, 2024

Adding guardrails to large language models.

Python 4,067 310 Updated Nov 6, 2024

✨ Build AI interfaces that spark joy

Python 5,270 345 Updated Nov 4, 2024

This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient LLM GPU selections and cost-effective AI models. LLM provide…

TypeScript 208 7 Updated Sep 27, 2024

VS Code extension that provides type checking and analysis for Python code using mypy.

TypeScript 99 18 Updated Aug 24, 2024

An elegant HTTP Cache implementation for HTTPX and HTTP Core.

Python 178 22 Updated Nov 2, 2024

Create partial models from pydantic models

Python 50 8 Updated Nov 7, 2024

High-performance retrieval engine for unstructured data

Python 852 61 Updated Nov 1, 2024

PDF to Markdown with vision models

Python 5,987 320 Updated Nov 10, 2024

Agentic components of the Llama Stack APIs

Python 3,863 553 Updated Nov 9, 2024

A framework for few-shot evaluation of language models.

Python 6,918 1,851 Updated Nov 9, 2024

SpotServe: Serving Generative Large Language Models on Preemptible Instances

99 8 Updated Feb 22, 2024

FlexFlow Serve: Low-Latency, High-Performance LLM Serving

C++ 1,699 226 Updated Nov 10, 2024

A trace analysis tool for AI agents.

Python 118 9 Updated Oct 11, 2024

LLM inference in C/C++

C++ 67,528 9,697 Updated Nov 10, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,171 861 Updated Jul 1, 2024

Online playground for OpenAI tokenizers

TypeScript 703 88 Updated Oct 17, 2024
Python 31 Updated Jun 19, 2024

The all-in-one solution for RAG. Build, scale, and deploy state of the art Retrieval-Augmented Generation applications

Python 3,564 265 Updated Nov 9, 2024

BAML is a language that helps you get structured data from LLMs, with the best DX possible. Works with all languages. Check out the promptfiddle.com playground

Rust 1,314 49 Updated Nov 10, 2024

A self-organizing file system with llama 3

Jupyter Notebook 4,934 308 Updated Oct 24, 2024