Stars
flannel is a network fabric for containers, designed for Kubernetes
Recursive diff and patch for nested structures
Dictdiffer is a module that helps you to diff and patch dictionaries.
Terraform module for creating NAT instances in GCP.
Build and run containers leveraging NVIDIA GPUs
YugabyteDB - the cloud native distributed SQL database for mission-critical applications.
A list of papers about distributed consensus.
Analyzes resource usage and performance characteristics of running containers.
Open-source observability for your LLM application, based on OpenTelemetry
Enhancements tracking repo for Kubernetes
Archive of Kubernetes Design Proposals
Kubernetes metrics-related API types and clients
A simple command-line utility for querying and monitoring GPU status
Open source platform for AI Engineering: OpenTelemetry-native LLM Observability, GPU Monitoring, Guardrails, Evaluations, Prompt Management, Vault, Playground. Integrates with 40+ LLM Providers,…
Contrib repository for the OpenTelemetry Collector
Evolving the Prometheus exposition format into a standard.
NVIDIA GPU metrics exporter for Prometheus leveraging DCGM
The Prometheus monitoring system and time series database.
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Felafax is building AI infra for non-NVIDIA GPUs
A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
Neon: Serverless Postgres. We separated storage and compute to offer autoscaling, code-like database branching, and scale to zero.
Migrate to PostgreSQL in a single command!
Train transformer language models with reinforcement learning.