Stars
The Internals of Spark Structured Streaming
The best place to learn data engineering. Built and maintained by the data engineering community.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
A boilerplate-free library for loading configuration files
Easy to maintain open source documentation websites.
This is a repo with links to everything you'd ever want to learn about data engineering
The Patterns of Scalable, Reliable, and Performant Large-Scale Systems
A guide series explaining how to setup a personal small homelab running a Kubernetes cluster with VMs on a Proxmox VE standalone server node.
Virtual whiteboard for sketching hand-drawn like diagrams
Podman: A tool for managing OCI containers and pods.
Manifold is a Java compiler plugin, its features include Metaprogramming, Properties, Extension Methods, Operator Overloading, Templates, a Preprocessor, and more.
Redpanda is a streaming data platform for developers. Kafka API compatible. 10x faster. No ZooKeeper. No JVM!
The web framework for content-driven websites. ⭐️ Star to support our work!
A vulnerability scanner for container images and filesystems
Open-source feature management solution built for developers.
The conventional commits specification
Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.
Master programming by recreating your favorite technologies from scratch.