Stars: minguyen9988
Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, short-term tokens, and lineage.
Distributed SQL Query Engine in Python using Ray
A generic framework for on-demand, incrementalized computation. Inspired by adapton, glimmer, and rustc's query system.
A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.
Production-ready C++ Asynchronous Framework with rich functionality
Free universal database tool and SQL client
Jackrabbit Relay is an API endpoint for cryptocurrency/forex exchanges.
Reverse proxy for AWS S3 with basic authentication.
High-performance diffing of large datasets across databases
Tool for easy backup and restore for ClickHouse® using object storage for backup files.
📊 Cube — The Semantic Layer for Building Data Applications
A REST API template with FastAPI and Django integration.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
A multi-cluster batch queuing system for high-throughput workloads on Kubernetes.
Scalable, redundant, and distributed object store for Apache Hadoop
Open Policy Agent (OPA) is an open source, general-purpose policy engine.
Open, Multi-modal Catalog for Data & AI
Spark on Kubernetes infrastructure Helm charts repo
Cluster API Provider for Nested Clusters
Kamaji is the Hosted Control Plane Manager for Kubernetes.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Apache DataFusion Comet Spark Accelerator
Turning PySpark Into a Universal DataFrame API
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Apache Beam is a unified programming model for Batch and Streaming data processing.
Open-Source Web UI for Apache Kafka Management
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.