Skip to content
View minguyen9988's full-sized avatar
🇸🇬
Focusing
🇸🇬
Focusing
Block or Report

Block or report minguyen9988

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Rokku project. This project acts as a proxy on top of any S3 storage solution providing services like authentication, authorization, short-term tokens, and lineage.

Scala 66 17 Updated Feb 22, 2024

Distributed SQL Query Engine in Python using Ray

Rust 224 14 Updated Nov 20, 2023

A generic framework for on-demand, incrementalized computation. Inspired by adapton, glimmer, and rustc's query system.

Rust 2,070 140 Updated Aug 14, 2024

A portable Pythonic Data Catalog API powered by Ray that brings exabyte-level scalability and fast, ACID-compliant, change-data-capture to your big data workloads.

Python 138 21 Updated Aug 12, 2024

A kubernetes operator for Apache NiFi

Rust 27 3 Updated Aug 15, 2024

Production-ready C++ Asynchronous Framework with rich functionality

C++ 2,346 273 Updated Aug 15, 2024

Free universal database tool and SQL client

Java 38,857 3,357 Updated Aug 15, 2024

Jackrabbit Relay is an API endpoint for cryptocurrency/forex exchanges.

Python 79 20 Updated Aug 11, 2024

Reverse proxy for AWS S3 with basic authentication.

Go 323 121 Updated Jun 10, 2023

High-performance diffing of large datasets across databases

Python 303 3 Updated Aug 15, 2024

Tool for easy backup and restore for ClickHouse® using object storage for backup files.

Go 1,213 218 Updated Aug 14, 2024

Golang connection multiplexing library

Go 2,178 231 Updated Aug 14, 2024

📊 Cube — The Semantic Layer for Building Data Applications

Rust 17,606 1,750 Updated Aug 15, 2024

A REST API template with FastAPI and Django integration.

Python 4 Updated Oct 1, 2020

🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.

Python 10,355 777 Updated Aug 14, 2024

A multi-cluster batch queuing system for high-throughput workloads on Kubernetes.

Go 455 132 Updated Aug 14, 2024

Scalable, redundant, and distributed object store for Apache Hadoop

Java 813 490 Updated Aug 15, 2024

Open Policy Agent (OPA) is an open source, general-purpose policy engine.

Go 9,429 1,311 Updated Aug 15, 2024

Open, Multi-modal Catalog for Data & AI

Java 2,100 308 Updated Aug 15, 2024

Spark on Kubernetes infrastructure Helm charts repo

Mustache 199 76 Updated Oct 20, 2022

Cluster API Provider for Nested Clusters

Go 297 65 Updated Apr 19, 2024

Kamaji is the Hosted Control Plane Manager for Kubernetes.

Go 960 85 Updated Aug 14, 2024

Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.

Scala 1,113 404 Updated Aug 15, 2024

Apache DataFusion Comet Spark Accelerator

Rust 706 136 Updated Aug 15, 2024

Turning PySpark Into a Universal DataFrame API

Python 209 4 Updated Aug 15, 2024

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 7,361 1,658 Updated Aug 15, 2024

Apache Beam is a unified programming model for Batch and Streaming data processing.

Java 7,729 4,211 Updated Aug 15, 2024

Open-Source Web UI for Apache Kafka Management

Java 9,304 1,139 Updated Jul 26, 2024

StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.

Java 8,496 1,716 Updated Aug 15, 2024
Next