Block or Report
Block or report minguyen9988
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseStars
Language
Sort by: Recently starred
A REST API template with FastAPI and Django integration.
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
A multi-cluster batch queuing system for high-throughput workloads on Kubernetes.
Scalable, redundant, and distributed object store for Apache Hadoop
Open Policy Agent (OPA) is an open source, general-purpose policy engine.
Open, Multi-modal Catalog for Data & AI
Spark on Kubernetes infrastructure Helm charts repo
Cluster API Provider for Nested Clusters
Kamaji is the Hosted Control Plane Manager for Kubernetes.
Gluten is a middle layer responsible for offloading JVM-based SQL engines' execution to native engines.
Apache DataFusion Comet Spark Accelerator
Turning PySpark Into a Universal DataFrame API
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
Apache Beam is a unified programming model for Batch and Streaming data processing.
Open-Source Web UI for Apache Kafka Management
StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
A distributed, fast open-source graph database featuring horizontal scalability and high availability
Wren AI makes your database RAG-ready. Implement Text-to-SQL more accurately and securely.
Database connectivity API standard and libraries for Apache Arrow
An awesome & curated list of best LLMOps tools for developers
aider is AI pair programming in your terminal
Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
Replicate data from MySQL, Postgres and MongoDB to ClickHouse
S3 Reverse Proxy with GET, PUT and DELETE methods and authentication (OpenID Connect and Basic Auth)
SeaweedFS is a fast distributed storage system for blobs, objects, files, and data lake, for billions of files! Blob store has O(1) disk seek, cloud tiering. Filer supports Cloud Drive, cross-DC ac…