Skip to content
View luo-geng's full-sized avatar

Block or report luo-geng

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Stars

etl

19 repositories

Pentaho Data Integration ( ETL ) a.k.a Kettle

Java 7,657 3,445 Updated Oct 3, 2024

Apache Doris is an easy-to-use, high performance and unified analytics database.

Java 12,384 3,218 Updated Oct 2, 2024

One framework to develop, deploy and operate data workflows with Python and SQL.

Python 425 56 Updated Sep 30, 2024

The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.

Python 15,740 4,035 Updated Oct 3, 2024

🧙 Build, run, and manage data pipelines for integrating and transforming data.

Python 7,787 741 Updated Oct 1, 2024

Python Extract Transform and Load Tables of Data

Python 1,241 193 Updated May 12, 2024

Official repository for pygrametl - ETL programming in Python

Python 289 41 Updated May 4, 2024

Build data pipelines, the easy way 🛠️

TypeScript 4,055 256 Updated Jun 6, 2023

Workflow Engine for Kubernetes

Go 14,922 3,179 Updated Oct 3, 2024

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Python 17,753 2,393 Updated Sep 24, 2024

Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code

Java 12,736 4,585 Updated Sep 29, 2024

A lightweight opinionated ETL framework, halfway between plain scripts and Apache Airflow

Python 2,071 101 Updated Dec 15, 2023

Developer-friendly, minimalism Cron alternative, but with much more capabilities. It aims to solve greater problems.

Go 1,469 138 Updated Oct 1, 2024

A lightweight stream processing library for Go

Go 1,876 158 Updated Sep 14, 2024

A high-performance observability data pipeline.

Rust 17,586 1,542 Updated Oct 3, 2024

Free and source-available fair-code licensed workflow automation tool. Easily automate tasks across different services.

TypeScript 46,515 6,679 Updated Oct 3, 2024

An orchestration platform for the development, production, and observation of data assets.

Python 11,277 1,422 Updated Oct 3, 2024

Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.

Go 5,485 590 Updated Oct 3, 2024

Super simple build framework with fast, repeatable builds and an instantly familiar syntax – like Dockerfile and Makefile had a baby.

Go 11,338 398 Updated Sep 13, 2024