Stars
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
An orchestration platform for the development, production, and observation of data assets.
dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
Access large language models from the command-line
Dependency injection framework for Python
Compare tables within or across databases
Easy benchmarking of all publicly accessible implementations of convnets
data load tool (dlt) is an open source Python library that makes data loading easy 🛠️
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Efficient data transformation and modeling framework that is backwards compatible with dbt.
A website aiming to provide more accessible documentation for JSON schema.
An open-source ML pipeline development platform
dbt (https://getdbt.com) adapter for DuckDB (https://duckdb.org)
do more with dbt. dbt-fal helps you run Python alongside dbt, so you can send Slack alerts, detect anomalies and build machine learning models.
🎣 List of `pre-commit` hooks to ensure the quality of your `dbt` projects.
Provides automated YAML management, a dbt server, streamlit workbench, and git-integrated dbt model output diff tools
Dockerfiles to be used to create Dockerhub trusted builds of NetflixOSS
sqlfmt formats your dbt SQL files so you don't have to
dbt package that is part of Elementary, the dbt-native data observability solution for data & analytics engineers. Monitor your data pipelines in minutes. Available as self-hosted or cloud service …
Dagster Labs' open-source data platform, built with Dagster.
A CLI and set of pre-commit hooks for jsonschema validation with built-in support for GitHub Workflows, Renovate, Azure Pipelines, and more!
A collection of Airflow operators, hooks, and utilities to elevate dbt to a first-class citizen of Airflow.
dbt-sugar is a CLI tool that allows users of dbt to have fun and ease performing actions around dbt models
Query Snowflake tables locally with DuckDB, without any need for a running warehouse