Stars
Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache Polaris, the interoperable, open source catalog for Apache Iceberg
Visualize Your Ideas With Code
A set of examples for Motion Canvas
Trino Group Provider LDAP is a Trino (formerly Presto SQL) plugin to map user names to groups using an LDAP server
Trino plugin for logging query events into a separate log file.
Official repository of Trino, the distributed SQL query engine for big data, formerly known as PrestoSQL (https://trino.io)
🎨 Diagram as Code for prototyping cloud system architectures
Jupyterlab Extensions for the Impatient
JupyterLab Extensions by Examples
A command-line tool for launching Apache Spark clusters.
[NOT MAINTAINED] Light-weight Python OLAP framework for multi-dimensional data analysis
Find dates inside text using Python and get back datetime objects
Docker image with uWSGI and Nginx for Flask applications in Python running in a single container.