A fast, simple, and cost-effective tool to replicate data from Postgres to data warehouses, queues, and storage

Go 2,229 92 Updated Nov 7, 2024

A collective list of free APIs

Python 316,918 33,782 Updated Oct 31, 2024

A demo for Ventura Analytics meetup - scheduling dbt jobs with Airflow

Python 1 Updated Nov 3, 2024

The most popular ClickHouse plugin for Airflow. 🔝 Top-1% downloads on PyPI: https://pypi.org/project/airflow-clickhouse-plugin! Based on mymarilyn/clickhouse-driver.

Python 143 36 Updated Aug 23, 2024

BigQuery ETL

Python 256 101 Updated Nov 6, 2024

Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 2024

Python 1,384 121 Updated Oct 18, 2024

An extremely fast Python linter and code formatter, written in Rust.

Rust 32,512 1,083 Updated Nov 7, 2024

⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

Java 12,617 1,076 Updated Nov 6, 2024

A high-performance observability data pipeline.

Rust 17,829 1,579 Updated Nov 7, 2024

Venice, Derived Data Platform for Planet-Scale Workloads.

Java 488 85 Updated Nov 7, 2024

An asyncio ClickHouse Python Driver with native (TCP) interface support.

Python 184 43 Updated Nov 1, 2024

ClickHouse dialect for SQLAlchemy

Python 437 130 Updated Oct 28, 2024

Fast, accurate and scalable probabilistic data linkage with support for multiple SQL backends

Python 1,352 147 Updated Nov 6, 2024

Resources for my talk at Data Con LA 2023: "Predicting Purchases, Rare Diseases, and More: Using Ordinal Regression to Estimate Rare Event Probabilities"

R 3 Updated Aug 12, 2023

Apache Airflow - OpenAPI Client for Python

Python 356 53 Updated Oct 3, 2024

Python library providing function decorators for configurable backoff and retry

Python 2,604 148 Updated May 2, 2024

Who Are You? Bayesian Prediction of Racial Category Using Surname and Geolocation

R 130 31 Updated Jun 14, 2024

Demo code to illustrate the execution of PyTest unit test cases for AWS Glue jobs in AWS CodePipeline using AWS CodeBuild projects

Python 39 21 Updated May 27, 2024

Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

Python 17,850 2,393 Updated Sep 24, 2024

Jupyter Notebook tutorials using astronomical databases and virtual observatory tools

Jupyter Notebook 69 15 Updated Nov 3, 2024

Vertica dialect for SQLAlchemy using the vertica-python client

Python 13 14 Updated Dec 19, 2023

Free Data Engineering course!

Jupyter Notebook 25,069 5,363 Updated Nov 4, 2024

Podman: A tool for managing OCI containers and pods.

Go 23,688 2,406 Updated Nov 6, 2024

Project demonstrating how to automate Prefect 2.0 deployments to AWS ECS Fargate

Python 113 24 Updated Jul 21, 2023

The open-source alert management and AIOps platform

TypeScript 4,763 682 Updated Nov 6, 2024

Official native Python client for the Vertica Analytics Database.

Python 379 180 Updated Oct 25, 2024

Python SQL Parser and Transpiler

Python 6,689 700 Updated Nov 6, 2024

OpenTelemetry Python API and SDK

Python 1,799 624 Updated Nov 6, 2024

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 274,551 46,169 Updated Aug 7, 2024

PyPI module for Graphlet AI Knowledge Graph Factory

Python 28 1 Updated Apr 1, 2023