Block or Report
Block or report josephmachado
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
beginner_de_project_stream Public
Simple stream processing pipeline
-
bitcoinMonitor Public
Near real time ETL to populate a dashboard.
-
data_engineering_project_template Public template
A template repository to create a data project with IAC, CI/CD, Data migrations, & testing
-
beginner_de_project Public
Beginner data engineering project - batch edition
-
e2e_datapipeline_test Public
Example repo to create end to end tests for data pipeline.
-
change_data_capture Public
Repo for CDC with debezium blog post
-
Cost Efficient Data Pipelines with DuckDB
-
Code for blog at https://www.startdataengineering.com/post/python-for-de/
-
sde_de101_josephmachado Public
Sample repo for startdataengineering DE 101 free course
-
-
simple_dbt_project Public
Code for dbt tutorial
-
simple_dbt_project_dev Public
Code for blog at https://www.startdataengineering.com/post/uplevel-dbt-workflow/
-
Code for "Efficient Data Processing in Spark" Course
-
data_helper Public
Code to help generate SQL for stakeholders. Code at https://www.startdataengineering.com/post/data-democratize-llm/
-
docker_for_data_engineers Public
Code for blog at: https://www.startdataengineering.com/post/docker-for-de/
-
-
Code to demonstrate data engineering metadata & logging best practices
-
Sample project to demonstrate data engineering best practices
-
-
-
analytical_dp_with_sql Public
Code for my "Efficient Data Processing in SQL" book.
-
socialetl Public
Project for "Data pipeline design patterns" blog.
-
hive-metastore Public
Forked from bitsondatadev/hive-metastore -
local_dev Public
Local development environment for python data projects, with Docker
-
online_store Public
End to end data engineering project
-
-
docker-trino-cluster Public
Forked from Lewuathe/docker-trino-clusterMultiple node presto cluster on docker container
-
data_test_ci Public
Repository showing how to automate data testing as part of CI
-
idempotent-data-pipeline Public
Making data pipelines idempotent
-
trigger_spark_with_lambda Public
Simple example showing how to trigger a spark job with AWS Lambda