Skip to content
#

aws-emr

Here are 129 public repositories matching this topic...

aws-data-pipeline

A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from locally hosted Airflow containers. The end product is a Superset dashboard and a Postgres database, hosted on an EC2 instance at this address (powered down):

  • Updated May 14, 2022
  • Python

Improve this page

Add a description, image, and links to the aws-emr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the aws-emr topic, visit your repo's landing page and select "manage topics."

Learn more