In this tutorial, we work with covid19_open_data, a Google Cloud public dataset,
and calculate the total number of confirmed COVID-19 cases for every country. The dataset contains more than 19M records.
This repo was built mainly to accompany the blog post: Building an ETL pipeline on Google Cloud, A beginner guide to Apache Beam and Dataflow
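
To make the goal concrete, here is a minimal plain-Python sketch of the per-country aggregation described above. It is not the Beam pipeline from the blog; it only illustrates the group-and-sum step on a few made-up rows. The field names `country_name` and `new_confirmed`, the sample values, and the helper `total_confirmed_by_country` are assumptions for illustration and may not match the columns the pipeline actually reads.

```python
from collections import defaultdict

# Hypothetical sample rows: field names and values are illustrative assumptions,
# not data taken from the real covid19_open_data table.
rows = [
    {"country_name": "France", "new_confirmed": 10},
    {"country_name": "France", "new_confirmed": 5},
    {"country_name": "Italy", "new_confirmed": 7},
    {"country_name": "Italy", "new_confirmed": None},  # missing values can occur
]


def total_confirmed_by_country(records):
    """Sum the confirmed-case counts per country, treating missing counts as 0."""
    totals = defaultdict(int)
    for record in records:
        totals[record["country_name"]] += record["new_confirmed"] or 0
    return dict(totals)


totals = total_confirmed_by_country(rows)
print(totals)
```

In a Beam pipeline the same idea would typically be expressed by mapping each record to a (country, count) pair and combining the values per key, which lets the runner distribute the work across the 19M+ records.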