This project contains three example pipelines that demonstrate some of the capabilities of Apache Beam.
Please follow the steps below to run the examples:
- Configure `gcloud` with your credentials (see the example commands below)
- Enable the Cloud Dataflow API in your Google Cloud Platform project
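Assuming the `gcloud` CLI is already installed, both setup steps can typically be handled from a terminal like this:

```sh
# Authenticate gcloud and set application-default credentials
gcloud auth login
gcloud auth application-default login

# Enable the Cloud Dataflow API for the current project
gcloud services enable dataflow.googleapis.com
```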
Run the following command to execute the batch pipeline:
```sh
python -m complete.batch_pipeline.batch_pipeline \
  --input gs://[DATA FILE BUCKET]/users.csv \
  --output [PROJECT ID]:beam.users \
  --temp_location gs://[DATAFLOW STAGING BUCKET]/temp/ \
  --staging_location gs://[DATAFLOW STAGING BUCKET]/stage/ \
  --project [PROJECT ID] \
  --runner DataflowRunner
```
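The pipeline source isn't reproduced here, but as a rough sketch, a Beam batch job of this shape (CSV in Cloud Storage in, BigQuery out) typically looks like the following. The `user_id`/`name` column layout and BigQuery schema are illustrative assumptions, not the project's actual schema:

```python
import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions


def parse_csv_line(line):
    # Hypothetical two-column layout: user_id,name
    user_id, name = line.split(',')
    return {'user_id': int(user_id), 'name': name}


def run():
    # PipelineOptions picks up --project, --runner, etc. from sys.argv
    options = PipelineOptions()
    with beam.Pipeline(options=options) as p:
        (p
         | 'Read CSV' >> beam.io.ReadFromText(
             'gs://[DATA FILE BUCKET]/users.csv', skip_header_lines=1)
         | 'Parse rows' >> beam.Map(parse_csv_line)
         | 'Write to BigQuery' >> beam.io.WriteToBigQuery(
             '[PROJECT ID]:beam.users',
             schema='user_id:INTEGER,name:STRING',
             write_disposition=beam.io.BigQueryDisposition.WRITE_APPEND))


if __name__ == '__main__':
    run()
```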
Run the following command to execute the sum pipeline:
```sh
python -m template.sum_pipeline.sum_pipeline \
  --input gs://[DATA FILE BUCKET]/retail.csv \
  --output [PROJECT ID]:beam.retail \
  --temp_location gs://[DATAFLOW STAGING BUCKET]/temp/ \
  --staging_location gs://[DATAFLOW STAGING BUCKET]/stage/ \
  --project [PROJECT ID] \
  --runner DataflowRunner
```
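As a minimal sketch of a per-key sum over the retail data, assuming a hypothetical `product,amount` column layout (the real pipeline's columns and schema may differ):

```python
import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions


def to_kv(line):
    # Hypothetical layout: product,amount
    product, amount = line.split(',')
    return product, float(amount)


def run():
    options = PipelineOptions()
    with beam.Pipeline(options=options) as p:
        (p
         | 'Read CSV' >> beam.io.ReadFromText(
             'gs://[DATA FILE BUCKET]/retail.csv', skip_header_lines=1)
         | 'To key-value' >> beam.Map(to_kv)
         | 'Sum per key' >> beam.CombinePerKey(sum)
         | 'To row' >> beam.Map(lambda kv: {'product': kv[0], 'total': kv[1]})
         | 'Write to BigQuery' >> beam.io.WriteToBigQuery(
             '[PROJECT ID]:beam.retail',
             schema='product:STRING,total:FLOAT'))


if __name__ == '__main__':
    run()
```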
Run the following command to execute the streaming pipeline:
```sh
python -m template.streaming_pipeline.streaming_pipeline \
  --input projects/[PROJECT ID]/topics/[TOPIC NAME] \
  --output [PROJECT ID]:beam.streaming_sum \
  --temp_location gs://[DATAFLOW STAGING BUCKET]/temp/ \
  --staging_location gs://[DATAFLOW STAGING BUCKET]/stage/ \
  --project [PROJECT ID] \
  --runner DataflowRunner \
  --streaming
```
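A streaming sum over Pub/Sub generally needs windowing before aggregation. Here is a minimal sketch of that shape; the 60-second fixed window, the assumption that each message body is a single number, and the output schema are all illustrative, not the project's actual choices:

```python
import apache_beam as beam
from apache_beam.options.pipeline_options import PipelineOptions, StandardOptions
from apache_beam.transforms.window import FixedWindows


def run():
    options = PipelineOptions()
    options.view_as(StandardOptions).streaming = True
    with beam.Pipeline(options=options) as p:
        (p
         | 'Read Pub/Sub' >> beam.io.ReadFromPubSub(
             topic='projects/[PROJECT ID]/topics/[TOPIC NAME]')
         | 'Decode' >> beam.Map(lambda msg: float(msg.decode('utf-8')))
         | 'Window' >> beam.WindowInto(FixedWindows(60))
         # without_defaults() is required for a global combine in
         # non-global windows
         | 'Sum window' >> beam.CombineGlobally(sum).without_defaults()
         | 'To row' >> beam.Map(lambda total: {'total': total})
         | 'Write to BigQuery' >> beam.io.WriteToBigQuery(
             '[PROJECT ID]:beam.streaming_sum',
             schema='total:FLOAT'))


if __name__ == '__main__':
    run()
```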