Getting started with Greenplum and Apache Spark in minutes

This page provides how to get started with Greenplum and Apache Spark. You can use these examples to apply these use cases.

Greenplum - Spark Architecture:

Use Cases:

Reference

Pivotal Greenplum

The Pivotal Greenplum Database (GPDB) is an advanced, fully featured, open source data warehouse. It provides powerful and rapid analytics on petabyte scale data volumes. Uniquely geared toward big data analytics, Greenplum Database is powered by the world’s most advanced cost-based query optimizer delivering high analytical query performance on large data volumes.

https://pivotal.io/pivotal-greenplum

Pivotal Greenplum-Spark Connector

The Pivotal Greenplum-Spark Connector provides high speed, parallel data transfer between Greenplum Database and Apache Spark clusters to support:

Apache Spark

Spark is a fast and general cluster computing system for Big Data. It provides high-level APIs in Scala, Java, Python, and R, and an optimized engine that supports general computation graphs for data analysis. It also supports a rich set of higher-level tools including Spark SQL for SQL and DataFrames, MLlib for machine learning, GraphX for graph processing, and Spark Streaming for stream processing. http:https://spark.apache.org/

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
conf/master		conf/master
data		data
doc		doc
docker		docker
docs		docs
images		images
scripts		scripts
usecase1		usecase1
usecase2		usecase2
usecase3		usecase3
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
PITCHME.md		PITCHME.md
README.md		README.md
README_DB.md		README_DB.md
README_FAA.md		README_FAA.md
README_SparkR.md		README_SparkR.md
README_WRITE_JDBC.md		README_WRITE_JDBC.md
_config.yml		_config.yml
config.sh		config.sh
runDocker.sh		runDocker.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Getting started with Greenplum and Apache Spark in minutes

Greenplum - Spark Architecture:

Use Cases:

Reference

Pivotal Greenplum

Pivotal Greenplum-Spark Connector

Apache Spark

License

About

Releases

Packages

Languages

License

kongyew/greenplum-spark-connector

Folders and files

Latest commit

History

Repository files navigation

Getting started with Greenplum and Apache Spark in minutes

Greenplum - Spark Architecture:

Use Cases:

Reference

Pivotal Greenplum

Pivotal Greenplum-Spark Connector

Apache Spark

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages