Real-time, End-to-End, Advanced Analytics and Machine Learning Recommendation Pipeline

jparap/pipeline

## PANCAKE STACK

End-to-End Streaming Advanced Analytics and Machine Learning Recommendation Pipeline

Follow the Wiki to set up the Docker-based environment.

Architecture Overview

Pipeline Architecture Overview

Screenshots

Apache Zeppelin Notebooks

Stanford CoreNLP Sentiment Analysis

Stanford CoreNLP Sentiment

Jupyter/iPython Notebooks

SparkR Notebooks

TensorFlow Notebooks

Apache NiFi Data Flows

AirFlow Workflows

Presto Queries

Tableau Integration

Beeline Command-line Hive Client

Log Visualization with Kibana & Logstash

Spark, Spark Streaming, and Spark SQL Admin UIs

Spark Admin UI

Ganglia System and JVM Metrics Monitoring UIs

Ganglia Metrics UI

Tools Overview

Apache Spark, Redis, Apache Cassandra, Apache Kafka, NiFi, ElasticSearch, Logstash, Kibana, Apache Zeppelin, Ganglia, Hadoop HDFS, iPython Notebook, Docker, Tachyon
