Big-Data

This repo contains my experience and code with big data technologies, including Kafka, Cassandra, Spark, Elasticsearch, Node.js, Redis, Bootstrap, jQuery, and D3.js. All the data-pipeline code is written in Python. Kafka acts as the high-volume data transport, Cassandra as the NoSQL database, Spark handles the stream processing, Elasticsearch serves as the fast search engine, and Node.js runs the web server.

What did I do?

First, I fetch stock data from Google Finance and send it through Kafka. Spark Streaming then consumes the raw data from the Kafka broker and computes the average stock price for each timestamp, and the results are pushed to a Redis channel for the server to read. Finally, the real-time data is displayed with Bootstrap, jQuery, and D3.js.
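
The exact record layout isn't shown here, but each message flowing through the pipeline is most naturally a small JSON document. A minimal sketch of the assumed format (field names are illustrative, not taken from the repo):

```python
import json

# Hypothetical shape of one stock quote; the repo's actual field names may differ
record = {
    "symbol": "AAPL",                    # stock ticker
    "last_trade_price": 151.25,          # latest quoted price
    "timestamp": "2016-10-01T12:00:00Z", # quote time
}
payload = json.dumps(record)             # serialized form sent through Kafka and Redis
```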

How to run?

Suppose your Docker virtual machine's IP is 192.168.99.100. First run the Flask data producer (set the port, Kafka broker IP, and Kafka topic in your dev.cfg):

export ENV_CONFIG_FILE=`pwd`/config/dev.cfg
python flask_data_producer.py
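
For orientation, here is a minimal sketch of what a Flask data producer like this might look like. It is not the repo's actual code: the config key names (PORT, KAFKA_BROKER, KAFKA_TOPIC) and the fetch_price helper are assumptions, and the old Google Finance quote endpoint has since been retired, so a placeholder fetcher stands in for it.

```python
import atexit
import json
import time

from apscheduler.schedulers.background import BackgroundScheduler
from flask import Flask
from kafka import KafkaProducer

app = Flask(__name__)
# dev.cfg is a Flask-style (Python syntax) config file; these key names are assumptions
app.config.from_envvar('ENV_CONFIG_FILE')

producer = KafkaProducer(bootstrap_servers=app.config['KAFKA_BROKER'])

def fetch_price(symbol):
    """Placeholder quote fetcher. The original pulled from Google Finance,
    whose public quote API has been retired; return a dummy record instead."""
    return {"symbol": symbol, "last_trade_price": 100.0, "timestamp": time.time()}

def produce():
    """Fetch one quote and push it to the configured Kafka topic."""
    record = fetch_price("AAPL")
    producer.send(app.config['KAFKA_TOPIC'], json.dumps(record).encode('utf-8'))

scheduler = BackgroundScheduler()
scheduler.add_job(produce, 'interval', seconds=1)  # one quote per second
scheduler.start()
atexit.register(scheduler.shutdown)

if __name__ == '__main__':
    app.run(host='0.0.0.0', port=app.config['PORT'])
```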

Run the Redis publisher:

python redis_publisher.py `your kafka topic` 192.168.99.100:9092 `your redis channel` 192.168.99.100 6379
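
A publisher like this is essentially a Kafka-to-Redis relay. Below is a minimal sketch assuming kafka-python and redis-py; the argument order mirrors the command above, but this is not the repo's exact code.

```python
import sys

import redis
from kafka import KafkaConsumer

# usage: python redis_publisher.py <kafka topic> <kafka broker> <redis channel> <redis host> <redis port>
topic, broker, channel, redis_host, redis_port = sys.argv[1:6]

consumer = KafkaConsumer(topic, bootstrap_servers=broker)
client = redis.StrictRedis(host=redis_host, port=int(redis_port))

# Relay every Kafka message to the Redis pub/sub channel the web server subscribes to
for message in consumer:
    client.publish(channel, message.value)
```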

Run the Spark Streaming job; make sure spark-streaming-kafka-0-8-assembly_2.11-2.0.0.jar is on your Spark classpath:

spark-submit pyspark_streaming.py `your kafka producer topic` `another kafka topic you send to after processing data` 192.168.99.100:9092
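
For reference, a sketch of the streaming job using the Kafka 0.8 direct-stream API shipped in that assembly jar. The per-symbol averaging and the JSON field names are assumptions about the repo's logic, not its exact code.

```python
import json
import sys

from kafka import KafkaProducer
from pyspark import SparkContext
from pyspark.streaming import StreamingContext
from pyspark.streaming.kafka import KafkaUtils

# usage: spark-submit pyspark_streaming.py <source topic> <target topic> <broker list>
source_topic, target_topic, brokers = sys.argv[1], sys.argv[2], sys.argv[3]

sc = SparkContext(appName="AverageStockPrice")
ssc = StreamingContext(sc, 5)  # 5-second micro-batches

# Direct stream from the broker; requires the 0-8 assembly jar on the classpath
stream = KafkaUtils.createDirectStream(
    ssc, [source_topic], {"metadata.broker.list": brokers})

def publish(rdd):
    """Send each (symbol, average) pair to the downstream Kafka topic."""
    def send_partition(partition):
        producer = KafkaProducer(bootstrap_servers=brokers)
        for symbol, avg in partition:
            producer.send(target_topic,
                          json.dumps({"symbol": symbol, "average": avg}).encode('utf-8'))
        producer.flush()
    rdd.foreachPartition(send_partition)

# Average the price per symbol within each batch: accumulate (sum, count), then divide
(stream.map(lambda kv: json.loads(kv[1]))                        # Kafka value is a JSON string
       .map(lambda r: (r["symbol"], (r["last_trade_price"], 1)))
       .reduceByKey(lambda a, b: (a[0] + b[0], a[1] + b[1]))
       .map(lambda kv: (kv[0], kv[1][0] / kv[1][1]))
       .foreachRDD(publish))

ssc.start()
ssc.awaitTermination()
```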

Start the server:

node index.js --port=3000 --redis_host=192.168.99.100 --redis_port=6379 --subscribe_topic=`kafka topic you send to after processing data`
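
With all four pieces running, the dashboard should be reachable on port 3000 of the machine running Node (e.g. http://localhost:3000); it subscribes to the Redis channel and redraws the D3.js chart as processed prices arrive.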

Final Results

UI:

Backend data pipeline: