Skip to content

A fun side project to learn more about crypto currency and code python

Notifications You must be signed in to change notification settings

jckail/jordan_kail_crypto_project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

jordan_kail_crypto_project

Hey! Thanks for checking out my github jordan_kail_crypto_project repo!

Lots of fun examples of using Pandas and BOTO3 to call web apis and generate good usable data via multi threaded python.

Instructions: Clone this Repo to Your local/ ec2!

Download Anaconda https://www.anaconda.com/download/#macos (Python 3.6) ***INSTALL TO /Users/YOURUSERNAME/ *** In Anaconda install instructions ***

On Terminal:

cd ~/Github/lit_crypto/requirements.txt

conda create --name lit_crypto --file requirements.txt (this requirements file is located at Github/lit_crypto/) (conda create --name lit_crypto --file ~/Documents/Github/lit_crypto/requirements.txt)

cd ~/Github/lit_crypto/

Create AWS User with admin level permissions or (GLUE, S3) Download install awscli (pip3 install awscli)

aws configure (input key) (input secret) (default region) (json)

Run the alpha runner ./runner.sh

You've now refreshed the litcrypto s3 data bucket, now create an AWS Glue Crawler ontop of s3:https://litcryptodata/ (There is python code within this repo to auto run crawlers with each run, just difficult/ unecessary to retrive roles at this time)

With your newly created Crawler you have access to query the data via Athena!

Questions? Comments? Want to Just talk tech? Reachout!

Output Data Avalible at https://s3.console.aws.amazon.com/s3/buckets/litcryptodata/?region=us-east-1

-Jordan [email protected]



Notes



Alpha:

(80% Completion) Data Ingestion acquisition and storage

Omega:

(50% Completion Not Published to Public Repo) Automation of reporting/ machine learning components to detect trends amongst data sets.

This is just a fun side project to better understand crypto currency and its correlations to other data sources such as social mentions, prices of rare materials in foreign countries, or even us stock prices.

All that is needed at this time is to download awscli to your local and running aws_config. Then running the balpha_runner.sh file under alpha, this should launch the runner with hourly params.

If you'd like to setup cron jobs and deploy on ec2 its fully compatible to run hourly or even minute runner due to the fact most of the python is multithreaded.

*Unfortunately I haven't had enough time in the past few months to work on this component as much as I hoped and cut out all of the "half baked" step wise regression and k means clustering python to save the time and frustration. (I currently do this for a living) Just Plug and Play with your favorite ML After you get the data.

I recommend using dbt to manage data models if you decide to warehouse i.e. Redshift/Postgres the data, sadly it does not currently support Athena.

Luckily AWS glue does a great job of categorizing data in combination with using the "data models " directory attached to this repo and just saving as standard sql Queries.

PLEASE NOTE: Do what ever with my code, its public on github for a reason.

ALSO! There are a few keys from API's hard coded in some of they python scripts, this is a bad practice and laziness, pretty easy to fix by passing variables from a config file to the mt_alpha_runner.py

REQUIREMENTS: BUGS: VERSION 1.0 Dependencies:

--- Example Workflow --- Configure AWS (Check Connection) Launch Shell (Calls mt_alpha_runner.py w/Params) Run Setup (check for and/or create directories)

Call API to mine data

Store data on local

Transform to JSON and Zip

Save to S3

Run Glue Crawler

Run Query via athena /

Visualize data via Tableau(athena driver) /

Athena Query --> Pandas data frame --> Scikit learn ML

Should Log and save data to off repo directory so you don't have to worry about size on commits -- END --

About

A fun side project to learn more about crypto currency and code python

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages