Stars
⚡ Data quality testing for the modern data stack (SQL, Spark, and Pandas) https://www.soda.io
Apache Superset is a Data Visualization and Data Exploration Platform
A library that scrapes LinkedIn for user data
NHS England and NHS Improvement AU-Data Engineering - Azure Databricks Analytics
PySpark RDD, DataFrame, and Dataset examples in Python
Implementing best practices for PySpark ETL jobs and applications.
My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline, based on the lambda architecture, that aggregates Twitter and US stock market data for user sentiment anal…
What financial info would I have wanted to know when I was 22 and jumping into tech?
For the curious minds who want to understand how Bitcoin Blockchain works!
A simple implementation of blockchain in Java
Trained models with fast variant of the "best" LSTM models + legacy models
A deck tracker and deck manager for Hearthstone on Windows
A complete computer science study plan to become a software engineer.
Tesseract Open Source OCR Engine (main repository)
A list of awesome beginner-friendly projects.