Stars
6 results for source starred repositories
A sample project designed to demonstrate an ETL process using the PySpark and Spark SQL APIs in Apache Spark.
Collections of data science, machine learning, and data visualization projects using pandas, sklearn, matplotlib, TensorFlow 2, Keras, and various ML algorithms such as random forest classifiers, boosting, etc.
Apache Spark (PySpark) Practice on Real Data
This SQL-based Walmart data analysis project aims to identify top-performing branches and products and to optimize sales strategies, using Kaggle's Walmart Sales Forecasting Competition dataset.
Sample MySQL and MariaDB database data that can be used for upgrade testing