-
A big data project to develop a real-time data pipeline for analyzing the popularity and sentiments of trending topics on Twitter.
-
dolphinnext-docker Public
Forked from Leonguos/dolphinnext-dockerRun dolphinnext in a docker image
Dockerfile UpdatedMar 2, 2021 -
superstore-tableau-analysis Public
The intelligent dashboard provides an overview of the performance of Superstore by tracking various KPIs and analyzing product categories
UpdatedAug 30, 2020 -
hadoop-mapReduce-spark Public
Directory contains homework assignment from CS 6240 - Large-Scale Parallel Data Processing
Java UpdatedAug 30, 2020 -
-
loan-extension-analysis Public
This project aims to analyze data for loans through 2007-2015 from LendingClub available on Kaggle.
Jupyter Notebook MIT License UpdatedJul 25, 2020 -
Build a Flask web application to help users retrieve key restaurant information and feature-based reviews (generated by applying market-basket model – Apriori algorithm and NLP on user reviews).
-
Implement K-means and Hierarchical clustering algorithms to segment customers in order to gain insight into shopping behaviors, analyze product affinity, measure marketing effectiveness, and better…
-
Analyze and model player value & work-rate to support transfer decisions of football clubs.
random-forest linear-regression exploratory-data-analysis lasso feature-selection logistic-regression supervised-machine-learningUpdatedMay 7, 2020 -
item-based-recommender Public
An item-based recommender model that computes cosine similarity for each item pairs using the item factors matrix generated by Spark MLlib’s ALS algorithm and recommends top 5 items based on the se…
-
product-sales-forecasting Public
Forecasted product sales using time series models such as Holt-Winters, SARIMA and causal methods, e.g. Regression. Evaluated performance of models using forecasting metrics such as, MAE, RMSE, MAP…
-
-
Data-Science--Cheat-Sheet Public
Forked from georgearun/Data-Science--Cheat-SheetCheat Sheets
UpdatedJul 7, 2019 -
ProgrammingAssignment2 Public
Forked from rdpeng/ProgrammingAssignment2Repository for Programming Assignment 2 for R Programming on Coursera
R UpdatedJun 3, 2019 -
-
deeplearning-network-traffic Public
Network Traffic Identification with Convolutional Neural Networks
-
socialapp-2fa Public
A social networking service that implements best practices for performing two-factor authentication.
JavaScript MIT License UpdatedMay 17, 2018 -
-
python-spark-streaming Public
Forked from jleetutorial/python-spark-streamingJupyter Notebook UpdatedApr 4, 2018 -
chromium-vulnerabilities Public
Forked from VulnerabilityHistoryProject/chromium-vulnerabilitiesData for vulnerabilityhistory.org
Ruby UpdatedMar 30, 2018