- Singapore
Block or Report
Block or report kohjiaxuan
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuse-
Wikipedia-Article-Scraper Public
A complete Python text analytics package that allows users to search for a Wikipedia article, scrape it, conduct basic text analytics and integrate it to a data pipeline without writing excessive c…
-
Stock-Market-Dashboard Public
Creating a stock market dashboard from an external API that tracks daily performance of stocks
-
-
Fraud-Detection-Pipeline Public
A structured data science pipeline for classification problems that does scaling, sampling, k-fold cross validation with evaluation metrics
-
A NLP algorithm I developed to determine the similarity or relation between two documents/Wikipedia articles. Inspired by the cosine similarity algorithm and built from WordNet.
-
Data Project of Predicting HDB Resale Flat Prices with data cleaning, feature engineering and machine learning. Models used: Random Forest, XGBoost, Neural Networks, Decision Tree, Support Vector R…
-
Practice Juypter Notebooks for my machine learning journey with Python. Please refer to other repositories for completed projects!
Jupyter Notebook UpdatedAug 6, 2019 -
Data Science Competition that challenged teams to come up with creative ways to increase the revenue of an e-commerce company. Won 1st place! Write-up in repository
-
By visualizing the gradient descent algorithm applied on a set of points that fits a quadratic equation, we understand better how the algorithm works in machine learning
-
100-pandas-puzzles Public
Forked from ajcr/100-pandas-puzzles100 data puzzles for pandas, ranging from short and simple to super tricky (60% complete)
Jupyter Notebook MIT License UpdatedApr 15, 2019