Skip to content
View MrSathekge's full-sized avatar
😃
😃

Block or report MrSathekge

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
MrSathekge/README.md

Hi there, I'm Caron

My AWS Certified Cloud Practitioner Badge

I am a Mathematical & Computer Science graduate, with experience as an intern Data Scientist at ExploreAI Academy.

Proficient with SQL, Python, Power BI, Exploratory Data Analysis, Machine Learning, AWS EC2.

Highly Skilled In: Team Leading, Project Management, Time Management, Communication, Data Science Life Cycle.

Projects I have worked on:

  • Regression: Used regression machine learning models to predict the three-hourly electricity load shortfall.
  • Classification: Using Natural Language Processing (NLP) and classification machine learning models to classify tweets into negative, positive, neutral, or factual news.
  • Unsupervised learning: Built a movie recommender system that recommends movies to a user, and hosted the web application with the help of AWS.
  • Credit card fraud detection project where I built machine learning models to predict whether a credit transaction is fraudulent or not.
  • Loan default prediction project where the models predict whether a person is most likely to default on their loan or not.

Machine Learning models I have worked on:

  • Regression: Linear Regression, Decision Tree, Random Forest, XGBoost, Voting Regressor, Stacking Regressor.
  • Classification: Logistic Regression, Random Forest Classifier, KNN, Naive Bayes Classifier.
  • Unsupervised: Content-based and collaborative filtering using Singular Value Decomposition, Non-negative Matrix Factorization, Clustering to group movies together based on their similarity in terms of genre, director, actors, and other features, and Principal Component Analysis(PCA) to identify the most important features that contribute to a user's movie preferences.

Languages and tools ⚙️

AWS, Jupyter Notebook, Git and GitHub, Virtual Studio Code, PowerBI, Trello, Discord, Slack.

Python Logo Bash Logo AWS Logo VSCode Logo


Feel free to view more on Linkedin 😄 .

A picture of Caron Sathekge

Pinned Loading

  1. load-shortfall-regression-predict-api load-shortfall-regression-predict-api Public template

    Forked from Explore-AI/load-shortfall-regression-predict-api

    Forked from ExploreAI - Predict slides

    Python

  2. Classification-Streamlit-2207FTDS-Team-JM4 Classification-Streamlit-2207FTDS-Team-JM4 Public

    Forked from Explore-AI/classification-predict-streamlit-template

    Template repository for the EDSA Classification Predict

    Python

  3. Sentiment_Analysis_Vidhya_Analytics Sentiment_Analysis_Vidhya_Analytics Public

    Jupyter Notebook

  4. Credit_Card_Fraud_Detection Credit_Card_Fraud_Detection Public

    The data set contains transactions made by credit cards in September 2013 by European cardholders. It consists of transactions that occured in 2 days, where 492 of the transactions are fraud, out o…

    Jupyter Notebook

  5. Credit_Risk_Modeling Credit_Risk_Modeling Public

    Credit Risk : The possibility of a loss resulting from a borrower's failure to repay a loan or meet contractual obligations.

    Jupyter Notebook

  6. Twitter-Sentiment-Analysis-NLP-Hackathon Twitter-Sentiment-Analysis-NLP-Hackathon Public

    Forked from akshaydnicator/Twitter-Sentiment-Analysis-NLP-Hackathon

    Problem Statement: Given the tweets from customers about various tech firms who manufacture and sell mobiles, computers, laptops, etc, the task is to identify if the tweets have a negative sentimen…

    Python 1