Stars
This app is a serverless, micro service-driven web application created completely using AWS cloud services. The main application of this chatbot is to provide restaurant suggestions to its users ba…
In this project, we will perform customer segmentation on a dataset containing customer demographics and transactions data from an Indian bank. Using KMeans Algorithm.
Customer Segmentation by Creditcard Data in a Bank Using KMeans
Collection of Summer 2025 tech internships!
Loan Default Prediction using PySpark, with jobs scheduled by Apache Airflow and Integration with Spark using Apache Livy
Indian Bank Transactions: RFM Clustering and CLTV Prediction Model
This project involves grouping bank customers based on their spending behaviour using RFM analysis.
Teaching repo for Applied Data Science @ Columbia, a project-based course for data science skills (statistical thinking, machine learning, data engineering, team work, presentation, endurance of fr…
Using simple risk models (VaR and efficient frontier) to perform a study on the efficacy of using ESG scores alone as a risk measure. Data collected from the YahooFinance API in Python using adjust…
Enviroment, Social, Governance Ratings make companies work on sustainabilty but it also dents their bottomline. This project takes few stocks from forbes 100 esg stocks for 2022 and uses fbprophet …
Collaborative Repo for Group 2's project
This project examine the relationship between Sustainalytics Environmental, Social, and Governance (ESG) risk ratings and the financial performance of US S&P 500 listed firms.
This project used GARCH type models to estimate volatility and used delta hedging method to make a profit.
Financial risk analysis on a stocks portfolio through the VaR (Value at Risk), using Monte Carlo Simulation and Multiple Linear Regression.
Simulate and estimate volatility by GARCH with/without leverage, riskmetriks. Compute Value-at-Risk and Test on VaR Violation
Exploratory Data Analysis of MTA Turnstile Data
Xiaoyang Song and Han Liu's project for NYC restaurant inspection and rating database: a web application that enables users to interactively access, modify, and browse the inspection records, food …
Using K-Means algorithm for customer segmentation due to credit card behavior
My solution to the book <A collection of Data Science Take-home Challenges>
My solution to the book A Collection of Data Science Take-Home Challenges
Credit Risk analysis by using Python and ML
jun_bigdata大数据平台服务框架。实现了Kafka实时数据过滤、清洗、转换、消费,实现了Spark SQL对Redis、MongoDB等非关系型数据库的数据的读写;集成了规则引擎,可基于规则引擎实现客户标签、画像等相关功能。输出各类大屏展示看板DashBoard等
Machine Learning Case Study on Credit Loans.