This project shows how machine learning can be applied in business. Clustering your customer data reveals groups of customers that are similar in age, salary, gender, or any other feature present in your customer dataset. Knowing these groups can help you choose a marketing strategy suited to each segment.
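As a rough illustration of the idea, the sketch below clusters a small synthetic customer table with scikit-learn's KMeans. The data is made up for this example (it is not the project's dataset), and the two columns, age and annual salary, as well as the choice of three clusters, are assumptions.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

# Synthetic, made-up customers: columns are [age, annual salary].
rng = np.random.RandomState(0)
young_low = np.column_stack([rng.normal(25, 3, 50), rng.normal(30000, 4000, 50)])
mid_high = np.column_stack([rng.normal(40, 4, 50), rng.normal(80000, 6000, 50)])
senior_mid = np.column_stack([rng.normal(60, 5, 50), rng.normal(50000, 5000, 50)])
customers = np.vstack([young_low, mid_high, senior_mid])

# Put both features on the same scale so salary does not dominate the distances.
scaled = StandardScaler().fit_transform(customers)

# Three clusters is an assumption made for this synthetic data.
model = KMeans(n_clusters=3, n_init=10, random_state=0)
labels = model.fit_predict(scaled)

# Summarise each cluster in the original (unscaled) units.
for k in range(3):
    members = customers[labels == k]
    print("cluster {}: {} customers, mean age {:.1f}, mean salary {:.0f}".format(
        k, len(members), members[:, 0].mean(), members[:, 1].mean()))
```

Scaling matters here because KMeans uses Euclidean distance, and an unscaled salary column would otherwise outweigh age by several orders of magnitude.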
The dataset used in this project is included in the project folder.
- Python version used in this project: 3.5+
- Pandas 0.18.0
- Numpy 1.10.4
- Matplotlib 1.5.1
- Scikit-learn 0.17.1
The main code for this project is in customer_clustering.ipynb. A custom KMeans implementation is provided in kmeans_numpy_improved.py in the project folder; using it is optional.
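For readers curious what a NumPy-only k-means looks like, here is a minimal sketch of the standard Lloyd iteration. It is not the code in kmeans_numpy_improved.py; the function name `kmeans_sketch` and its parameters are hypothetical and chosen only for this illustration.

```python
import numpy as np

def kmeans_sketch(X, k, n_iter=100, seed=0):
    """Plain Lloyd's algorithm. Hypothetical helper, not the project's file."""
    rng = np.random.RandomState(seed)
    # Start from k distinct data points chosen at random.
    centroids = X[rng.choice(len(X), size=k, replace=False)].astype(float)
    for _ in range(n_iter):
        # Squared distance from every point to every centroid: shape (n, k).
        dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        # Move each centroid to the mean of its points; leave it in place if empty.
        new_centroids = centroids.copy()
        for j in range(k):
            if np.any(labels == j):
                new_centroids[j] = X[labels == j].mean(axis=0)
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    # Final assignment against the returned centroids.
    dists = ((X[:, None, :] - centroids[None, :, :]) ** 2).sum(axis=2)
    return centroids, dists.argmin(axis=1)

# Usage on two well-separated synthetic blobs (made-up data).
rng = np.random.RandomState(1)
X = np.vstack([rng.normal(0, 0.5, (30, 2)), rng.normal(5, 0.5, (30, 2))])
centroids, labels = kmeans_sketch(X, k=2)
print(centroids)
print(labels)
```

A single random start can settle in a poor local optimum on harder data, which is why library implementations typically repeat the procedure from several starts and keep the best result.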
To run this project you will need software that can open and run .ipynb files (Jupyter Notebook), such as Anaconda.
Once it is installed, run one of the following commands from a terminal or cmd:
ipython notebook customer_clustering.ipynb
or
jupyter notebook customer_clustering.ipynb