Applying unsupervised learning techniques in scikit-learn
Example screen:
This project is part of a Udacity program: Data Science degree, Project 3.
The objective of the project is to apply unsupervised learning techniques to identify segments of the population that form the core customer base for a mail-order sales company in Germany. These segments can then be used to deploy direct marketing campaigns towards audiences that will have the highest expected rate of return. The data was provided by Udacity's partners at Bertelsmann Arvato Analytics.
Main files in the repository:
Identify_Customer_Segments.ipynb
: Jupyter notebook including main Python code.
Identify segments of the population, based on historical customer data, that form the core customer base for a mail-order sales company in Germany. These segments can then be used to deploy direct marketing campaigns towards specific audiences in the general population which will have the highest expected rate of return.
Apply unsupervised learning tools included in scikit-learn
for dimensionality reduction and clustering:
The Jupyter Project highly recommends new users to install Anaconda; since it conveniently installs Python, the Jupyter Notebook, and other commonly used packages for scientific computing and data science.
Use the following installation steps:
-
Download Anaconda.
-
Install the version of Anaconda which you downloaded, following the instructions on the download page.
-
To run the notebook:
jupyter notebook Identify_Customer_Segments.ipynb
3.7.1 (default, Oct 23 2018, 14:07:42)
The Jupyter Notebook and the Python modules require the following Python libraries:
- collections
- matplotlib
- numpy
- pandas
- seaborn
- sklearn
- Udacity: Data Scientist Nanodegree program
- Bertelsmann Arvato Analytics: datasets
- Jupyter Documentation: Installing Jupyter Notebook
Juan Carlos Lopez
- Fork it (https://github.com/jclh/customers-PCA-clustering/fork)
- Create your feature branch (git checkout -b feature/fooBar)
- Commit your changes (git commit -am 'Add some fooBar')
- Push to the branch (git push origin feature/fooBar)
- Create a new Pull Request