Skip to content
A curated list of practical business machine learning (BML) and business data science (BDS) applications for Accounting, Banking, Finance and Insurance, Customer, Employee, Legal, Management, Operations and Public matters.
Branch: master
Clone or download
Latest commit 5751c87 Mar 16, 2019
Type Name Latest commit message Commit time
Failed to load latest commit information.
assets Add files via upload Mar 5, 2019
LICENSE Initial commit Feb 19, 2019

Business Machine Learning and Data Science Applications


A curated list of applied business machine learning (BML) and business data science (BDS) examples and libraries. The code in this repository is in Python (primarily using jupyter notebooks) unless otherwise stated. The catalogue is inspired by awesome-machine-learning.

Caution: This is a work in progress, please contribute, especially if you are a subject expert in ML/DS for Accounting, Banking, Finance and Insurance, Customer, Employee, Legal, Management, Operations and Public matters.

If you want to contribute to this list (please do), send me a pull request or contact me @dereknow. Also, a listed repository should be deprecated if:

  • Repository's owner explicitly say that "this library is not maintained".
  • Not committed for long time (2~3 years).

Table of Contents

Department Applications


Machine Learning


Textual Analysis

Data, Parsing and APIs

Research And Articles

  • Understanding Accounting Analytics - An article that tackles the importance of accounting analytics.
  • VLFeat - VLFeat is an open and portable library of computer vision algorithms, which has Matlab toolbox.


  • Rutgers Raw - Good digital accounting research from Rutgers.


Banking, Finance and Insurance

Consumer Finance

Management and Operation


  • Zillow Prediction - Zillow valuation prediction as performed on Kaggle.
  • Real Estate - Predicting real estate prices from the urban environment.
  • Used Car - Used vehicle price prediction.


Insurance and Risk

Trading and Investment




Lifetime Value

  • Pareto/NBD Model - Calculate the CLV using a Pareto/NBD model.
  • Gamma-Gamma Model - Using deep-learning frameworks to identify accounting anomalies.
  • Cohort Analysis - Cohort analysis to group customers into mutually exclusive cohorts measured over time.


  • E-commerce - E-commerce customer segmentation.
  • Groceries - Segmentation for grocery customers.
  • Online Retailer - Online retailer segmentation.
  • Bank - Bank customer segmentation.
  • Wholesale - Clustering of wholesale customers.
  • Various - Multiple types of segmentation and clustering techniques.


  • RNN - Investigating customer behaviour over time with sequential analysis using an RNN model.
  • Neural Net - Demand forecasting using artificial neural networks.
  • Temporal Analytics - Investigating customer temporal regularities.
  • POS Analytics - Analytics driven customer behaviour ranking for retail promotions using POS data.
  • Wholesale Customer - Wholesale customer exploratory data analysis.
  • RFM - Doing a RFM (recency, frequency, monetary) analysis.
  • Returns Behaviour - Predicting total returns and fraudulent returns.
  • Visits - Predicting which day of week a customer will visit.
  • Bank: Next Purchase - A project to predict bank customers' most probable next purchase.
  • Bank: Customer Prediction - Predicting Target customers who will subscribe the new policy of the bank.
  • Next Purchase - Predict a customers’ next purchase also using feature engineering.
  • Customer Purchase Repeats - Using the lifetimes python library and real jewellery retailer data analyse customer repeat purchases.
  • AB Testing - Find the best KPI and do A/B testing.
  • Customer Survey (FirmAI) - Example of parsing and analysing a customer survey.
  • Happiness - Analysing customer happiness from hotel stays using reviews.
  • Miscellaneous Customer Analytics - Various tools and techniques for customer analysis.


Churn Prediction

  • Ride Sharing - Identify customer churn rates in order to target customers for retention campaigns.
  • KKDBox I - Variational deep autoencoder to predict churn customer
  • KKDBox II - A three step customer churn prediction framework using feature engineering.
  • Personal Finance - Predict customer subscription churn for a personal finance business.
  • ANN - Churn analysis using artificial neural networks.
  • Bike - Customer bike churn analysis.
  • Cost Sensitive - Cost sensitive churn analysis drivenby economic performance.










Policy and Regulatory

Judicial Applied



  • Topic Model Reviews - Amazon reviews for product development.
  • Patents - Forecasting strategy using patents.
  • Networks - Business categories from Yelp reviews using networks can help to identify pockets of demand.
  • Company Clustering - Hierarchical clusters and topics from companies by extracting information from their descriptions on their websites
  • Marketing Management - Programmatic marketing management.

Decision Optimisation

Casual Inference


  • Various - Various applies statistical solutions


  • Applied RL - Reinforcement Learning and Decision Making tutorials explained at an intuitive level and with Jupyter Notebooks
  • Process Mining - Leveraging A-priori Knowledge in Predictive Business Process Monitoring
  • TS Forecasting - Time series forecasting for important business applications.


  • Web Scraping (FirmAI) - Web scraping solutions for Facebook, Glassdoor, Instagram, Morningstar, Similarweb, Yelp, Spyfu, Linkedin, Angellist.


Failure and Anomalies

Load and Capacity Management

Prediction Management


Social Policies

  • Triage - General Purpose Risk Modeling and Prediction Toolkit for Policy and Social Good Problems.
  • World Bank Poverty I - A comparative assessment of machine learning classification algorithms applied to poverty prediction.
  • World Bank Poverty II - Repository for the World Bank Pover-t Test Competition Solution Overseas Company Land Ownership .
  • Overseas Company Land Ownership - Identifying foreign ownership in the UK.
  • CFPB - Consumer Finances Protection Bureau complaints analysis.
  • Cannabis Legalisation Effect - Effects of cannabis legalization on crime.

Election Analysis

Disaster Management

Urban Planning

  • Traffic Prediction - Multi attention recurrent neural networks for time-series (city traffic)
  • Predict Crashes - Crash prediction modeling application that leverages multiple data sources.
  • Predict Household Poverty - Predict the poverty of households in Costa Rica using automated feature engineering.


You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.