Full Stack Data Science

"Jack of all trades, master of none, though oft times better than master of one."

One of the common pain points that we have come across in big organizations is the last-mile delivery of data science applications.

You code, you test, you ship and you maintain

One common delivery vehicle is to create dashboards(BI). But the one, that's very useful and neglected more often than not, is to create APIs and provide seamless integration with other applications within the company. This requires you to have a basic understanding of machine learning, server-side programming and front-end application.

In this workshop, you would learn how to build a seamless end-to-end data driven application - Data Exploration, Machine Learning Model, RESTful API and Web Application - to solve a business prediction problem.

Course Content

Introduction to Data Science Process
Introduction to Data Exploration
Introduction to Machine Learning
Overview of the case we will be solving in the workshop
A simple ML Model
Creating RESTful API
Persisting model output
Updating the model as more data comes in (batch only - no streaming)
A simple webpage front-end to visualise the results and interact with the API.
Creating a simple application that accomplishes this end-to-end

An advanced version of the workshop, taught over two days, will cover the following additional topics

Building data pipeline and models
Deployment on cloud
Automate the workflow (eg: using airflow)

Target Audience

A programmer but not a data science practioner: A programmer with experience in server-side or front-end development and maybe has some familiarity with doing data analysis. You could be looking to transition in to building data driven products or a create a richer product experience with data.
A data science practioner but not a programmer: A data science with some experience in doing data analysis, preferably in a scripting language (R/Python/Scala), but wants to get a deeper and a more applied perspective on creating data driven products.

Pre-requisites

Programming knowledge is mandatory. Attendee should be able to write conditional statements, use loops, be comfortable writing functions and be able to understand code snippets and come up with programming logic.
Participants should have a basic familiarity of Python. Specifically, we expect participants to know the first four sections from this: http://anandology.com/python-practice-book/
Participants should also have some experience with using Python for Data Science. Specifically, participants should be able to work with the following python libraries
- jupyter: For doing literate programming in notebooks
- numpy: For scientific computation
- pandas: For data wrangling and transformation of tabular data (dataframes)
- scikit-learn: For building machine learning models

Software Requirements

We will be using Python data stack for the workshop. Please install Ananconda for Python 3.5 or 3.6 for the workshop. Additional requirement will be communicated to participants.

Install the required packages using conda.

conda install numpy pandas matplotlib seaborn scikit-learn pydotplus flask flask-wtf
conda install -c ioam holoviews bokeh

We'll also need a python library firefly-python that is not available as conda package. Install it using pip.

pip install firefly-python rorolite

pagarba.io

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
credit-risk-deploy		credit-risk-deploy
credit-risk		credit-risk
employee-attrition		employee-attrition
firefly-examples		firefly-examples
product-buy		product-buy
server-setup		server-setup
.gitignore		.gitignore
ConciseML.ipynb		ConciseML.ipynb
LICENSE		LICENSE
README.md		README.md
introduction-to-firefly.md		introduction-to-firefly.md
introduction-to-firefly.pdf		introduction-to-firefly.pdf
learning-journey.md		learning-journey.md
outline.md		outline.md
overview-3days.pdf		overview-3days.pdf
overview-day2.md		overview-day2.md
overview-day2.pdf		overview-day2.pdf
overview.md		overview.md
overview.pdf		overview.pdf

License

pagarba/full-stack-data-science-intern

Folders and files

Latest commit

History

Repository files navigation

Full Stack Data Science

Course Content

Target Audience

Pre-requisites

Software Requirements

About

Resources

License

Stars

Watchers

Forks

Languages