Skip to content
Code for my Dataiku blog posts: http://www.dataiku.com/blog/
Branch: master
Clone or download
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
building-data-pipeline-data-science-studio
README.md

README.md

#Dataiku Post Code and Projects

This repo contains the code and project files for my posts on the Dataiku blog. They work for me, and should work for you. Use them wisely, and enjoy!

##Requirements

In order to run the code and sample projects, you'll need a few things:

  1. Python >= 3.5.1
  2. Java >= 1.8.0_74
  3. Data Science Studio >= 2.2.3

##What You'll Find

building-data-pipeline-data-science-studio

Learn how to use DSS to create a data pipeline to clean messy data.

  1. Create a Fake Dataset.ipynb: Jupyter notebook for creating a fake dataset like the one I used in the DSS project.
  2. dss_data_pipeline_example.zip: DSS project you can import and run.
You can’t perform that action at this time.