User guide for Dato's Machine Learning Platform.
Turi Machine Learning Platform User Guide

Our mission at Turi is to build the most powerful and usable data science tools that enable you to go quickly from inspiration to production.

GraphLab Create is a Python package that allows programmers to perform end-to-end large-scale data analysis and data product development.

  • Data ingestion and cleaning with SFrames. SFrame is an efficient disk-based tabular data structure that is not limited by RAM. This lets you scale your analysis and data processing to handle terabytes of data, even on your laptop.

  • Data exploration and visualization with GraphLab Canvas. GraphLab Canvas is a browser-based interactive GUI that allows you to explore tabular data, summary plots and statistics.

  • Network analysis with SGraph. SGraph is a disk-based graph data structure that stores vertices and edges in SFrames.

  • Predictive model development with machine learning toolkits. GraphLab Create includes several toolkits for quick prototyping with fast, scalable algorithms.

  • Production automation with data pipelines. Data pipelines allow you to assemble reusable code tasks into jobs and automatically run them on common execution environments (e.g. Amazon Web Services, Hadoop).
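
The "not limited by RAM" idea in the SFrame bullet above can be illustrated with a minimal sketch. It uses only the Python standard library and is not GraphLab Create's API or SFrame's actual implementation; the file name `ratings.csv`, the columns `user_id` and `rating`, and the helper functions are hypothetical, chosen for illustration. The point is that an aggregate can be computed by streaming rows from disk, so memory use stays constant regardless of file size.

```python
import csv
import os
import tempfile


def streaming_column_mean(path, column):
    """Compute the mean of one numeric CSV column, reading one row at a time."""
    total = 0.0
    count = 0
    with open(path, newline="") as f:
        for row in csv.DictReader(f):  # rows are pulled lazily from disk
            total += float(row[column])
            count += 1
    return total / count if count else float("nan")


def write_demo_csv(path, n_rows):
    """Write a small hypothetical dataset with columns user_id and rating."""
    with open(path, "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["user_id", "rating"])
        for i in range(n_rows):
            writer.writerow([i, i % 5 + 1])  # ratings cycle through 1..5


# Build a throwaway file in a temp directory, then aggregate it row by row.
demo_path = os.path.join(tempfile.mkdtemp(), "ratings.csv")
write_demo_csv(demo_path, 1000)
mean_rating = streaming_column_mean(demo_path, "rating")
print(mean_rating)
```

Only the running total and row count are held in memory here, which is the property that lets a disk-based structure handle data larger than RAM.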

In this guide, you will learn how to use GraphLab Create to:

  • munge and explore both structured and unstructured data
  • use advanced machine learning methods to build predictive models and recommender systems
  • put your code into production and use it for real-world applications

Open source

The source for this user guide is available on GitHub under the 3-clause BSD license.

To build the user guide, install npm and run the following:

npm install
npm run gitbook-dep
npm run gitbook

The generated HTML will be located at _book/index.html.