A simple use of Pipeline that chains two steps: univariate feature selection with ANOVA, followed by a C-SVM trained on the selected features.


### Version

In [1]:
import sklearn
sklearn.__version__

'0.18.1'

### Imports

This tutorial imports [SelectKBest](http://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.SelectKBest.html#sklearn.feature_selection.SelectKBest), [f_regression](http://scikit-learn.org/stable/modules/generated/sklearn.feature_selection.f_regression.html#sklearn.feature_selection.f_regression) and [make_pipeline](http://scikit-learn.org/stable/modules/generated/sklearn.pipeline.make_pipeline.html#sklearn.pipeline.make_pipeline).

In [2]:
print(__doc__)

from sklearn import svm
from sklearn.datasets import samples_generator
from sklearn.feature_selection import SelectKBest, f_regression
from sklearn.pipeline import make_pipeline

Automatically created module for IPython interactive environment
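As a small illustration of what `make_pipeline` does with the imported pieces, the sketch below builds the same two-step pipeline and lists the step names that scikit-learn generates automatically (the lowercased class names). The variable names `pipe` and `step_names` are chosen here for illustration and are not part of the original notebook.

```python
from sklearn.feature_selection import SelectKBest, f_regression
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC

# make_pipeline names each step after its class, lowercased,
# so no explicit (name, estimator) tuples are needed.
pipe = make_pipeline(SelectKBest(f_regression, k=3), SVC(kernel='linear'))

step_names = [name for name, _ in pipe.steps]
print(step_names)  # ['selectkbest', 'svc']
```

These generated names are what you would use to reach an individual step later, for example through `pipe.named_steps['selectkbest']`.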


### Calculations

In [3]:
# generate a synthetic 4-class dataset to play with
X, y = samples_generator.make_classification(
    n_features=20, n_informative=3, n_redundant=0, n_classes=4,
    n_clusters_per_class=2)

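# ANOVA SVM-C
# 1) ANOVA filter: keep the 3 best-ranked features
#    (f_regression is kept from the original example; for a class-label
#    target, f_classif is the scorer that computes the ANOVA F-value)
anova_filter = SelectKBest(f_regression, k=3)
# 2) SVM classifier with a linear kernel
clf = svm.SVC(kernel='linear')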

anova_svm = make_pipeline(anova_filter, clf)
anova_svm.fit(X, y)
anova_svm.predict(X)

array([0, 0, 3, 2, 1, 0, 3, 3, 2, 2, 1, 0, 3, 3, 3, 0, 2, 3, 1, 1, 0, 2, 1,
       0, 2, 2, 2, 1, 0, 2, 0, 1, 3, 0, 2, 3, 1, 3, 1, 2, 1, 1, 2, 2, 0, 0,
       1, 1, 3, 0, 2, 1, 0, 0, 2, 2, 3, 0, 0, 2, 0, 1, 3, 0, 0, 3, 1, 1, 2,
       1, 0, 2, 1, 2, 3, 2, 1, 0, 3, 3, 0, 1, 1, 1, 2, 1, 3, 1, 3, 1, 3, 3,
       1, 0, 3, 1, 0, 1, 3, 2])
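The cell above predicts on the same data it was fitted on, so the output says little about generalization. The following is a minimal sketch of one way to check the pipeline on held-out data and to see which columns the filter kept. It departs from the notebook in a few labeled ways: it uses `f_classif` (the ANOVA F-value for class labels) rather than `f_regression`, it imports `make_classification` directly from `sklearn.datasets` because the `samples_generator` module is not available in newer scikit-learn releases, and it fixes `random_state=0` only to make the run repeatable. The 75/25 split is an arbitrary choice for illustration.

```python
from sklearn.datasets import make_classification
from sklearn.feature_selection import SelectKBest, f_classif
from sklearn.model_selection import train_test_split
from sklearn.pipeline import make_pipeline
from sklearn.svm import SVC

# Same dataset shape as in the notebook; random_state is an assumption
# added here so that repeated runs give the same data.
X, y = make_classification(
    n_features=20, n_informative=3, n_redundant=0, n_classes=4,
    n_clusters_per_class=2, random_state=0)

# Hold out a quarter of the samples for evaluation (arbitrary split size).
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y)

# f_classif is swapped in for f_regression because y holds class labels.
anova_svm = make_pipeline(SelectKBest(f_classif, k=3), SVC(kernel='linear'))
anova_svm.fit(X_train, y_train)

# Column indices of the 3 features the ANOVA filter kept.
selected = anova_svm.named_steps['selectkbest'].get_support(indices=True)
# Mean accuracy on the held-out samples.
accuracy = anova_svm.score(X_test, y_test)

print(selected)
print(accuracy)
```

Because the filter is fitted inside the pipeline, feature selection only ever sees the training split, which avoids leaking information from the test samples into the selection step.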

