# BentoML Scikit-learn Tutorial


This is a sample project demonstrating basic usage of [BentoML](https://github.com/bentoml) with
Scikit-learn.

In this project, we will train a classifier model using Scikit-learn and the Iris dataset, build
a prediction service that serves the trained model via an HTTP server, and containerize the
model server as a Docker image for production deployment.


Link to source code: https://github.com/bentoml/gallery/tree/main/quickstart

### Install Dependencies

Install the required Python packages:

In [1]:
!pip install -r https://raw.githubusercontent.com/bentoml/gallery/main/quickstart/requirements.txt --user





## Model Training

In [3]:
from sklearn import svm, datasets

# Load training data
iris = datasets.load_iris()
X, y = iris.data, iris.target

# Model Training
clf = svm.SVC()
clf.fit(X, y)

Save the `clf` model instance to the BentoML local model store:

In [3]:
import bentoml
bentoml.sklearn.save_model("iris_clf", clf)

Model(tag="iris_clf:5fjpqlp7isa7j3c4", path="C:\Users\hwang\bentoml\models\iris_clf\5fjpqlp7isa7j3c4\")

Saved models can be accessed via the `bentoml models` CLI command:

In [4]:
!bentoml models get iris_clf:latest

name: iris_clf                                                                 
version: 5fjpqlp7isa7j3c4                                                      
module: bentoml.sklearn                                                        
labels: {}                                                                     
options: {}                                                                    
metadata: {}                                                                   
context:                                                                       
  framework_name: sklearn                                                      
  framework_versions:                                                          
    scikit-learn: 1.1.1                                                        
  bentoml_version: 1.0.0rc3                                                    
  python_version: 3.9.7                                                        
signatures:                             

In [5]:
!bentoml models list

 Tag                        Module              Size       Creation Time       
 iris_clf:5fjpqlp7isa7j3c4  bentoml.sklearn     5.77 KiB   2022-07-09 01:06:37 
 iris_clf:gnq6orh7iomjl3c4  bentoml.sklearn     5.77 KiB   2022-07-09 00:54:23 
 iris_clf2:d3bdoqh7hom773…  bentoml.sklearn     5.77 KiB   2022-07-08 23:56:32 
 iris_clf:ecx4ls77hc2rh3c4  bentoml.sklearn     5.77 KiB   2022-07-08 23:35:07 
 iris_clf:kh3nvy74qwl5d3c4  bentoml.sklearn     5.77 KiB   2022-07-05 13:10:07 
 iris_clf:5vmudih4gcs7p3c4  bentoml.sklearn     5.77 KiB   2022-07-05 03:06:01 
 iris_clf:kanmpkp4fwp7r3c4  bentoml.sklearn     5.77 KiB   2022-07-05 02:40:08 
 iris_clf:prylms74fsl5f3c4  bentoml.sklearn     5.77 KiB   2022-07-05 02:34:13 
 iris_clf:sckamh74fcvkp3c4  bentoml.sklearn     5.77 KiB   2022-07-05 02:06:09 
 iris_clf:74p4xjp4e6qf73c4  bentoml.sklearn     5.77 KiB   2022-07-05 02:02:05 
 iris_clf:wjslp5x4ewnhl3c4  bentoml.sklearn     5.77 KiB   2022-07-05 01:45:37 
 tensorflow_mnist:zb6a5kx…  bentoml.

To verify that the saved model can be loaded correctly:

In [6]:
loaded_model = bentoml.sklearn.load_model("iris_clf:latest")

loaded_model.predict([[5.9, 3. , 5.1, 1.8]])

array([2])

In BentoML, the recommended way to run model inference when serving is through a Runner. A Runner
gives BentoML more flexibility in how it schedules inference computation, batches inference
requests, and takes advantage of the available hardware resources. A saved model can be loaded
as a Runner instance as shown below:


In [7]:
# Create a Runner instance:
iris_clf_runner = bentoml.sklearn.get("iris_clf:latest").to_runner()

# Runner.init_local() initializes the model in the current process; it is meant for development and testing only:
iris_clf_runner.init_local()

# This should yield the same result as the loaded model:
iris_clf_runner.predict.run([[5.9, 3., 5.1, 1.8]])

'Runner.init_local' is for debugging and testing only


array([2])
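
The request batching that a Runner makes possible can be pictured with a small, framework-free sketch. This is not BentoML's implementation, only an illustration of the idea: rows from several pending requests are merged into one `predict` call and the predictions are then split back per request. The names `run_batched`, `toy_predict` and `counting_predict` are hypothetical, and the petal-length threshold is an arbitrary stand-in for the trained model.

```python
from typing import Callable, List, Sequence

Row = Sequence[float]


def run_batched(
    predict: Callable[[List[Row]], List[int]],
    requests: List[List[Row]],
) -> List[List[int]]:
    """Serve several requests with a single predict call, then split the results."""
    # Flatten all rows from all requests into one batch.
    merged = [row for request in requests for row in request]
    predictions = predict(merged)

    # Hand each request the slice of predictions that belongs to it.
    results, start = [], 0
    for request in requests:
        end = start + len(request)
        results.append(list(predictions[start:end]))
        start = end
    return results


def toy_predict(rows: List[Row]) -> List[int]:
    """Hypothetical stand-in for a model: an arbitrary petal-length threshold."""
    return [0 if row[2] < 2.5 else 2 for row in rows]


batch_sizes = []  # records how many rows each predict call received


def counting_predict(rows: List[Row]) -> List[int]:
    batch_sizes.append(len(rows))
    return toy_predict(rows)


pending = [
    [[4.9, 3.0, 1.4, 0.2]],                        # request 1: one row
    [[5.9, 3.0, 5.1, 1.8], [6.2, 3.4, 5.4, 2.3]],  # request 2: two rows
]
results = run_batched(counting_predict, pending)
print(results)      # [[0], [2, 2]]
print(batch_sizes)  # [3]
```

Both requests are answered with one model call over three rows, which is the kind of saving batching aims for when many small requests arrive close together.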

## Serving the model

A simple BentoML Service that serves the model saved above looks like this:

In [8]:
%%writefile service.py
import numpy as np
import bentoml
from bentoml.io import NumpyNdarray

iris_clf_runner = bentoml.sklearn.get("iris_clf:latest").to_runner()

svc = bentoml.Service("iris_classifier", runners=[iris_clf_runner])

@svc.api(input=NumpyNdarray(), output=NumpyNdarray())
def classify(input_series: np.ndarray) -> np.ndarray:
    return iris_clf_runner.predict.run(input_series)


Overwriting service.py


### Iris Classifier Explained

https://www.youtube.com/watch?v=pTjsr_0YWas

1. Sepal length (cm)
2. Sepal width (cm)
3. Petal length (cm)
4. Petal width (cm)
5. Class: one of three species (Iris Setosa, Iris Versicolour, Iris Virginica)

Examples:

- `[4.9, 3.0, 1.4, 0.2]` --> 0 (Iris Setosa)
- `[6.1, 2.9, 4.7, 1.4]` --> 1 (Iris Versicolour)
- `[5.8, 2.7, 5.1, 1.9]` --> 2 (Iris Virginica)
- `[6.2, 3.4, 5.4, 2.3]` --> 2 (Iris Virginica)
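
Since the model returns class indices rather than names, a small lookup can make predictions easier to read. The sketch below uses the index-to-species mapping from the examples above; the names `IRIS_CLASS_NAMES` and `describe_prediction` are hypothetical helpers, not part of BentoML or Scikit-learn.

```python
# Mapping taken from the examples above: 0, 1, 2 are the class indices.
IRIS_CLASS_NAMES = {
    0: "Iris Setosa",
    1: "Iris Versicolour",
    2: "Iris Virginica",
}


def describe_prediction(class_indices):
    """Translate predicted class indices into species names."""
    return [IRIS_CLASS_NAMES[int(index)] for index in class_indices]


print(describe_prediction([2]))  # ['Iris Virginica']
```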

In [4]:
print(iris)

{'data': array([[5.1, 3.5, 1.4, 0.2],
       [4.9, 3. , 1.4, 0.2],
       [4.7, 3.2, 1.3, 0.2],
       [4.6, 3.1, 1.5, 0.2],
       [5. , 3.6, 1.4, 0.2],
       [5.4, 3.9, 1.7, 0.4],
       [4.6, 3.4, 1.4, 0.3],
       [5. , 3.4, 1.5, 0.2],
       [4.4, 2.9, 1.4, 0.2],
       [4.9, 3.1, 1.5, 0.1],
       [5.4, 3.7, 1.5, 0.2],
       [4.8, 3.4, 1.6, 0.2],
       [4.8, 3. , 1.4, 0.1],
       [4.3, 3. , 1.1, 0.1],
       [5.8, 4. , 1.2, 0.2],
       [5.7, 4.4, 1.5, 0.4],
       [5.4, 3.9, 1.3, 0.4],
       [5.1, 3.5, 1.4, 0.3],
       [5.7, 3.8, 1.7, 0.3],
       [5.1, 3.8, 1.5, 0.3],
       [5.4, 3.4, 1.7, 0.2],
       [5.1, 3.7, 1.5, 0.4],
       [4.6, 3.6, 1. , 0.2],
       [5.1, 3.3, 1.7, 0.5],
       [4.8, 3.4, 1.9, 0.2],
       [5. , 3. , 1.6, 0.2],
       [5. , 3.4, 1.6, 0.4],
       [5.2, 3.5, 1.5, 0.2],
       [5.2, 3.4, 1.4, 0.2],
       [4.7, 3.2, 1.6, 0.2],
       [4.8, 3.1, 1.6, 0.2],
       [5.4, 3.4, 1.5, 0.4],
       [5.2, 4.1, 1.5, 0.1],
       [5.5, 4.2, 1.4, 0.2],
     

Note: the `service.py` cell uses `%%writefile` because the `bentoml.Service` definition must live in its own `.py` file.

Start a dev model server to test out the service defined above:

In [None]:
# Open your web browser at http://127.0.0.1:3000 to view the Bento UI for sending test requests.
# On Windows, run without the --reload option.
#!bentoml serve service.py:svc --reload
!bentoml serve service.py:svc


Open your web browser at http://127.0.0.1:3000 to view the Bento UI for sending test requests.

You can also send requests with the `curl` command or any other HTTP client, e.g.:

```bash
curl -X POST -H "content-type: application/json" --data "[[5.9, 3, 5.1, 1.8]]" http://127.0.0.1:3000/classify
```
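
The same request can be sent from Python. The following is a minimal sketch using only the standard library, assuming the dev server is listening on `http://127.0.0.1:3000` as above; `build_classify_request` and `classify` are hypothetical helper names, and the four-values-per-row check simply reflects the four Iris features.

```python
import json
import urllib.request


def build_classify_request(rows, base_url="http://127.0.0.1:3000"):
    """Build (but do not send) a POST request for the /classify endpoint."""
    for row in rows:
        if len(row) != 4:
            raise ValueError(f"expected 4 features per row, got {len(row)}")
    body = json.dumps(rows).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url}/classify",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def classify(rows, base_url="http://127.0.0.1:3000"):
    """Send the rows to the running server and return the decoded JSON response."""
    request = build_classify_request(rows, base_url)
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.loads(response.read().decode("utf-8"))


# Building the request does not need a running server.
request = build_classify_request([[5.9, 3, 5.1, 1.8]])
print(request.get_method(), request.full_url)
```

With the server running, calling `classify([[5.9, 3, 5.1, 1.8]])` should return the same prediction as the `curl` command.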


### Build Bento for deployment

A Bento is the distribution format in BentoML. It captures all the source code, model files, config
files and dependency specifications required to run the service in production. Think
of it as a Docker-style container format designed for machine learning models.

To start building a Bento, create a `bentofile.yaml` in your project directory:

In [None]:
%%writefile bentofile.yaml
service: "service.py:svc"
labels:
  owner: bentoml-team
  project: gallery
include:
- "*.py"
python:
  packages:
    - scikit-learn
    - pandas

Next, run `bentoml build` from the current directory to start the Bento build:

In [None]:
!bentoml build

A new Bento is now built and saved to the local Bento store. You can view and manage it via the
`bentoml list`, `bentoml get` and `bentoml delete` CLI commands.

## Containerize and Deploy

A Bento is designed to run efficiently in a variety of environments. The BentoML ecosystem
offers many deployment options and tools, such as
[Yatai](https://github.com/bentoml/Yatai) for Kubernetes and [bentoctl](https://github.com/bentoml/bentoctl) for
direct deployment to cloud platforms.

In this guide, we will show you the most basic way of deploying a Bento, which is converting a Bento
into a Docker image containing the HTTP model server.

Make sure you have Docker installed and the Docker daemon running, then run the following command:

```bash
bentoml containerize iris_classifier:latest
```

This builds a new Docker image with all source code, model files and dependencies in place,
ready for production deployment. To start a container from this image locally, run the following,
replacing the version tag with the one from your own build:

```bash
docker run -p 3000:3000 iris_classifier:invwzzsw7li6zckb2ie5eubhd 
```
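
Once the container is started, one quick way to confirm that the `-p 3000:3000` port mapping works is to check whether the port accepts TCP connections. The sketch below uses only the standard library; `port_is_open` is a hypothetical helper, and an open port only shows that something is listening, not that the model server is healthy.

```python
import socket


def port_is_open(host: str, port: int, timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port can be established."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts and unreachable hosts.
        return False


if __name__ == "__main__":
    # Port 3000 is the one published by the docker run command above.
    print(port_is_open("127.0.0.1", 3000))
```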

## What's Next?

- 👉 [Pop into our Slack community!](https://l.linklyhq.com/l/ktO8) We're happy to help with any issue you face or even just to meet you and hear what you're working on.

- Dive deeper into the [Core Concepts](https://docs.bentoml.org/en/latest/concepts/index.html) in BentoML
- Learn how to use BentoML with other ML Frameworks at [Frameworks Guide](https://docs.bentoml.org/en/latest/frameworks/index.html) or check out other [gallery projects](https://github.com/bentoml/gallery)
- Learn more about model deployment options for Bento:
  - [🦄️ Yatai](https://github.com/bentoml/Yatai): Model Deployment at scale on Kubernetes
  - [🚀 bentoctl](https://github.com/bentoml/bentoctl): Fast model deployment on any cloud platform
