Add kernlab engine for svm_linear() #438

juliasilge · 2021-03-01T18:56:44Z

Closes #336

This PR adds a second engine for svm_linear(). We already have "LiblineaR" and this PR adds "kernlab".

library(tidymodels)

data(two_class_dat, package = "modeldata")
example_split <- initial_split(two_class_dat, prop = 0.99)
example_train <- training(example_split)
example_test  <-  testing(example_split)

set.seed(123)
mod <- svm_linear() %>%
  set_engine("kernlab") %>%
  set_mode("classification") %>%
  fit(Class ~ ., example_train)
#>  Setting default kernel parameters

mod
#> parsnip model object
#> 
#> Fit time:  856ms 
#> Support Vector Machine object of class "ksvm" 
#> 
#> SV type: C-svc  (classification) 
#>  parameter : cost C = 1 
#> 
#> Linear (vanilla) kernel function. 
#> 
#> Number of Support Vectors : 358 
#> 
#> Objective Function Value : -355.0963 
#> Training error : 0.179847 
#> Probability model included.

predict(mod, new_data = example_test)
#> # A tibble: 7 x 1
#>   .pred_class
#>   <fct>      
#> 1 Class2     
#> 2 Class1     
#> 3 Class2     
#> 4 Class1     
#> 5 Class1     
#> 6 Class1     
#> 7 Class2
predict(mod, new_data = example_test, type = "prob")
#> # A tibble: 7 x 2
#>   .pred_Class1 .pred_Class2
#>          <dbl>        <dbl>
#> 1        0.457       0.543 
#> 2        0.850       0.150 
#> 3        0.172       0.828 
#> 4        0.980       0.0203
#> 5        0.698       0.302 
#> 6        0.983       0.0166
#> 7        0.439       0.561
predict(mod, new_data = example_test, type = "raw")
#> [1] Class2 Class1 Class2 Class1 Class1 Class1 Class2
#> Levels: Class1 Class2

^{Created on 2021-03-01 by the reprex package (v1.0.0)}

Unlike the "LiblineaR" engine, the "kernlab" engine does support class probabilities.

R/svm_linear.R

DavisVaughan · 2021-03-03T15:39:56Z

tests/testthat/test_svm_linear.R

 })

 test_that('engine arguments', {

  LiblineaR_type <- svm_linear(mode = "regression") %>% set_engine("LiblineaR", type = 12)
+  kernlab_cv <- svm_linear(mode = "regression") %>% set_engine("kernlab", cross = 10)


So this cross arg seems to do internal cross validation, using accuracy as a metric for classification. When doing binary classification, do you happen to know if it is considering the first or second level as the event level? Does that matter at all here?

It looks like it is doing the same thing as what parsnip is doing, with no problems like what xgboost had:

library(parsnip) library(tidyverse) library(kernlab) #> #> Attaching package: 'kernlab' #> The following object is masked from 'package:purrr': #> #> cross #> The following object is masked from 'package:ggplot2': #> #> alpha data("PimaIndiansDiabetes", package = "mlbench") df <- PimaIndiansDiabetes %>% mutate(diabetes = fct_relevel(diabetes, 'pos')) set.seed(234) parsnip_fit <- svm_linear(mode = "classification") %>% set_engine("kernlab", cross = 10) %>% fit(diabetes ~ ., df) #> Setting default kernel parameters set.seed(234) kernlab_fit <- ksvm(diabetes ~ ., data = df, kernel = "vanilladot", cross = 10) #> Setting default kernel parameters parsnip_fit #> parsnip model object #> #> Fit time: 833ms #> Support Vector Machine object of class "ksvm" #> #> SV type: C-svc (classification) #> parameter : cost C = 1 #> #> Linear (vanilla) kernel function. #> #> Number of Support Vectors : 401 #> #> Objective Function Value : -396.4286 #> Training error : 0.226562 #> Cross validation error : 0.233117 #> Probability model included. kernlab_fit #> Support Vector Machine object of class "ksvm" #> #> SV type: C-svc (classification) #> parameter : cost C = 1 #> #> Linear (vanilla) kernel function. #> #> Number of Support Vectors : 401 #> #> Objective Function Value : -396.4286 #> Training error : 0.226562 #> Cross validation error : 0.233117 identical(parsnip_fit$fit@alpha, kernlab_fit@alpha) #> [1] TRUE

^{Created on 2021-03-03 by the reprex package (v1.0.0)}

^{Created on 2021-03-03 by the reprex package (v1.0.0)}

If there is a problem (I don't think there is), it would apply to all the kernlab engines and we should open a new issue and fix it in a new PR.

Sounds good, just wanted to check

github-actions · 2021-03-18T00:27:06Z

This pull request has been automatically locked. If you believe you have found a related problem, please file a new issue (with a reprex: https://reprex.tidyverse.org) and link to this issue.

juliasilge added 4 commits March 1, 2021 11:48

Add kernlab as engine for svm_linear()

f151dc6

Tests for kernlab linear SVM

fa093c1

Redocument

3a552c7

Update NEWS for new engine

babecae

juliasilge requested a review from DavisVaughan March 1, 2021 20:11

DavisVaughan approved these changes Mar 3, 2021

View reviewed changes

juliasilge merged commit 65a5ab8 into master Mar 3, 2021

juliasilge deleted the kernlab-linear-svm branch March 3, 2021 17:55

github-actions bot locked and limited conversation to collaborators Mar 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add kernlab engine for svm_linear() #438

Add kernlab engine for svm_linear() #438

Uh oh!

juliasilge commented Mar 1, 2021

Uh oh!

Uh oh!

DavisVaughan Mar 3, 2021

Uh oh!

juliasilge Mar 3, 2021 •

edited

Loading

Uh oh!

DavisVaughan Mar 3, 2021

Uh oh!

github-actions bot commented Mar 18, 2021

Uh oh!

Uh oh!

Add kernlab engine for svm_linear() #438

Add kernlab engine for svm_linear() #438

Uh oh!

Conversation

juliasilge commented Mar 1, 2021

Uh oh!

Uh oh!

DavisVaughan Mar 3, 2021

Choose a reason for hiding this comment

Uh oh!

juliasilge Mar 3, 2021 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

DavisVaughan Mar 3, 2021

Choose a reason for hiding this comment

Uh oh!

github-actions bot commented Mar 18, 2021

Uh oh!

Uh oh!

juliasilge Mar 3, 2021 •

edited

Loading