
Refactor run_ml() API for future expandability #251

Closed
kelly-sovacool opened this issue Apr 29, 2021 · 1 comment
Labels: JOSS-paper, wontfix (This will not be worked on)

@kelly-sovacool (Member):

From openjournals/joss-reviews#3073 (comment):

The run_ml() function currently implements 5 off-the-shelf ML algorithms, while providing 12 other parameters for training criteria, hyperparameters, feature importance, etc. If it is to support more algorithms, custom metrics, or training parameters in the future, I'd imagine there will be limitations imposed by the function arguments. I'd suggest the function take in 3 objects, e.g. run_ml(dataset, model, metrics, [args]), where a metrics object allows the user to select standard metrics or define their own metric functions given the model output and true labels.

@zenalapp (Collaborator):

run_ml() already takes these objects:

  1. dataset: The input dataset.
  2. method: The ML model to be used. While we only officially support 5 models, all of the models supported by caret (https://topepo.github.io/caret/available-models.html) should work in our package, and if caret supports additional models in the future, those should also work in mikropml. We realize that the model options are not as generalizable as in e.g. PyTorch, since users must choose from the options caret supports; however, our code relies heavily on caret to perform the underlying model training. Additionally, as mikropml is oriented toward beginner practitioners, we believe it does not need to provide the option to include custom models.
  3. perf_metric_function and perf_metric_name: The performance metric to be used. We chose sensible defaults, but the user can provide their own performance metrics if they would like.
  4. hyperparameters: The values of hyperparameters in the model that the user would like to tune.
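The four objects above can be seen together in a single call. This is a hedged sketch, not output from the thread: the `otu_mini_bin` dataset and `dx` outcome column are the example data bundled with mikropml, the `glmnet` method string comes from caret's model list, and the metric function and hyperparameter grid shown here are illustrative choices, not necessarily the package defaults.

```r
# Sketch: one run_ml() call exercising each of the objects listed above.
# Assumes the mikropml package is installed; values are illustrative.
library(mikropml)

results <- run_ml(
  dataset = otu_mini_bin,                # 1. the input dataset (bundled example data)
  outcome_colname = "dx",                #    column holding the true labels
  method = "glmnet",                     # 2. any caret-supported method string
  perf_metric_function = caret::twoClassSummary,  # 3. a standard (or user-defined) metric function
  perf_metric_name = "ROC",              #    which of its metrics to report
  hyperparameters = list(               # 4. custom tuning grid for this model
    alpha = 0,
    lambda = c(0.1, 1, 10)
  ),
  seed = 2021
)
```

A user-defined metric only needs to follow caret's summary-function convention (a function of the observed/predicted data frame returning a named numeric vector), so the existing `perf_metric_function` argument already covers the reviewer's "metrics object" suggestion without changing the API.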
