This functions hub is intended to be a centralized location for open source contributions of function components.
These are functions expected to be run as independent mlrun pipeline compnents, and as public contributions,
it is expected that contributors follow certain guidelines/protocols (please chip-in).
function | kind | description | categories |
---|---|---|---|
aggregate | job | Rolling aggregation over Metrics and Lables according to specifications | data-prep |
bert-embeddings | nuclio | Get BERT based embeddings for given text | NLP, BERT, embeddings |
churn-server | nuclio | churn classification and predictor | serving, ml |
concept-drift | job | Deploy a streaming Concept Drift detector on a labeled stream | ml, serve |
concept-drift-streaming | nuclio | Deploy a streaming Concept Drift detector on a labeled stream. the nuclio part of the concept_drift function | ml, serve |
coxph-test | job | Test cox proportional hazards model | ml, test |
coxph-trainer | job | cox proportional hazards, kaplan meier plots | training, ml |
describe | job | describe and visualizes dataset stats | analysis |
describe-dask | job | describe and visualizes dataset stats | analysis |
describe-spark | job | ||
feature-perms | job | estimate feature importances using permutations | analysis |
feature-selection | job | Select features through multiple Statistical and Model filters | data-prep, ml |
gen-class-data | job | Create a binary classification sample dataset and save. | data-prep |
github-utils | job | add comments to github pull request | notifications, utils |
load-dataset | job | load a toy dataset from scikit-learn | data-source, ml |
model-monitoring-batch | job | ||
model-monitoring-stream | nuclio | ||
model-server | nuclio | generic sklearn model server | serving, ml |
model-server-tester | job | test model servers | ml, test |
pandas-profiling-report | job | Create Pandas Profiling Report from Dataset | analysis |
project-runner | nuclio | Nuclio based - Cron scheduler for running your MLRun projects | utils |
rnn-serving | serving | deploy an rnn based stock analysis model server. | model-serving |
send-email | job | Send Email messages through SMTP server | notifications |
sentiment-analysis-serving | serving | BERT based sentiment classification model | serving, NLP, BERT, sentiment analysis |
sklearn-classifier | job | train any classifier using scikit-learn's API | ml, training |
sklearn-classifier-dask | job | train any classifier using scikit-learn's API over Dask | ml, training, dask |
slack-notify | job | Send Slack notification | ops |
sql-to-file | job | SQL To File - Ingest data using SQL query | data-prep |
stream-to-parquet | nuclio | Saves a stream to Parquet and can lunch drift detection task on it | ml, serve |
test-classifier | job | test a classifier using held-out or new data | ml, test |
tf1-serving | nuclio | tf1 image classification server | serving, dl |
tf2-serving | nuclio | tf2 image classification server | serving, dl |
tf2-serving-v2 | serving | tf2 image classification server v2 | serving, dl |
v2-model-server | serving | generic sklearn model server | serving, ml |
v2-model-tester | job | test v2 model servers | ml, test |
virtual-drift | job | Compute drift magnitude between Time-Samples T and U | ml, serve, concept-drift |
xgb-custom | job | simulate data with outliers. | model-testing |
xgb-serving | nuclio | xgboost test data classification server | model-serving |
xgb-test | job | Test one or more classifier models against held-out dataset. | model-test |
xgb-trainer | job | train multiple model types using xgboost. | model-prep |