You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Spark is a unified analytics engine for large-scale data processing. It has support for running machine learning workloads with MLlib. MLlib supports cross-validation: a standard procedure used to evaluate machine learning models on a limited data sample.
Machine learning workloads use cross-validation very often, but it's almost always delegated to different libraries. It's interesting to see how it's implemented in one of the most popular libraries.
Spark is known for its performance and scalability.
Related work
N/A
Other
There are so many things to learn from Spark, but it's so big it's hard to get started with the code base. This example, however, seems rather independent from the rest of the code.
The text was updated successfully, but these errors were encountered:
General
Description
Spark is a unified analytics engine for large-scale data processing. It has support for running machine learning workloads with MLlib. MLlib supports cross-validation: a standard procedure used to evaluate machine learning models on a limited data sample.
Links
CrossValidator.scala
What makes it interesting
Related work
N/A
Other
There are so many things to learn from Spark, but it's so big it's hard to get started with the code base. This example, however, seems rather independent from the rest of the code.
The text was updated successfully, but these errors were encountered: