[NEW EXAMPLE] Spark - Cross-validation #10

ainzzorl · 2021-07-15T11:37:22Z

General

Project name: Spark
Example name: Cross-validation
Project home page: https://github.com/apache/spark
Programming language(s): Scala
Frameworks, libraries used: N/A

Description

Spark is a unified analytics engine for large-scale data processing. It has support for running machine learning workloads with MLlib. MLlib supports cross-validation: a standard procedure used to evaluate machine learning models on a limited data sample.
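To make the procedure concrete, here is a minimal sketch of k-fold cross-validation in plain Scala, with no Spark dependency. It is not taken from `CrossValidator.scala`, and Spark's actual splitting and model-selection logic may differ. The names `KFoldSketch`, `kFold`, and `crossValidate` are hypothetical, and assigning rows to folds by index modulo `k` is an assumption made to keep the example deterministic.

```scala
// Minimal k-fold cross-validation sketch in plain Scala (no Spark dependency).
// KFoldSketch, kFold and crossValidate are hypothetical names for illustration.
object KFoldSketch {

  // Split `data` into k (training, validation) pairs. Fold i validates on
  // every element whose index is congruent to i modulo k and trains on the
  // remaining elements. (Assumption: a deterministic split, not a random one.)
  def kFold[A](data: Seq[A], k: Int): Seq[(Seq[A], Seq[A])] = {
    require(k >= 2 && k <= data.size, "k must be between 2 and data.size")
    val indexed = data.zipWithIndex
    (0 until k).map { fold =>
      val (validation, training) =
        indexed.partition { case (_, i) => i % k == fold }
      (training.map(_._1), validation.map(_._1))
    }
  }

  // Fit a model on each training split, score it on the matching validation
  // split, and return the average score over all folds.
  def crossValidate[A, M](data: Seq[A], k: Int)(
      fit: Seq[A] => M,
      evaluate: (M, Seq[A]) => Double): Double = {
    val scores = kFold(data, k).map { case (train, valid) =>
      evaluate(fit(train), valid)
    }
    scores.sum / scores.size
  }

  def main(args: Array[String]): Unit = {
    val data = Seq(1.0, 2.0, 3.0, 4.0, 5.0, 6.0)
    // Toy "model": the mean of the training values.
    // Toy metric: mean squared error on the validation values.
    val mse = crossValidate[Double, Double](data, 3)(
      train => train.sum / train.size,
      (mean, valid) => valid.map(v => (v - mean) * (v - mean)).sum / valid.size
    )
    println(s"Average validation MSE over 3 folds: $mse")
  }
}
```

Every data point is used for validation exactly once and for training `k - 1` times, which is what makes the procedure useful when the data sample is limited.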

Links

CrossValidator.scala

What makes it interesting

Machine learning workloads use cross-validation very often, but the work is almost always delegated to a library. It's interesting to see how one of the most popular libraries implements it.
Spark is known for its performance and scalability.

Related work

N/A

Other

There is a lot to learn from Spark, but the code base is so big that it's hard to get started. This example, however, seems fairly independent of the rest of the code.

ainzzorl · 2021-09-07T08:05:10Z

I'm struggling to understand how it actually works. While it's an interesting piece of code, I'm moving it to "cooldown" for now.

ainzzorl added the "new example" (New example proposal) label Jul 15, 2021

ainzzorl self-assigned this Jul 15, 2021

ainzzorl added the cooldown label Sep 7, 2021
