Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[NEW EXAMPLE] Spark - Cross-validation #10

Open
ainzzorl opened this issue Jul 15, 2021 · 1 comment
Open

[NEW EXAMPLE] Spark - Cross-validation #10

ainzzorl opened this issue Jul 15, 2021 · 1 comment
Assignees
Labels
cooldown new example New example proposal

Comments

@ainzzorl
Copy link
Owner

General

  • Project name: Spark
  • Example name: Cross-validation
  • Project home page: https://github.com/apache/spark
  • Programming language(s): Scala
  • Frameworks, libraries used: N/A

Description

Spark is a unified analytics engine for large-scale data processing. It has support for running machine learning workloads with MLlib. MLlib supports cross-validation: a standard procedure used to evaluate machine learning models on a limited data sample.

Links

CrossValidator.scala

What makes it interesting

  • Machine learning workloads use cross-validation very often, but it's almost always delegated to different libraries. It's interesting to see how it's implemented in one of the most popular libraries.
  • Spark is known for its performance and scalability.

Related work

N/A

Other

There are so many things to learn from Spark, but it's so big it's hard to get started with the code base. This example, however, seems rather independent from the rest of the code.

@ainzzorl ainzzorl added the new example New example proposal label Jul 15, 2021
@ainzzorl ainzzorl self-assigned this Jul 15, 2021
@ainzzorl
Copy link
Owner Author

ainzzorl commented Sep 7, 2021

I'm struggling to understand how it actually works. While it's an interesting piece of code, I'm moving it to "cooldown" for now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cooldown new example New example proposal
Projects
None yet
Development

No branches or pull requests

1 participant