Utility for common use cases and bug workarounds in Apache Spark 2
-
Updated
Apr 25, 2019 - Scala
Utility for common use cases and bug workarounds in Apache Spark 2
SMOTE-MR: A distributed Synthetic Minority Oversampling Technique (SMOTE) for Big Data which applies a MapReduce based-approach. SMOTE-MR is categorized as an `approximated/ non exact` solution. Also, there is an `exact` solution called SMOTE-BD written by the author (See: https://github.com/majobasgall/smote-bd)
Implementation of SMOTE - Synthetic Minority Over-sampling Technique in SparkML / MLLib
Approx-SMOTE: fast SMOTE for Big Data on Apache Spark
Add a description, image, and links to the smote topic page so that developers can more easily learn about it.
To associate your repository with the smote topic, visit your repo's landing page and select "manage topics."