Skip to content

Latest commit

 

History

History
254 lines (185 loc) · 4.6 KB

pyspark.mllib.rst

File metadata and controls

254 lines (185 loc) · 4.6 KB

MLlib (RDD-based)

Classification

pyspark.mllib.classification

LogisticRegressionModel LogisticRegressionWithSGD LogisticRegressionWithLBFGS SVMModel SVMWithSGD NaiveBayesModel NaiveBayes StreamingLogisticRegressionWithSGD

Clustering

pyspark.mllib.clustering

BisectingKMeansModel BisectingKMeans KMeansModel KMeans GaussianMixtureModel GaussianMixture PowerIterationClusteringModel PowerIterationClustering StreamingKMeans StreamingKMeansModel LDA LDAModel

Evaluation

pyspark.mllib.evaluation

BinaryClassificationMetrics RegressionMetrics MulticlassMetrics RankingMetrics

Feature

pyspark.mllib.feature

Normalizer StandardScalerModel StandardScaler HashingTF IDFModel IDF Word2Vec Word2VecModel ChiSqSelector ChiSqSelectorModel ElementwiseProduct

Frequency Pattern Mining

pyspark.mllib.fpm

FPGrowth FPGrowthModel PrefixSpan PrefixSpanModel

Vector and Matrix

pyspark.mllib.linalg

Vector DenseVector SparseVector Vectors Matrix DenseMatrix SparseMatrix Matrices QRDecomposition

Distributed Representation

pyspark.mllib.linalg.distributed

BlockMatrix CoordinateMatrix DistributedMatrix IndexedRow IndexedRowMatrix MatrixEntry RowMatrix SingularValueDecomposition

Random

pyspark.mllib.random

RandomRDDs

Recommendation

pyspark.mllib.recommendation

MatrixFactorizationModel ALS Rating

Regression

pyspark.mllib.regression

LabeledPoint LinearModel LinearRegressionModel LinearRegressionWithSGD RidgeRegressionModel RidgeRegressionWithSGD LassoModel LassoWithSGD IsotonicRegressionModel IsotonicRegression StreamingLinearAlgorithm StreamingLinearRegressionWithSGD

Statistics

pyspark.mllib.stat

Statistics MultivariateStatisticalSummary ChiSqTestResult MultivariateGaussian KernelDensity ChiSqTestResult KolmogorovSmirnovTestResult

Tree

pyspark.mllib.tree

DecisionTreeModel DecisionTree RandomForestModel RandomForest GradientBoostedTreesModel GradientBoostedTrees

Utilities

pyspark.mllib.util

JavaLoader JavaSaveable LinearDataGenerator Loader MLUtils Saveable