Pages

Home Last updated Jun 20, 2016
a9a binary classification (logistic regression) Last updated Jan 7, 2016
a9a binary classification (Mini Batch Gradient Descent) Last updated Jan 7, 2016
a9a binary dataset Last updated Jan 7, 2016
a9a logistic regression with iterations Last updated Jul 2, 2014
Adding rowid for each row Last updated May 9, 2015
Asterisk argument for UDTF does not work Last updated Jul 16, 2014
Binarize labels for Positive Negative instances Last updated Mar 16, 2016
Create Hivemall UDFs as permanent functions Last updated Dec 11, 2015
E2006 tfidf regression dataset Last updated May 4, 2015
E2006 tfidf regression evaluation (PA, AROW) Last updated Jun 14, 2016
Efficient Top k computation on Apache Hive using Hivemall UDTF Last updated Nov 24, 2015
Feature hashing Last updated Jun 28, 2016
Feature scaling Last updated Sep 19, 2015
Hadoop tuning for Hivemall Last updated Dec 4, 2015
How to use Model Mixing Last updated Nov 26, 2015
Input Format for classification and regression Last updated Jun 17, 2016
Installation Last updated Jan 12, 2016
Iris dataset multi class classification Last updated Mar 19, 2015
Iris multi class classification using RandomForest Last updated Jan 1, 2016
Iteration training of new20 binary multiclass classification with CW multiclass CW Last updated Jul 3, 2014
Iterative training using distributed cache Last updated Sep 8, 2014
Kaggle Titanic binary classification using Random Forest Last updated Jan 12, 2016
KDD cup 1999 network intrusion dataset #1 Last updated Oct 16, 2013
KDD cup 1999 network intrusion dataset #2 (modified) Last updated Nov 30, 2013
KDD2010a binary classification dataset Last updated Oct 9, 2014
KDD2010a classification Last updated May 4, 2015
KDD2010b arow classification Last updated May 4, 2015
kdd2010b binary classification dataset Last updated Jul 25, 2014
KDDCup 2012 track 2 CTR prediction (regression) Last updated Jun 14, 2016
KDDCup 2012 track 2 CTR prediction AdaGrad AdaDelta Last updated May 4, 2015
KDDCup 2012 track 2 CTR prediction dataset Last updated Aug 7, 2014
List of Hivemall's generic functions Last updated Apr 14, 2016
List of parameters of Matrix Factorization Last updated May 22, 2015
List of Supported Algorithms Last updated Jun 15, 2015
Logistic regression dataset generation Last updated Feb 10, 2016
Map side Join causes ClassCastException on Tez: LazyBinaryArray cannot be cast to [Ljava.lang.Object; Last updated Jun 28, 2016
MovieLens 10 folds Cross Validation Last updated Feb 7, 2015
MovieLens Dataset Last updated Feb 5, 2015
MovieLens Matrix Factorization Last updated May 22, 2015
Movielens Rating Prediction using Factorization Machine Last updated Jun 28, 2016
news20 binary classification #1 (Perceptron PA) Last updated May 4, 2015
news20 binary classification #2 (CW, AROW, SCW) Last updated May 4, 2015
news20 binary classification AdaGradRDA AdaGrad AdaDelta Last updated May 4, 2015
news20 binary classification on Amazon Elastic MapReduce Last updated Feb 27, 2014
news20 binary dataset Last updated Oct 21, 2015
news20 k NN search using b Bits minhash Last updated Jul 14, 2015
news20 multiclass classification #1 (PA) Last updated May 4, 2015
news20 multiclass classification #2 (CW, AROW, SCW) Last updated May 4, 2015
news20 multiclass classification #3 Ensemble learning Last updated May 15, 2015
news20 multiclass classification #4 one vs the rest classifier Last updated May 4, 2015
news20 multiclass dataset Last updated May 4, 2015
news20 multiclass dataset (preparation for one vs the rest classifiers) Last updated May 17, 2016
news20 Nearest Neighbor (kNN) Search Last updated Oct 15, 2015
Outlier Detection using Local Outlier Factor Last updated Dec 4, 2015
OutOfMemoryError in training Last updated Jun 12, 2014
Polynomial Features Last updated Jun 7, 2016
Quantify values of non number columns Last updated Aug 29, 2015
Real time prediction on MySQL and batch model construction on Hivemall Last updated May 14, 2015
Recommendation using Min wise LSH (minhash) Last updated Jul 31, 2015
SemanticException Generate Map Join Task Error: Cannot serialize object Last updated Jul 2, 2014
Statistical evaluation of a prediction model Last updated Jun 15, 2016
TFIDF calculation Last updated Mar 8, 2016
The number of mappers is less than splits in Hadoop 2.x Last updated Oct 24, 2014
Tips #1 use rand_amplify() instead of iterations Last updated Mar 8, 2015
Tokenizer Last updated Jan 12, 2016
Using explicit addBias() for a better prediction Last updated May 14, 2015
webspam binary classification Last updated Aug 10, 2014
webspam dataset Last updated Aug 9, 2014