Mirror of Apache Mahout

Welcome to Apache Mahout!

The Apache Mahout™ project's goal is to build an environment for quickly creating scalable, performant machine learning applications.

For additional information about Mahout, visit the Mahout Home Page.

Setting up your Environment

Whether you are using Mahout's shell, running command-line jobs, or using it as a library to build your own apps, you'll need to set up several environment variables. Edit your environment in ~/.bash_profile on Mac or ~/.bashrc on many Linux distributions. Add the following:

export MAHOUT_HOME=/path/to/mahout
export MAHOUT_LOCAL=true # for running standalone on your dev machine, 
# unset MAHOUT_LOCAL for running on a cluster

You will also need $JAVA_HOME set and, if you are running on Spark, $SPARK_HOME as well.
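Before launching anything, it can be handy to confirm that these variables are actually set in the current shell. The following is a minimal sketch, not something shipped with Mahout: the function name check_mahout_env and its "spark" argument are hypothetical, and it only checks that each variable is non-empty, not that the paths exist.

```shell
# Hypothetical helper (not part of Mahout): print every required
# environment variable that is unset or empty.
# Pass "spark" as the first argument to also require SPARK_HOME.
check_mahout_env() {
  required="MAHOUT_HOME JAVA_HOME"
  if [ "${1:-}" = "spark" ]; then
    required="$required SPARK_HOME"
  fi
  rc=0
  for name in $required; do
    # Look up the variable whose name is stored in $name.
    eval "value=\${$name:-}"
    if [ -z "$value" ]; then
      echo "missing: $name"
      rc=1
    fi
  done
  # Non-zero exit status when at least one variable is missing.
  return $rc
}
```

Sourcing this from your profile and running check_mahout_env spark before starting the Spark shell prints one "missing: NAME" line per unset variable and returns a non-zero status if any are missing.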

Note: when running the spark-shell job, it can help to set some JVM options so you don't run out of memory:

MAHOUT_OPTS="-Xmx6g -XX:MaxPermSize=512m" mahout spark-shell

Using Mahout as a Library

Running any application that uses Mahout requires installing a binary or source version and setting up the environment. To compile from source:

  • To build without running tests, run mvn -DskipTests clean install
  • To run tests, run mvn test
  • To set up your IDE, run mvn eclipse:eclipse or mvn idea:idea

To use Mahout from Maven or sbt, add the appropriate dependencies to your pom.xml or build.sbt, following the template below.

To use the Samsara environment, you'll need to include both the engine-neutral math-scala dependency:

<dependency>
    <groupId>org.apache.mahout</groupId>
    <artifactId>mahout-math-scala_2.10</artifactId>
    <version>${mahout.version}</version>
</dependency>

and a dependency for back-end engine translation, e.g.:

<dependency>
    <groupId>org.apache.mahout</groupId>
    <artifactId>mahout-spark_2.10</artifactId>
    <version>${mahout.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.mahout</groupId>
    <artifactId>mahout-flink_2.10</artifactId>
    <version>${mahout.version}</version>
</dependency>

Examples

For examples of how to use Mahout, see the examples located in examples/bin.

For information on how to contribute, visit the How to Contribute Page.

Legal

Please see the NOTICE.txt included in this directory for more information.
