Mirror of Apache Mahout
Java Scala Perl6 Other
Permalink
Failed to load latest commit information.
bin MAHOUT-1894 Add Support for Spark 2.x closes apache/mahout#271 Feb 24, 2017
buildtools MAHOUT-1883 closes no PR, adds dataset filtering for minimal needed t… Oct 11, 2016
conf MAHOUT-1881:flink-config.yaml is not copied to /conf in Binary Distro… Sep 8, 2016
distribution Add code signing key to KEYS Feb 26, 2017
examples MAHOUT-1885: Inital commit of VCL bindings. closes apache/mahout#269 Jan 26, 2017
flink MAHOUT-1856 Add Framework for Models, Fitters, and Tests closes apach… Feb 1, 2017
h2o MAHOUT-1856 Add Framework for Models, Fitters, and Tests closes apach… Feb 1, 2017
hdfs MAHOUT-1883 closes no PR, adds dataset filtering for minimal needed t… Oct 11, 2016
integration MAHOUT-1883 closes no PR, adds dataset filtering for minimal needed t… Oct 11, 2016
math-scala MAHOUT-1924 Add Unit Test for DurbinWatson Test closes apache/mahout#282 Feb 24, 2017
math MAHOUT-1906: Ensure customJars are added to the MahoutContext under c… Jan 16, 2017
mr MAHOUT-1906: Ensure customJars are added to the MahoutContext under c… Jan 16, 2017
spark MAHOUT-1912: MAHOUT-1912: CLI driver tests not working with vienniacl… Feb 26, 2017
src MAHOUT-1881:flink-config.yaml is not copied to /conf in Binary Distro… Sep 8, 2016
viennacl-omp MAHOUT-1885: Inital commit of VCL bindings. closes apache/mahout#269 Jan 26, 2017
viennacl [MAHOUT-1903][MAHOUT-1907] VCL Vector memory copy fix. closes apache/… Feb 26, 2017
.gitignore MAHOUT-1856 Add Framework for Models, Fitters, and Tests closes apach… Feb 1, 2017
.travis.yml MAHOUT-1885: Inital commit of VCL bindings. closes apache/mahout#269 Jan 26, 2017
LICENSE.txt MAHOUT-1885: Inital commit of VCL bindings. closes apache/mahout#269 Jan 26, 2017
NOTICE.txt Merging branch flink-binding to master Apr 11, 2016
README.md Add Apache license header to READMEmd, this closes apache/mahout#259 Oct 7, 2016
doap_Mahout.rdf No Jira: Update DOAP to the most recent release Oct 10, 2016
pom.xml MAHOUT-1894 Add Support for Spark 2.x closes apache/mahout#271 Feb 24, 2017
runtests.sh MAHOUT-1885: Inital commit of VCL bindings. closes apache/mahout#269 Jan 26, 2017

README.md

Welcome to Apache Mahout!

The Apache Mahout™ project's goal is to build an environment for quickly creating scalable performant machine learning applications.

For additional information about Mahout, visit the Mahout Home Page

Setting up your Environment

Whether you are using Mahout's Shell, running command line jobs or using it as a library to build your own apps you'll need to setup several environment variables. Edit your environment in ~/.bash_profile for Mac or ~/.bashrc for many linux distributions. Add the following

export MAHOUT_HOME=/path/to/mahout
export MAHOUT_LOCAL=true # for running standalone on your dev machine, 
# unset MAHOUT_LOCAL for running on a cluster

You will need a $JAVA_HOME, and if you are running on Spark, you will also need $SPARK_HOME

Note when running the spark-shell job it can help to set some JVM options so you don't run out of memory:

$MAHOUT_OPTS="-Xmx6g -XX:MaxPermSize=512m" mahout spark-shell

Using Mahout as a Library

Running any application that uses Mahout will require installing a binary or source version and setting the environment. To compile from source:

  • mvn -DskipTests clean install
  • To run tests do mvn test
  • To set up your IDE, do mvn eclipse:eclipse or mvn idea:idea

To use maven, add the appropriate setting to your pom.xml or build.sbt following the template below.

To use the Samsara environment you'll need to include both the engine neutral math-scala dependency:

<dependency>
    <groupId>org.apache.mahout</groupId>
    <artifactId>mahout-math-scala_2.10</artifactId>
    <version>${mahout.version}</version>
</dependency>

and a dependency for back end engine translation, e.g:

<dependency>
    <groupId>org.apache.mahout</groupId>
    <artifactId>mahout-spark_2.10</artifactId>
    <version>${mahout.version}</version>
</dependency>
<dependency>
    <groupId>org.apache.mahout</groupId>
    <artifactId>mahout-flink_2.10</artifactId>
    <version>${mahout.version}</version>
</dependency>

Examples

For examples of how to use Mahout, see the examples directory located in examples/bin

For information on how to contribute, visit the How to Contribute Page

Legal

Please see the NOTICE.txt included in this directory for more information.

Build Status Coverage Status