Skip to content
Sample recommender flow for search as recommendation
Find file
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Failed to load latest commit information.
data
src/main/pig
.gitignore
LICENSE
README.md
pom.xml

README.md

ponies

Sample recommender flow for search as recommendation

Download data files

for i in 0 1 2 3 4 5 6 7 8 9
do
    wget --no-check-certificate https://s3-us-west-1.amazonaws.com/mlt-lab-data-master/data/log-1M/log-000$i
done

Download music info files

wget --no-check-certificate https://s3-us-west-1.amazonaws.com/mlt-lab-data-master/data/music/albums/albums.tsv
wget --no-check-certificate https://s3-us-west-1.amazonaws.com/mlt-lab-data-master/data/music/artists/artists.tsv
wget --no-check-certificate https://s3-us-west-1.amazonaws.com/mlt-lab-data-master/data/music/tracks/tracks-2.tsv

Preprocess data using Pig

pig -p MUSIC_HOME=/mapr/my.cluster.com/user/mapr/data/ -f preprocess.pig

Run item similarity from Mahout

/opt/mapr/mahout/mahout-0.8/bin/mahout itemsimilarity -s SIMILARITY_LOGLIKELIHOOD -b -i /mapr/my.cluster.com/user/mapr/data/output/finish/ -o /mapr/my.cluster.com/user/mapr/data/similarities
Something went wrong with that request. Please try again.