Skip to content

HTTPS clone URL

Subversion checkout URL

You can clone with HTTPS or Subversion.

Download ZIP
A simple machine learned tagger trained on BOSS web / delicious data
branch: master

Fetching latest commit…

Cannot retrieve the latest commit at this time

Failed to load latest commit information.
README
autosvm.py
classify.py
conf.py
crawl_delicious.py
featurize.py
gen_training_test_set.py
libsvm-2.89.tar.gz
tags.txt
test_data.txt
training_data.txt
vector_data.cpickle

README

@author: Vik Singh (viksi@yahoo-inc.com)

A simple BOSS example for Yahoo! Hack Day NYC
A machine learned tagger trained on BOSS web / delicious data

Read this if you want to learn more and especially check out the caveats section if
you're planning to use this code for more practical purposes

http://zooie.wordpress.com/2009/10/09/build-an-automatic-tagger-in-200-lines-with-boss/


# Install libsvm

tar -xzvf libsvm-2.89.tar.gz
cd libsvm-2.89
make

cd ..

# Optional: Crawl fresh delicious data via BOSS (a previous crawl already included)

python crawl_delicious.py

# Generate a binary training set via two tags (pick from tags.txt)

python gen_training_test_set.py microsoft google

# Learn from the resulting training_data.txt and predict on test_data.txt

python autosvm.py training_set.txt test_set.txt

# Prints out the accuracy of the learner and saves model + prediction files in timestamped folder
Something went wrong with that request. Please try again.