A simple machine learned tagger trained on BOSS web / delicious data
Python
Switch branches/tags
Nothing to show
Fetching latest commit…
Cannot retrieve the latest commit at this time.
Permalink
Failed to load latest commit information.
README
autosvm.py
classify.py
conf.py
crawl_delicious.py
featurize.py
gen_training_test_set.py
libsvm-2.89.tar.gz
tags.txt
test_data.txt
training_data.txt
vector_data.cpickle

README

@author: Vik Singh (viksi@yahoo-inc.com)

A simple BOSS example for Yahoo! Hack Day NYC
A machine learned tagger trained on BOSS web / delicious data

Read this if you want to learn more and especially check out the caveats section if
you're planning to use this code for more practical purposes

http://zooie.wordpress.com/2009/10/09/build-an-automatic-tagger-in-200-lines-with-boss/


# Install libsvm

tar -xzvf libsvm-2.89.tar.gz
cd libsvm-2.89
make

cd ..

# Optional: Crawl fresh delicious data via BOSS (a previous crawl already included)

python crawl_delicious.py

# Generate a binary training set via two tags (pick from tags.txt)

python gen_training_test_set.py microsoft google

# Learn from the resulting training_data.txt and predict on test_data.txt

python autosvm.py training_set.txt test_set.txt

# Prints out the accuracy of the learner and saves model + prediction files in timestamped folder