A Python crawler that scans the tabulaturi.ro site for guitar tabs and then sorts them by rating
README.md
crawler.py
license.txt
parser.py
ranking.txt
songs.txt

tabulaturi.ro crawler

Tabulaturi is a well-known Romanian website with guitar tabs for songs, but it lacks one feature: a ranking of the best tabs.

So I wrote a crawler in Python that uses urllib2 and BeautifulSoup to parse each page of tabs and write the needed information to songs.txt.
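
To give a rough idea of the parsing step, here is a minimal sketch. It is not the code from crawler.py: it skips the download entirely, and it uses Python 3's standard `html.parser` module instead of urllib2 and BeautifulSoup so that it runs without extra packages. The table markup, the `TabRowParser` class and the tab-separated output format are all assumptions made for illustration; the real pages on tabulaturi.ro are likely structured differently.

```python
from html.parser import HTMLParser


class TabRowParser(HTMLParser):
    """Collects the text of <td> cells, grouped per <tr> row (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.rows = []         # finished rows, each a list of cell strings
        self._row = None       # cells of the row currently being read
        self._in_cell = False  # True while inside a <td>

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_endtag(self, tag):
        if tag == "td" and self._in_cell:
            self._in_cell = False
            self._row[-1] = self._row[-1].strip()
        elif tag == "tr" and self._row is not None:
            if self._row:
                self.rows.append(self._row)
            self._row = None

    def handle_data(self, data):
        # Only text inside a cell is kept; whitespace between tags is ignored.
        if self._in_cell:
            self._row[-1] += data


# Assumed page fragment: one row per tab with title, vote count and stars.
sample_html = """
<table>
<tr><td>Artist - Song A</td><td>12</td><td>4.5</td></tr>
<tr><td>Artist - Song B</td><td>30</td><td>3.0</td></tr>
</table>
"""

page_parser = TabRowParser()
page_parser.feed(sample_html)

# One tab-separated line per song, as it might be written to songs.txt.
lines = ["\t".join(row) for row in page_parser.rows]
for line in lines:
    print(line)
```

In a real crawler the `sample_html` string would be replaced by the body of each downloaded page, and the lines would be appended to songs.txt.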

Then parser.py takes that file, sorts the songs by the number of people who voted and then by the number of stars, and writes everything to ranking.txt.
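
The two-level sort can be sketched as follows. This is an illustration, not the contents of parser.py: the line format (`title<TAB>votes<TAB>stars`), the helper names `parse_line` and `rank`, and the descending order (most voters first) are assumptions.

```python
def parse_line(line):
    """Split one assumed 'title<TAB>votes<TAB>stars' line into typed fields."""
    title, votes, stars = line.rstrip("\n").split("\t")
    return title, int(votes), float(stars)


def rank(lines):
    """Sort songs by number of voters, then by stars, both descending."""
    songs = [parse_line(line) for line in lines if line.strip()]
    # The tuple key compares votes first and falls back to stars on ties.
    return sorted(songs, key=lambda song: (song[1], song[2]), reverse=True)


# Made-up sample data standing in for the contents of songs.txt.
sample = [
    "Song A\t12\t4.5\n",
    "Song B\t30\t3.0\n",
    "Song C\t12\t5.0\n",
]

ranking = rank(sample)
for title, votes, stars in ranking:
    print(f"{title}\t{votes}\t{stars}")
```

With the sample data, Song B comes first because it has the most voters, and Song C is placed ahead of Song A because the two tie on voters and Song C has more stars.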