dwillis / fumblerooski

college football app

http://www.fumblerooski.org/

fixed coaching links 
Derek Willis (author)
Thu Jan 28 09:22:05 -0800 2010
commit  f7fc5fe1ec3ab8e2b0a6ad2b5722ff6179506f97
tree    42f81097c7e3245e43198d6625e211d8400b86b6
parent  144443d5e155be16e0a2aa1ca790b0cb696b7448
fumblerooski /
name age message
file README.rst
file __init__.py Wed Mar 05 11:00:39 -0800 2008 first commit [Derek Willis]
directory api/ Wed Jan 06 12:59:51 -0800 2010 roll back college year api [dwillis]
directory blog/
directory college/
file feeds.py Thu Jan 07 19:19:27 -0800 2010 another try at feeds [Derek Willis]
file ncaa_log.txt Wed Mar 05 11:00:39 -0800 2008 first commit [Derek Willis]
directory rankings/
directory scrapers/
directory templates/ Thu Jan 28 09:22:05 -0800 2010 fixed coaching links [Derek Willis]
file urls.py
file utils.py Tue Jan 26 18:35:12 -0800 2010 added coaching staff utility [Derek Willis]
README.rst

Fumblerooski

Fumblerooski is a college football statistics application written in Python using the Django framework, based on data provided by the NCAA and editor input. Most statistics are from 2000 onward, although some teams have game scores dating back to 1988.

Requirements

  • Python 2.5+ (lower versions may work but are untested)
  • Django 1.1+ (aggregates support required)
  • BeautifulSoup (HTML and XML parsing)
  • django-googlecharts (http://github.com/jacobian/django-googlecharts)

Overview

Fumblerooski uses the NCAA's statistics site (http://web1.ncaa.org/mfb/mainpage.jsp?year=2009) as its base: nearly everything game- and player-related derives from information parsed or scraped from this site. The NCAA provides game information in XML, but most other elements, including rosters, drives and rankings, are scraped using BeautifulSoup. Coaching information is based on an Excel file provided by the NCAA, supplemented by data entry for assistant coaches and coaching history back to 2000.
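As a rough illustration of the XML side of this, the sketch below reads a date and two scores out of a single game file using only the standard library. The element and attribute names (game, team, side, score), the sample data and the parse_game function are all made up for the example; the real NCAA layout is not described in this README, and the code is written for current Python 3 rather than 2.5.

```python
import xml.etree.ElementTree as ET

# Hypothetical game file: the tag and attribute names are assumptions,
# not the NCAA's actual XML schema, and the teams are placeholders.
SAMPLE_GAME_XML = """
<game date="2009-09-05">
  <team name="Visitor U" side="visitor" score="14"/>
  <team name="Home State" side="home" score="21"/>
</game>
"""


def parse_game(xml_text):
    """Return the game date plus a (name, score) tuple for each side."""
    root = ET.fromstring(xml_text.strip())
    result = {"date": root.get("date")}
    for team in root.findall("team"):
        # Key each team by its side ("home" or "visitor").
        result[team.get("side")] = (team.get("name"), int(team.get("score")))
    return result


game = parse_game(SAMPLE_GAME_XML)
print(game["visitor"], game["home"])
```

In a real loader the returned dict would presumably be turned into model instances rather than printed.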

Structure

Fumblerooski is divided into several apps or app-like structures. The main one is the college app, which contains most of the models and views, including those related to teams, games, coaches and players. The rankings and scrapers are split into their own directories as well, along with small API and blog apps.

Loaders

Information from the NCAA is scraped using the files in scrapers/, which are currently tightly coupled to the rest of the app, so running the loaders independently of the college app will not produce the desired effect (and will produce lots of errors). The NCAA produces an XML file for each completed game, but the remainder of the information used by Fumblerooski, including schedules, rosters and rankings, is parsed using BeautifulSoup.
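To give a feel for the table scraping involved, here is a minimal sketch that turns an HTML roster table into a list of dicts. It deliberately swaps BeautifulSoup for the standard library's html.parser so it runs without extra dependencies; the TableRows class, the parse_roster function, the column names and the sample rows are all hypothetical and do not come from the scrapers/ directory.

```python
from html.parser import HTMLParser


class TableRows(HTMLParser):
    """Collect the text of every <td>/<th> cell, grouped by <tr>."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None outside a row
        self._cell = None  # text fragments of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        # Text outside a cell (such as whitespace between tags) is ignored.
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self.rows.append(self._row)
            self._row = None
            self._cell = None


# Placeholder markup: the real NCAA roster pages will look different.
SAMPLE_ROSTER_HTML = """
<table>
  <tr><th>No.</th><th>Name</th><th>Pos</th></tr>
  <tr><td>12</td><td>Player One</td><td>QB</td></tr>
  <tr><td>34</td><td>Player Two</td><td>RB</td></tr>
</table>
"""


def parse_roster(html):
    """Use the first row as column names and map each later row to a dict."""
    parser = TableRows()
    parser.feed(html)
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


roster = parse_roster(SAMPLE_ROSTER_HTML)
```

With BeautifulSoup the same idea would be expressed by finding the table rows and reading the text of each cell; the shape of the result would be the same.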

The main scraper library has three functions: full_load, full_nostats_load and partial_loader. Because of the need to scrape the HTML tables for an entire season, each of the loaders loops through each team marked to be updated (updated=True) and parses the schedule/results information. The full_load also creates drive and player stats, but those can take 12 to 15 hours after the end of the game to become available, so a full_nostats_load only records the score. Normally the full_load can be run on a Sunday afternoon to capture all of Saturday's games. It's also recommended that the load_roster scraper be run after a full_load, as it updates the number of games a player has played in. Rankings also tend to be updated on Sundays, so the team and player rankings can be run then, too.
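The control flow described above can be sketched without Django or any network access. Everything in this block is a stand-in: Team, LoadLog, fetch_results and load are hypothetical names, and the single with_stats flag only mimics the difference between a full load and a no-stats load. It is not the code in scrapers/.

```python
from dataclasses import dataclass, field


@dataclass
class Team:
    name: str
    updated: bool = False  # mirrors the updated=True flag mentioned above


@dataclass
class LoadLog:
    scores: list = field(default_factory=list)  # games whose score was recorded
    stats: list = field(default_factory=list)   # teams whose stats were loaded


def fetch_results(team):
    """Stand-in for scraping one team's schedule/results page."""
    return [{"team": team.name, "opponent": "Opponent", "score": "0-0"}]


def load(teams, with_stats):
    """Loop over flagged teams; record scores, and stats only if requested."""
    log = LoadLog()
    for team in teams:
        if not team.updated:
            continue  # only teams marked for update are scraped
        for game in fetch_results(team):
            log.scores.append(game)
            if with_stats:
                # A full load would also create drive and player stats here.
                log.stats.append(game["team"])
    return log


teams = [Team("Team A", updated=True), Team("Team B"), Team("Team C", updated=True)]
full = load(teams, with_stats=True)       # analogous to a full load
nostats = load(teams, with_stats=False)   # analogous to a no-stats load
```

Keeping the score-only path separate is what lets a load run shortly after games end, with the stats filled in by a later full load.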

Utilities

The code includes several generic utilities that are used within the app but depend on the scrapers having populated the database. These include functions to generate and update CollegeYear instances, identify conference games and populate Week instances.
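As one example of what such a utility might do, the sketch below flags a conference game under the assumption that a game counts as one when both teams belong to the same conference in that season. The function name, the plain-dict lookup and the team and conference names are illustrative only; the actual utility works against the database and may apply different rules.

```python
def is_conference_game(team1, team2, conferences):
    """Return True if both teams share a conference in the given season.

    `conferences` maps team name -> conference name for one season;
    teams missing from the mapping are treated as having no conference.
    """
    conf1 = conferences.get(team1)
    conf2 = conferences.get(team2)
    return conf1 is not None and conf1 == conf2


# Placeholder season data, not real conference membership.
season = {"Team A": "East", "Team B": "East", "Team C": "West"}

print(is_conference_game("Team A", "Team B", season))  # same conference
print(is_conference_game("Team A", "Team C", season))  # different conferences
```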
