Skip to content

Analyzes past torunament results to show an average tournament based on seeding

Notifications You must be signed in to change notification settings

benb116/March-Madness-Analyzer

Repository files navigation

BenB116's March Madness Analyzer

#####Updated with 2016 Madness

This is a script I wrote that analyzes the results of each NCAA Men's Basketball Tournament since 1985 (the first year that the tournament was expanded to 64 teams). The script only pays attention to the seed of each team and how far they get in the tournament. From the historical data, which I got using CBS Sports Year-by-Year History, I came up with averages that can be used to predict future tournaments.

Inside the folder "Raw Data" is, well, the raw data. There are two text files for each year since 1985: the first, labeled "raw", is a list of the teams that won a game in the tournament. The first 32 lines are from the First Round, the next 16 are from the Second Round, the following 8 are from the Sweet Sixteen, and so on. The second file, labeled "key", is a list of every team in the tournament that year and their seed.

The script works by counting the number of times teams with a specific seed win in each round of each tournament. The numbers are added and each divided by 30 (the number of tournaments since 1985) to find the average number of wins per seed in each round of a given tournament.

Also in this repo are tools to generate full brackets in Excel based on the historical data. The bracket script determines the relative likelihoods of two seeds winning in the same round and randomly chooses a team based on the weighted chances.

###Here are ten facts and tips based on the data:

  1. A #16 seed has never beaten a #1 seed.
  2. A #9 seed is slightly more likely to beat an #8 seed than vice versa.
  3. Florida Gulf Coast was the only #15 seed to make it to the Sweet Sixteen (2013).
  4. An #8 or #9 seed is not likely to get past the Second Round (probably because they have to beat a #1 seed to do it).
  5. That being said, it is equally likely that 3 #1 seeds and 4 #1 seeds advance to the Sweet Sixteen.
  6. Knock out at least 1 #2 seed in the Second Round of your bracket, and maybe even leave only two #3 seeds standing.
  7. Of teams with seeds lower than 8, #10 seeds and #12 seeds are the most likely to advance to the Sweet Sixteen.
  8. A #2 seed winning the championship and a #3 seed winning the championship are equally likely.
  9. Best Final Four seeds? 1, 1, 2, 3 or 1, 1, 2, 4.
  10. The championship has never been won by a #9 seed or lower

Here are the results from 1985 to 2014.

Round 1  {120, 113, 102, 95, 76, 79, 73, 59, 61, 47, 41, 44, 25, 18, 7, 0}

(120 out of 120 #1 seeds won in the first round, 113 out of 120 #2 seeds won in the first round...)

Round 2  {104, 77, 61, 55, 39, 40, 20, 11, 5, 22, 17, 20, 6, 2, 1, 0}
Round 3  {82, 56, 30, 19, 8, 13, 8, 8, 2, 7, 6, 1, 0, 0, 0, 0}
Round 4  {48, 26, 14, 13, 6, 3, 1, 5, 1, 0, 3, 0, 0, 0, 0, 0}
Round 5  {27, 12, 9, 3, 3, 2, 1, 3, 0, 0, 0, 0, 0, 0, 0, 0}
Round 6  {18, 4, 4, 1, 0, 1, 1, 1, 0, 0, 0, 0, 0, 0, 0, 0}

Percentages:

{100.0, 94.17, 85.0, 79.17, 63.33, 65.83, 60.83, 49.17, 50.83, 39.17, 34.17, 36.67, 20.83, 15.0, 5.83, 0.0}

(100% of #1 seed won in the first round, 94% of #2 seeds won in the first round...)

{86.67, 64.17, 50.83, 45.83, 32.5, 33.33, 16.67, 9.17, 4.17, 18.33, 14.17, 16.67, 5.0, 1.67, 0.83, 0.0}
{68.33, 46.67, 25.0, 15.83, 6.67, 10.83, 6.67, 6.67, 1.67, 5.83, 5.0, 0.83, 0.0, 0.0, 0.0, 0.0}
{40.0, 21.67, 11.67, 10.83, 5.0, 2.5, 0.83, 4.17, 0.83, 0.0, 2.5, 0.0, 0.0, 0.0, 0.0, 0.0}
{22.5, 10.0, 7.5, 2.5, 2.5, 1.67, 0.83, 2.5, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0}
{15.0, 3.33, 3.33, 0.83, 0.0, 0.83, 0.83, 0.83, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0, 0.0}

About

Analyzes past torunament results to show an average tournament based on seeding

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published