# P2. Investigate a Dataset

## Introduction
<img style="float: center;" src="img/logo.png">

This project explores a provided dataset. I chose [Sean Lahman's baseball database](http://www.seanlahman.com/baseball-archive/statistics/) as the data source.


##  Questions for the data

At this point I can't ask any interesting questions of the data yet, because:
*  I don't yet know exactly what kind of data is available, how it is organized, which parts can be used, etc.
*  I am not well versed in the domain either (American baseball is not really a thing in Europe)

So I'll start by exploring the data and building up some context about it in parallel; hopefully the relevant questions will arise along the way.

##  Data exploration

Let's start with a top-down view of the data, and then drill into the details.

In [1]:
# initial setup/imports
%matplotlib inline

import os 
import IPython.core.display as disp
import matplotlib as mpl
import numpy as np
import pandas as pd
import seaborn as sns

# custom notebook styling 
with open('custom.css', 'r') as f:
    style = disp.HTML("""<style type="text/css">%s</style>"""%(f.read()))
style

###  General structure

The download is an archive with a set of CSV files and a couple of text files (readme*.txt). After unpacking them into the "data" folder, the CSV files are:

In [2]:
dir_root = "data/"
files = [f for f in os.listdir(dir_root) if f.endswith('.csv')]

# strip the ".csv" extension to get the table names
print("Tables: %s" % ", ".join(f.split(".")[0] for f in files))

Tables: AllstarFull, Appearances, AwardsManagers, AwardsPlayers, AwardsShareManagers, AwardsSharePlayers, Batting, BattingPost, CollegePlaying, Fielding, FieldingOF, FieldingPost, HallOfFame, HomeGames, Managers, ManagersHalf, Master, Parks, Pitching, PitchingPost, Salaries, Schools, SeriesPost, Teams, TeamsFranchises, TeamsHalf


This is the list of available data "tables". The same folder also contains a text file ("readme2014.txt") with a very short description of each table and its columns. It's a good start.
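For reference, the unpacking step mentioned above could look roughly like this. It is only a minimal sketch using the standard library, and it assumes the download is a flat zip archive (no nested folders). The function name `unpack_archive` is my own, not something from the dataset.

```python
import os
import zipfile


def unpack_archive(archive_path, target_dir):
    """Extract a zip archive into target_dir and return the CSV file names found there.

    Assumption: the archive is flat, i.e. the CSV files sit at its top level.
    """
    if not os.path.isdir(target_dir):
        os.makedirs(target_dir)
    with zipfile.ZipFile(archive_path) as archive:
        archive.extractall(target_dir)
    # only top-level CSV files are reported, matching the listing cell above
    return sorted(f for f in os.listdir(target_dir) if f.endswith(".csv"))
```

Calling `unpack_archive("lahman-csv.zip", "data/")` (the archive name here is hypothetical; use whatever the downloaded file is actually called) would leave the CSV files where the listing cell expects them.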


In [3]:
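# load every CSV file into a DataFrame keyed by table name
# (os, pandas and the display helper are already imported in the setup cell)
tables = {}

for f in files:
    name = f.split('.')[0]
    tables[name] = pd.read_csv(dir_root + f)

# print each table's columns, followed by its summary statistics
for name, t in tables.items():
    print("[%s]: %s" % (name, ", ".join(t.columns)))
    disp.display(t.describe())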

[ManagersHalf]: playerID, yearID, teamID, lgID, inseason, half, G, W, L, rank


Unnamed: 0,yearID,inseason,half,G,W,L,rank
count,93.0,93.0,93.0,93.0,93.0,93.0,93.0
mean,1947.505376,1.387097,1.483871,49.784946,24.645161,24.645161,5.16129
std,43.351351,0.752276,0.502448,19.150916,12.2187,9.389686,3.194051
min,1892.0,1.0,1.0,2.0,0.0,2.0,1.0
25%,1892.0,1.0,1.0,47.0,16.0,21.0,3.0
50%,1981.0,1.0,1.0,53.0,25.0,25.0,5.0
75%,1981.0,2.0,2.0,57.0,31.0,30.0,7.0
max,1981.0,5.0,2.0,80.0,53.0,46.0,12.0


[BattingPost]: yearID, round, playerID, teamID, lgID, G, AB, R, H, 2B, 3B, HR, RBI, SB, CS, BB, SO, IBB, HBP, SH, SF, GIDP


Unnamed: 0,yearID,G,AB,R,H,2B,3B,HR,RBI,SB,CS,BB,SO,IBB,HBP,SH,SF,GIDP
count,11690.0,11690.0,11690.0,11690.0,11690.0,11690.0,11690.0,11690.0,11690.0,11690.0,11489.0,11690.0,11690.0,10651.0,10453.0,10446.0,10442.0,10485.0
mean,1983.04166,3.277844,8.84089,1.054149,2.150984,0.375192,0.055518,0.218392,0.977844,0.169718,0.07938,0.832678,1.67083,0.096517,0.072611,0.126843,0.058609,0.175203
std,30.878498,1.930691,8.981517,1.591499,2.704839,0.739625,0.251774,0.562878,1.647508,0.608723,0.313297,1.358261,2.000072,0.378132,0.288822,0.401163,0.253721,0.450651
min,1884.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
25%,1971.0,2.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
50%,1995.0,3.0,5.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0
75%,2006.0,5.0,16.0,2.0,4.0,1.0,0.0,0.0,1.0,0.0,0.0,1.0,3.0,0.0,0.0,0.0,0.0,0.0
max,2015.0,15.0,66.0,13.0,21.0,6.0,4.0,6.0,13.0,15.0,5.0,13.0,13.0,7.0,4.0,5.0,3.0,5.0


[CollegePlaying]: playerID, schoolID, yearID


Unnamed: 0,yearID
count,17350.0
mean,1969.490259
std,32.783259
min,1864.0
25%,1949.0
50%,1981.0
75%,1995.0
max,2014.0


[Parks]: park.key, park.name, park.alias, city, state, country


Unnamed: 0,park.key,park.name,park.alias,city,state,country
count,250,250,57,250,249,250
unique,250,240,56,83,34,6
top,NYC15,Athletic Park,Federal League Park,Philadelphia,NY,US
freq,1,4,2,14,40,242


[AwardsPlayers]: playerID, awardID, yearID, lgID, tie, notes


Unnamed: 0,yearID
count,6078.0
mean,1968.459033
std,30.567689
min,1877.0
25%,1942.0
50%,1974.0
75%,1995.0
max,2015.0


[Master]: playerID, birthYear, birthMonth, birthDay, birthCountry, birthState, birthCity, deathYear, deathMonth, deathDay, deathCountry, deathState, deathCity, nameFirst, nameLast, nameGiven, weight, height, bats, throws, debut, finalGame, retroID, bbrefID


Unnamed: 0,birthYear,birthMonth,birthDay,deathYear,deathMonth,deathDay,weight,height
count,18703.0,18531.0,18382.0,9336.0,9335.0,9334.0,17975.0,18041.0
mean,1930.664118,6.627327,15.60902,1963.850364,6.484092,15.570281,185.980862,72.25564
std,41.229079,3.46711,8.748942,31.506369,3.528685,8.77858,21.226988,2.598983
min,1820.0,1.0,1.0,1872.0,1.0,1.0,65.0,43.0
25%,1894.0,4.0,8.0,1942.0,3.0,8.0,170.0,71.0
50%,1936.0,7.0,16.0,1966.0,6.0,15.0,185.0,72.0
75%,1968.0,10.0,23.0,1989.0,10.0,23.0,200.0,74.0
max,1995.0,12.0,31.0,2016.0,12.0,31.0,320.0,83.0


[FieldingOF]: playerID, yearID, stint, Glf, Gcf, Grf


Unnamed: 0,yearID,stint,Glf,Gcf,Grf
count,12028.0,12028.0,11991.0,11991.0,11985.0
mean,1912.736448,1.086548,15.740639,15.574598,15.755695
std,23.72365,0.306644,33.289793,34.59955,33.078331
min,1871.0,1.0,0.0,0.0,0.0
25%,1891.0,1.0,0.0,0.0,0.0
50%,1912.0,1.0,1.0,1.0,1.0
75%,1933.0,1.0,11.0,8.0,11.0
max,1955.0,5.0,156.0,162.0,160.0


[Pitching]: playerID, yearID, stint, teamID, lgID, W, L, G, GS, CG, SHO, SV, IPouts, H, ER, HR, BB, SO, BAOpp, ERA, IBB, WP, HBP, BK, BFP, GF, R, SH, SF, GIDP


Unnamed: 0,yearID,stint,W,L,G,GS,CG,SHO,SV,IPouts,...,IBB,WP,HBP,BK,BFP,GF,R,SH,SF,GIDP
count,44139.0,44139.0,44139.0,44139.0,44139.0,44139.0,44139.0,44139.0,44139.0,44138.0,...,29564.0,44006.0,43580.0,44139.0,43900.0,44006.0,44139.0,11239.0,11239.0,745.0
mean,1967.786493,1.07925,4.748794,4.748771,23.667142,9.55255,3.207979,0.45615,1.503976,255.673886,...,2.447064,2.534836,2.271111,0.303881,345.551572,6.355611,43.332291,2.207759,1.908088,4.844295
std,37.352599,0.28443,5.837989,5.00708,18.4629,12.312479,7.134955,1.11821,4.971535,258.428826,...,2.792671,3.438515,3.008115,0.759298,350.259188,10.003279,43.437952,2.751917,2.143002,5.524863
min,1871.0,1.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
25%,1940.0,1.0,0.0,1.0,7.0,0.0,0.0,0.0,0.0,50.0,...,0.0,0.0,0.0,0.0,64.0,0.0,11.0,0.0,0.0,1.0
50%,1977.0,1.0,2.0,3.0,22.0,3.0,0.0,0.0,0.0,169.0,...,2.0,1.0,1.0,0.0,229.0,3.0,29.0,1.0,1.0,3.0
75%,2000.0,1.0,7.0,8.0,35.0,18.0,3.0,0.0,1.0,397.0,...,4.0,4.0,3.0,0.0,540.0,8.0,68.0,3.0,3.0,7.0
max,2015.0,4.0,59.0,48.0,106.0,75.0,75.0,16.0,62.0,2040.0,...,23.0,63.0,41.0,16.0,2906.0,84.0,519.0,21.0,14.0,36.0


[AllstarFull]: playerID, yearID, gameNum, gameID, teamID, lgID, GP, startingPos


Unnamed: 0,yearID,gameNum,GP,startingPos
count,5069.0,5069.0,5050.0,1580.0
mean,1976.433024,0.136911,0.778218,5.037975
std,23.693503,0.461412,0.415486,2.653486
min,1933.0,0.0,0.0,0.0
25%,1958.0,0.0,1.0,3.0
50%,1976.0,0.0,1.0,5.0
75%,1998.0,0.0,1.0,7.0
max,2015.0,2.0,1.0,10.0


[HomeGames]: year.key, league.key, team.key, park.key, span.first, span.last, games, openings, attendance


Unnamed: 0,year.key,games,openings,attendance
count,2944.0,2944.0,2944.0,2944.0
mean,1952.110054,70.804008,48.941236,1077794.0
std,42.433247,19.765014,33.301467,1032963.0
min,1871.0,1.0,0.0,0.0
25%,1915.0,74.0,7.0,48366.0
50%,1959.0,78.0,66.0,874752.5
75%,1990.0,81.0,79.0,1805209.0
max,2014.0,89.0,83.0,4483203.0


[Appearances]: yearID, teamID, lgID, playerID, G_all, GS, G_batting, G_defense, G_p, G_c, G_1b, G_2b, G_3b, G_ss, G_lf, G_cf, G_rf, G_of, G_dh, G_ph, G_pr


Unnamed: 0,yearID,G_all,GS,G_batting,G_defense,G_p,G_c,G_1b,G_2b,G_3b,G_ss,G_lf,G_cf,G_rf,G_of,G_dh,G_ph,G_pr
count,100951.0,100748.0,49030.0,100951.0,100748.0,100951.0,100951.0,100951.0,100951.0,100951.0,100951.0,100951.0,100951.0,100951.0,100951.0,49233.0,49233.0,42945.0
mean,1963.352755,51.604081,34.329023,48.68249,48.84355,10.349189,4.766025,4.614724,4.597894,4.613615,4.587562,4.864003,4.595219,4.726798,13.881289,1.923832,4.551987,0.962696
std,38.585051,47.260338,47.514144,48.949621,46.090187,16.970473,19.002137,20.784987,20.550055,20.400925,21.217885,19.117279,20.2754,19.382664,34.853954,10.621366,9.476957,2.78277
min,1871.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
25%,1933.0,13.0,0.0,7.0,11.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
50%,1972.0,35.0,9.0,31.0,33.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
75%,1997.0,81.0,52.0,81.0,74.0,17.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,1.0,0.0,5.0,1.0
max,2015.0,165.0,163.0,165.0,165.0,106.0,160.0,162.0,163.0,164.0,165.0,163.0,162.0,162.0,164.0,162.0,95.0,92.0


[HallOfFame]: playerID, yearid, votedBy, ballots, needed, votes, inducted, category, needed_note


Unnamed: 0,yearid,ballots,needed,votes
count,4120.0,3927.0,3770.0,3927.0
mean,1968.889563,320.705373,243.98992,50.995926
std,22.899162,125.495156,94.557016,84.845195
min,1936.0,78.0,59.0,0.0
25%,1950.0,226.0,175.0,2.0
50%,1964.0,274.0,213.0,10.0
75%,1987.0,425.0,321.0,64.0
max,2016.0,581.0,436.0,555.0


[FieldingPost]: playerID, yearID, teamID, lgID, round, POS, G, GS, InnOuts, PO, A, E, DP, TP, PB, SB, CS


Unnamed: 0,yearID,G,GS,InnOuts,PO,A,E,DP,TP,PB,SB,CS
count,12311.0,12311.0,11924.0,11990.0,12311.0,12311.0,12311.0,12311.0,12311.0,1351.0,5555.0,5555.0
mean,1986.404922,2.963935,2.170748,58.023353,6.443506,2.458208,0.189505,0.532776,8.1e-05,0.143597,0.580378,0.314851
std,27.247863,1.844349,2.237781,57.560177,11.287227,4.628965,0.525923,1.267366,0.009013,0.421762,1.278113,0.777604
min,1903.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
25%,1975.0,1.0,0.0,10.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
50%,1996.0,3.0,1.0,30.0,1.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0
75%,2006.0,4.0,4.0,105.0,8.0,2.0,0.0,0.0,0.0,0.0,1.0,0.0
max,2015.0,11.0,8.0,222.0,91.0,33.0,8.0,13.0,1.0,3.0,16.0,10.0


[SeriesPost]: yearID, round, teamIDwinner, lgIDwinner, teamIDloser, lgIDloser, wins, losses, ties


Unnamed: 0,yearID,wins,losses,ties
count,307.0,307.0,307.0,307.0
mean,1981.374593,3.570033,1.420195,0.009772
std,32.681865,0.782246,1.106716,0.09853
min,1884.0,1.0,0.0,0.0
25%,1970.0,3.0,0.0,0.0
50%,1995.0,4.0,1.0,0.0
75%,2006.0,4.0,2.0,0.0
max,2015.0,10.0,5.0,1.0


[Schools]: schoolID, name_full, city, state, country


Unnamed: 0,schoolID,name_full,city,state,country
count,1207,1207,1207,1207,1207
unique,1207,1199,856,49,1
top,mercer,Butler County Community College,Chicago,CA,USA
freq,1,2,9,136,1207


[AwardsManagers]: playerID, awardID, yearID, lgID, tie, notes


Unnamed: 0,yearID,notes
count,177.0,0.0
mean,1988.615819,
std,20.788693,
min,1936.0,
25%,1980.0,
50%,1994.0,
75%,2004.0,
max,2015.0,


[Teams]: yearID, lgID, teamID, franchID, divID, Rank, G, Ghome, W, L, DivWin, WCWin, LgWin, WSWin, R, AB, H, 2B, 3B, HR, BB, SO, SB, CS, HBP, SF, RA, ER, ERA, CG, SHO, SV, IPouts, HA, HRA, BBA, SOA, E, DP, FP, name, park, attendance, BPF, PPF, teamIDBR, teamIDlahman45, teamIDretro


Unnamed: 0,yearID,Rank,G,Ghome,W,L,R,AB,H,2B,...,HA,HRA,BBA,SOA,E,DP,FP,attendance,BPF,PPF
count,2805.0,2805.0,2805.0,2406.0,2805.0,2805.0,2805.0,2805.0,2805.0,2805.0,...,2805.0,2805.0,2805.0,2805.0,2805.0,2488.0,2805.0,2526.0,2805.0,2805.0
mean,1955.03672,4.107308,150.34795,78.465919,74.74902,74.74902,681.945811,5142.492335,1346.27344,227.624955,...,1346.083779,101.136542,474.010695,731.229234,186.337255,140.186495,0.961519,1344346.0,100.199643,100.225668
std,41.519083,2.323414,23.22725,4.698684,17.640402,17.378079,135.738244,750.551691,219.891603,58.692602,...,219.521064,58.245002,131.890032,296.409881,107.657444,29.322764,0.030224,946931.6,4.882215,4.814985
min,1871.0,1.0,6.0,44.0,0.0,4.0,24.0,211.0,33.0,3.0,...,49.0,0.0,0.0,0.0,47.0,18.0,0.76,6088.0,60.0,60.0
25%,1919.0,2.0,153.0,77.0,66.0,65.0,613.0,5127.0,1299.0,193.0,...,1288.0,46.0,427.0,501.0,116.0,126.0,0.96,528716.2,97.0,97.0
50%,1963.0,4.0,157.0,81.0,77.0,76.0,690.0,5389.0,1393.0,231.0,...,1392.0,109.0,494.0,735.0,145.0,145.0,0.97,1140348.0,100.0,100.0
75%,1992.0,6.0,162.0,81.0,87.0,87.0,763.0,5517.0,1467.0,270.0,...,1470.0,148.0,555.0,965.0,217.0,159.25,0.98,2014687.0,103.0,103.0
max,2015.0,13.0,165.0,84.0,116.0,134.0,1220.0,5781.0,1783.0,376.0,...,1993.0,241.0,827.0,1450.0,639.0,217.0,0.991,4483350.0,129.0,141.0


[AwardsShareManagers]: awardID, yearID, lgID, playerID, pointsWon, pointsMax, votesFirst


Unnamed: 0,yearID,pointsWon,pointsMax,votesFirst
count,414.0,414.0,414.0,414.0
mean,1999.777778,39.879227,141.932367,4.543478
std,9.283738,41.598739,19.191283,7.060568
min,1983.0,1.0,24.0,0.0
25%,1992.0,4.0,140.0,0.0
50%,2000.0,22.0,140.0,1.0
75%,2008.0,69.75,150.0,6.0
max,2015.0,154.0,160.0,30.0


[Salaries]: yearID, teamID, lgID, playerID, salary


Unnamed: 0,yearID,salary
count,25575.0,25575.0
mean,2000.374389,2008563.0
std,8.610604,3315706.0
min,1985.0,0.0
25%,1993.0,275000.0
50%,2000.0,550000.0
75%,2008.0,2250000.0
max,2015.0,33000000.0


[PitchingPost]: playerID, yearID, round, teamID, lgID, W, L, G, GS, CG, SHO, SV, IPouts, H, ER, HR, BB, SO, BAOpp, ERA, IBB, WP, HBP, BK, BFP, GF, R, SH, SF, GIDP


Unnamed: 0,yearID,W,L,G,GS,CG,SHO,SV,IPouts,H,...,IBB,WP,HBP,BK,BFP,GF,R,SH,SF,GIDP
count,5109.0,5109.0,5109.0,5109.0,5109.0,5109.0,5109.0,5109.0,5109.0,5109.0,...,5059.0,5059.0,5059.0,4884.0,5059.0,5109.0,5109.0,4398.0,4398.0,5059.0
mean,1987.565473,0.299667,0.299667,1.911529,0.601683,0.128597,0.027794,0.109219,16.149149,4.921707,...,0.213679,0.146867,0.160901,0.013514,22.026092,0.473087,2.413192,0.290359,0.132788,0.382882
std,27.590228,0.561805,0.533933,1.011292,0.811819,0.480739,0.172533,0.420001,15.375949,4.783624,...,0.498319,0.415658,0.418301,0.115471,18.74947,0.838524,2.791809,0.634862,0.378057,0.693607
min,1884.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,...,0.0,0.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0
25%,1977.0,0.0,0.0,1.0,0.0,0.0,0.0,0.0,6.0,2.0,...,0.0,0.0,0.0,0.0,8.0,0.0,0.0,0.0,0.0,0.0
50%,1998.0,0.0,0.0,2.0,0.0,0.0,0.0,0.0,12.0,4.0,...,0.0,0.0,0.0,0.0,17.0,0.0,2.0,0.0,0.0,0.0
75%,2007.0,1.0,1.0,2.0,1.0,0.0,0.0,0.0,21.0,7.0,...,0.0,0.0,0.0,0.0,29.0,1.0,4.0,0.0,0.0,1.0
max,2015.0,4.0,4.0,8.0,8.0,8.0,3.0,4.0,213.0,64.0,...,4.0,5.0,3.0,1.0,178.0,6.0,36.0,7.0,4.0,6.0


[AwardsSharePlayers]: awardID, yearID, lgID, playerID, pointsWon, pointsMax, votesFirst


Unnamed: 0,yearID,pointsWon,pointsMax,votesFirst
count,6795.0,6795.0,6795.0,6437.0
mean,1971.923032,43.347609,266.87844,1.61799
std,27.449771,67.958756,128.989358,4.888965
min,1911.0,0.0,16.0,0.0
25%,1950.0,4.0,140.0,0.0
50%,1974.0,12.0,336.0,0.0
75%,1995.0,52.0,336.0,0.0
max,2015.0,448.0,448.0,32.0


[Fielding]: playerID, yearID, stint, teamID, lgID, POS, G, GS, InnOuts, PO, A, E, DP, PB, WP, SB, CS, ZR


Unnamed: 0,yearID,stint,G,GS,InnOuts,PO,A,E,DP,PB,WP,SB,CS,ZR
count,170526.0,170526.0,170526.0,75849.0,102313.0,156409.0,156408.0,156407.0,156408.0,11116.0,4189.0,6024.0,6024.0,4189.0
mean,1966.517123,1.077818,33.651854,26.930823,708.231134,79.463496,30.826198,3.512093,6.349017,5.167326,11.772977,24.909031,12.066899,0.703032
std,38.550401,0.283581,41.117359,40.790139,1061.42381,176.463601,75.011893,7.53445,18.388112,9.116445,12.249974,25.816811,13.222888,1.394882
min,1871.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
25%,1941.0,1.0,4.0,1.0,54.0,2.0,1.0,0.0,0.0,0.0,2.0,4.0,1.0,0.0
50%,1977.0,1.0,16.0,7.0,220.0,10.0,5.0,1.0,1.0,2.0,8.0,17.0,8.0,0.0
75%,1998.0,1.0,45.0,32.0,754.0,72.0,22.0,3.0,3.0,6.0,19.0,39.0,19.0,1.0
max,2015.0,5.0,165.0,164.0,4469.0,1846.0,641.0,119.0,194.0,105.0,69.0,155.0,89.0,15.0


[TeamsFranchises]: franchID, franchName, active, NAassoc


Unnamed: 0,franchID,franchName,active,NAassoc
count,120,120,95,12
unique,120,99,2,12
top,CKK,Washington Nationals,N,NYU
freq,1,5,65,1


[TeamsHalf]: yearID, lgID, teamID, Half, divID, DivWin, Rank, G, W, L


Unnamed: 0,yearID,Half,Rank,G,W,L
count,52.0,52.0,52.0,52.0,52.0,52.0
mean,1981.0,1.5,3.692308,53.423077,26.711538,26.711538
std,0.0,0.504878,1.84219,2.872084,5.333345,5.184201
min,1981.0,1.0,1.0,48.0,15.0,20.0
25%,1981.0,1.0,2.0,52.0,23.0,23.0
50%,1981.0,1.5,4.0,53.0,27.0,26.0
75%,1981.0,2.0,5.0,56.0,31.0,29.0
max,1981.0,2.0,7.0,60.0,37.0,42.0


[Managers]: playerID, yearID, teamID, lgID, inseason, G, W, L, rank, plyrMgr


Unnamed: 0,yearID,inseason,G,W,L,rank
count,3405.0,3405.0,3405.0,3405.0,3405.0,3404.0
mean,1953.661087,1.234949,123.872834,61.578855,61.587372,4.351645
std,41.852866,0.598186,50.501189,28.789993,26.456165,2.400171
min,1871.0,1.0,1.0,0.0,0.0,1.0
25%,1917.0,1.0,91.0,42.0,45.0,2.0
50%,1962.0,1.0,154.0,70.0,68.0,4.0
75%,1990.0,1.0,162.0,84.0,81.0,6.0
max,2015.0,9.0,165.0,116.0,120.0,12.0


[Batting]: playerID, yearID, stint, teamID, lgID, G, AB, R, H, 2B, 3B, HR, RBI, SB, CS, BB, SO, IBB, HBP, SH, SF, GIDP


Unnamed: 0,yearID,stint,G,AB,R,H,2B,3B,HR,RBI,SB,CS,BB,SO,IBB,HBP,SH,SF,GIDP
count,101332.0,101332.0,101332.0,96183.0,96183.0,96183.0,96183.0,96183.0,96183.0,95759.0,94883.0,72729.0,96183.0,88345.0,59620.0,93373.0,89845.0,60151.0,70075.0
mean,1963.506533,1.077567,51.400111,149.970327,19.887038,39.261647,6.637067,1.373361,2.949305,17.965163,3.158184,1.324025,13.811484,21.629849,1.213234,1.113395,2.4579,1.150122,3.210032
std,38.628278,0.283676,47.145273,186.557072,28.671365,53.310941,9.801563,2.710547,6.409662,26.756514,7.922994,2.838196,21.092775,28.432978,2.894918,2.32066,4.347818,2.023981,4.835881
min,1871.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0
25%,1933.0,1.0,13.0,7.0,0.0,1.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,2.0,0.0,0.0,0.0,0.0,0.0
50%,1972.0,1.0,34.0,57.0,5.0,11.0,1.0,0.0,0.0,4.0,0.0,0.0,3.0,10.0,0.0,0.0,1.0,0.0,1.0
75%,1997.0,1.0,80.25,251.0,30.0,63.0,10.0,2.0,3.0,27.0,2.0,1.0,20.0,30.0,1.0,1.0,3.0,2.0,5.0
max,2015.0,5.0,165.0,716.0,192.0,262.0,67.0,36.0,73.0,191.0,138.0,42.0,232.0,223.0,120.0,51.0,67.0,19.0,36.0
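
The per-table `describe()` output above is long, and the `count` rows already hint that many columns have gaps (in Batting, for example, `AB` has 96183 values against 101332 for `yearID`). A more compact top-down view, with one row per table, could be sketched as follows. The helper name `summarize_tables` and the `missing_share` column are my own illustration, not part of the dataset or of pandas. The sketch only assumes a dict of DataFrames like `tables` above.

```python
import pandas as pd


def summarize_tables(tables):
    """Return one row per table: row count, column count and share of missing cells."""
    rows = []
    for name, df in sorted(tables.items()):
        n_cells = df.shape[0] * df.shape[1]
        # isnull() marks NaN/None cells; summing twice counts them over the whole table
        missing = int(df.isnull().sum().sum())
        rows.append({
            "table": name,
            "rows": df.shape[0],
            "columns": df.shape[1],
            "missing_share": float(missing) / n_cells if n_cells else 0.0,
        })
    return pd.DataFrame(rows, columns=["table", "rows", "columns", "missing_share"])
```

In the notebook this would be called as `summarize_tables(tables)`. Sorting the result by `missing_share` should point at the tables that need the most care before any analysis.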


In [4]:
tables["BattingPost"]

Unnamed: 0,yearID,round,playerID,teamID,lgID,G,AB,R,H,2B,...,RBI,SB,CS,BB,SO,IBB,HBP,SH,SF,GIDP
0,1884,WS,becanbu01,NY4,AA,1,2,0,1,0,...,0,0,,0,0,0.0,,,,
1,1884,WS,bradyst01,NY4,AA,3,10,1,0,0,...,0,0,,0,1,0.0,,,,
2,1884,WS,esterdu01,NY4,AA,3,10,0,3,1,...,0,1,,0,3,0.0,,,,
3,1884,WS,forstto01,NY4,AA,1,3,0,0,0,...,0,0,,0,1,0.0,,,,
4,1884,WS,keefeti01,NY4,AA,2,5,0,1,0,...,0,0,,0,4,0.0,,,,
5,1884,WS,kenneed01,NY4,AA,3,7,0,0,0,...,0,0,,0,2,0.0,,,,
6,1884,WS,nelsoca01,NY4,AA,3,10,0,1,0,...,0,0,,0,1,0.0,,,,
7,1884,WS,orrda01,NY4,AA,3,9,0,1,0,...,0,0,,0,0,0.0,,,,
8,1884,WS,reipsch01,NY4,AA,2,5,1,0,0,...,0,0,,0,1,0.0,,,,
9,1884,WS,rosemch01,NY4,AA,3,9,1,3,0,...,1,0,,0,1,0.0,,,,
