Development for baseball databank, an Open Data collection of historical baseball data
Latest commit 526d80f Jul 17, 2018

README.txt

Baseball Databank is a compilation of historical baseball data in a
convenient, tidy format, distributed under Open Data terms.

This work is licensed under a Creative Commons Attribution-ShareAlike
3.0 Unported License.  For details see:
http://creativecommons.org/licenses/by-sa/3.0/

Organisation of the files
-------------------------

There are two directories in the repository.

* 'core' contains the databank itself.  If you are a user of the data, these are the
  files you need.
* 'upstream' contains files used to construct the databank.
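
As an illustration of using the files in 'core', the following is a minimal
Python sketch that reads one table into memory.  It is not part of the
Databank itself.  The function name load_table is hypothetical, and the
sketch assumes that each table is a UTF-8 encoded CSV file with a header row,
stored directly under 'core' as <table name>.csv.

```python
import csv
from pathlib import Path


def load_table(core_dir, table_name):
    """Read one CSV table into a list of dicts keyed by column header.

    Assumptions (not stated in this README): the file is named
    <table_name>.csv, sits directly under core_dir, is UTF-8 encoded,
    and has a header row.
    """
    path = Path(core_dir) / f"{table_name}.csv"
    # newline="" lets the csv module handle line endings inside quoted fields
    with path.open(newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))
```

For example, if the repository were cloned into a directory named
baseballdatabank, load_table("baseballdatabank/core", "Parks") would return
one dict per row of Parks.csv.  All values are returned as strings; any
conversion to numbers or dates is left to the caller.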

Most of the data in the Databank are provided by Chadwick Baseball Bureau
(http://www.chadwick-bureau.com).  The data differ from the data the Bureau provides
to its clients in that they contain less detail, are updated less frequently,
and are provided on an as-is basis.


Other sources
-------------

The Databank is historically based in part on the Lahman Baseball Database, 
version 2015-01-24, which is Copyright (C) 1996-2015 by Sean Lahman.

The tables Parks.csv and HomeGames.csv are based on the game logs
and park code table published by Retrosheet.
This information is available free of charge from and is copyrighted
by Retrosheet.  Interested parties may contact Retrosheet at 
http://www.retrosheet.org.


Queries and suggested revisions
-------------------------------

Queries and suggested revisions to the data can be posted in the issue tracker at
https://github.com/chadwickbureau/baseballdatabank/issues.

Files in 'core' are all generated by scripts.  As such they are not edited manually
(and therefore pull requests should not be submitted against these files).

Files in 'upstream' are manually maintained files which contain information specific
to constructing the Databank.  As they are maintained manually, it is valid to submit
pull requests containing corrections or additions to these files.

Data which do not originate from the 'upstream' files are data maintained by
Chadwick Baseball Bureau.  While enquiries regarding these data are welcomed,
remember that these data are updated with some lag, and therefore may differ from
data which appear elsewhere on the Internet or in other sources.