## eBird data

### From [About eBird](https://ebird.org/about)

>eBird is the world’s largest biodiversity-related citizen science project, with more than 100 million bird sightings contributed each year by eBirders around the world. A collaborative enterprise with hundreds of partner organizations, thousands of regional experts, and hundreds of thousands of users, eBird is managed by the Cornell Lab of Ornithology.

>eBird data document bird distribution, abundance, habitat use, and trends through checklist data collected within a simple, scientific framework. Birders enter when, where, and how they went birding, and then fill out a checklist of all the birds seen and heard during the outing. eBird’s free mobile app allows offline data collection anywhere in the world, and the website provides many ways to explore and summarize your data and other observations from the global eBird community. 

>eBird is available worldwide, comprising a network of local, national, and international partners. eBird directly collaborates with hundreds of partner groups for regional data entry portals, outreach, engagement, and local impact.

>Data quality is of critical importance. When entering sightings, observers are presented with a list of likely birds for that date and region. These checklist filters are developed by some of the most knowledgeable bird distribution experts in the world. When unusual birds are seen, or high counts are reported, the regional experts review these records.

### [Data access link](https://ebird.org/data/download) 

#### Notes: 
- You need to create an account to access data. 
- This notebook shows sample data available for immediate download. To access the full dataset, you need to submit a request.


#### There are two datasets (descriptions quoted from the download page):

>Basic Dataset (EBD):
>eBird dataset with all primary database fields. This includes checklist and observation data. Records that have not yet been reviewed can also be requested. This is the most complete and current dataset and recommended for most uses, including seasonal reports for journals.
>Updated Quarterly

>Reference Dataset (ERD):
>eBird dataset for the Western Hemisphere with observational and checklist data, zero-filled, and associated with a suite of landscape variables. This is intended for niche-modeling and other data-intensive science uses.
>Updated Annually

#### The code below reads the sample data files and displays the contents of the resulting pandas DataFrames:


In [22]:
# remove warnings
import warnings
warnings.filterwarnings('ignore')

#%matplotlib inline
import pandas as pd
pd.options.display.max_columns = 100
from matplotlib import pyplot as plt
import matplotlib
matplotlib.style.use('ggplot')
import numpy as np

basicdir = 'sample/basic/'
basicfile = 'ebd_US-AL-101_201801_201801_relMay-2018.txt'
IBAcodesfile = 'IBACodes.txt'
USFWSCodesfile = 'USFWSCodes.txt'
BCRcodesfile = 'BCRCodes.txt'

refdir = 'sample/reference/2016/'
refclfile = 'checklists-sample.csv'
refcvfile = 'extended-covariates-sample.csv'


In [23]:
basicdata = pd.read_csv(basicdir+basicfile, sep='\t') # NOTE: tab-delimited
print(basicdata.shape)
basicdata.head()

(1939, 47)


Unnamed: 0,GLOBAL UNIQUE IDENTIFIER,LAST EDITED DATE,TAXONOMIC ORDER,CATEGORY,COMMON NAME,SCIENTIFIC NAME,SUBSPECIES COMMON NAME,SUBSPECIES SCIENTIFIC NAME,OBSERVATION COUNT,BREEDING BIRD ATLAS CODE,BREEDING BIRD ATLAS CATEGORY,AGE/SEX,COUNTRY,COUNTRY CODE,STATE,STATE CODE,COUNTY,COUNTY CODE,IBA CODE,BCR CODE,USFWS CODE,ATLAS BLOCK,LOCALITY,LOCALITY ID,LOCALITY TYPE,LATITUDE,LONGITUDE,OBSERVATION DATE,TIME OBSERVATIONS STARTED,OBSERVER ID,SAMPLING EVENT IDENTIFIER,PROTOCOL TYPE,PROTOCOL CODE,PROJECT CODE,DURATION MINUTES,EFFORT DISTANCE KM,EFFORT AREA HA,NUMBER OBSERVERS,ALL SPECIES REPORTED,GROUP IDENTIFIER,HAS MEDIA,APPROVED,REVIEWED,REASON,TRIP COMMENTS,SPECIES COMMENTS,Unnamed: 46
0,URN:CornellLabOfOrnithology:EBIRD:OBS570779581,2018-01-24 14:41:34.0,19073.0,species,American Crow,Corvus brachyrhynchos,,,2,,,,United States,US,Alabama,US-AL,Montgomery,US-AL-101,,27,,,"Rudder Rd., Hope Hull",L982093,H,32.25315,-86.351219,2018-01-05,14:20:00,obsr231506,S42207073,Traveling,P22,EBIRD,15.0,1.609,,2,1,,0,1,0,,,,
1,URN:CornellLabOfOrnithology:EBIRD:OBS563756637,2018-01-03 22:54:39.0,19073.0,species,American Crow,Corvus brachyrhynchos,,,2,,,,United States,US,Alabama,US-AL,Montgomery,US-AL-101,,27,,,"Windham Rd., Pintlala",L665057,H,32.188361,-86.36816,2018-01-02,08:02:00,obsr665305,S41603999,Traveling,P22,EBIRD,18.0,1.706,,2,1,G2836759,0,1,0,,,,
2,URN:CornellLabOfOrnithology:EBIRD:OBS563333659,2018-01-03 22:54:39.0,19073.0,species,American Crow,Corvus brachyrhynchos,,,2,,,,United States,US,Alabama,US-AL,Montgomery,US-AL-101,,27,,,"Windham Rd., Pintlala",L665057,H,32.188361,-86.36816,2018-01-02,08:02:00,obsr130801,S41567526,Traveling,P22,EBIRD,18.0,1.706,,2,1,G2836759,0,1,0,,,,
3,URN:CornellLabOfOrnithology:EBIRD:OBS563148351,2018-01-09 21:46:14.0,19073.0,species,American Crow,Corvus brachyrhynchos,,,1,,,,United States,US,Alabama,US-AL,Montgomery,US-AL-101,,27,,,Bankhead Ave,L6654172,P,32.348862,-86.289228,2018-01-02,10:40:00,obsr130801,S41550556,Traveling,P22,EBIRD,23.0,2.189,,2,1,G2836779,0,1,0,,,,
4,URN:CornellLabOfOrnithology:EBIRD:OBS566089279,2018-01-10 16:17:09.0,19073.0,species,American Crow,Corvus brachyrhynchos,,,1,,,,United States,US,Alabama,US-AL,Montgomery,US-AL-101,,27,,,Folsom Building,L6698646,P,32.378946,-86.299321,2018-01-10,12:40:00,obsr228192,S41800795,Incidental,P20,EBIRD,,,,1,0,,0,1,0,,,,
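
Rows 1 and 2 in the output above carry the same `GROUP IDENTIFIER` (G2836759) but different `SAMPLING EVENT IDENTIFIER` and `OBSERVER ID` values, which suggests that a checklist shared between observers appears once per observer. A minimal sketch of one way to collapse such duplicates, assuming that keeping one row per shared checklist and species suits the analysis; the `checklist_id` column and the toy frame are illustrative and not part of the eBird files:

```python
import pandas as pd

# Toy frame with the relevant columns, copied from the first rows shown above.
obs = pd.DataFrame({
    "SAMPLING EVENT IDENTIFIER": ["S42207073", "S41603999", "S41567526", "S41550556"],
    "GROUP IDENTIFIER": [None, "G2836759", "G2836759", "G2836779"],
    "SCIENTIFIC NAME": ["Corvus brachyrhynchos"] * 4,
    "OBSERVATION COUNT": [2, 2, 2, 1],
})

# Hypothetical helper column: the group ID when the checklist is shared,
# otherwise the checklist's own sampling event ID.
obs["checklist_id"] = obs["GROUP IDENTIFIER"].fillna(obs["SAMPLING EVENT IDENTIFIER"])

# Keep the first row for each (checklist, species) pair.
unique_obs = obs.drop_duplicates(subset=["checklist_id", "SCIENTIFIC NAME"])

print(len(obs), "->", len(unique_obs))
```

Keeping the first copy is an arbitrary choice; if the copies of a shared checklist can differ, a rule for choosing between them would be needed.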


In [24]:
IBAcodedata = pd.read_csv(basicdir+IBAcodesfile, sep='\t', encoding = "latin1") # NOTE: tab-delimited
print(IBAcodedata.shape)
IBAcodedata.head()

(3681, 2)


|  | IBA_CODE | IBA NAME |
|---|---|---|
| 0 | MX_164-3 | CERROS ALREDEDOR DE SAN CRISTOBAL DE LAS CASAS... |
| 1 | MX_165 | LAGOS DE MONTEBELLO |
| 2 | MX_222 | LAGUNA DE CHACAHUA-PASTORIA |
| 3 | MX_168 | LA ENCRUCIJADA |
| 4 | MX_196 | LAGUNA PAMPA EL CABILDO |


In [25]:
USFWScodedata = pd.read_csv(basicdir+USFWSCodesfile, sep='\t', encoding = "latin1") # NOTE: tab-delimited
print(USFWScodedata.shape)
USFWScodedata.head()

(833, 2)


|  | USFWS CODE | USFWS NAME |
|---|---|---|
| USFWS_1 |  | FEATHERSTONE NATIONAL WILDLIFE REFUGE |
| USFWS_2 |  | ARCHIE CARR NATIONAL WILDLIFE REFUGE |
| USFWS_3 |  | BALD KNOB NATIONAL WILDLIFE REFUGE |
| USFWS_4 |  | GREEN CAY NATIONAL WILDLIFE REFUGE |
| USFWS_5 |  | SEAL BEACH NATIONAL WILDLIFE REFUGE |
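
In the output above, the codes (`USFWS_1`, ...) sit in the index position and the `USFWS CODE` column is empty. pandas produces this layout when the data rows have one more field than the header row, so a plausible cause is an extra tab between code and name in `USFWSCodes.txt`. That is an assumption, since the raw file is not shown here. Under that assumption, a minimal sketch that names the columns explicitly and drops the empty one (the in-memory sample and the `_empty` placeholder name are illustrative):

```python
import io
import pandas as pd

# Stand-in for USFWSCodes.txt, assuming two tabs between code and name.
raw = (
    "USFWS CODE\tUSFWS NAME\n"
    "USFWS_1\t\tFEATHERSTONE NATIONAL WILDLIFE REFUGE\n"
    "USFWS_2\t\tARCHIE CARR NATIONAL WILDLIFE REFUGE\n"
)

# Default parsing reproduces the symptom: the codes become the index.
default = pd.read_csv(io.StringIO(raw), sep="\t")

# Skip the header line and supply three column names instead,
# then discard the placeholder column created by the extra tab.
fixed = pd.read_csv(
    io.StringIO(raw),
    sep="\t",
    header=None,
    skiprows=1,
    names=["USFWS CODE", "_empty", "USFWS NAME"],
).drop(columns="_empty")

print(fixed)
```

If the real file has a different layout, inspecting its first few raw lines before choosing the column names would be the safer first step.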


In [26]:
BCRcodedata = pd.read_csv(basicdir+BCRcodesfile, sep='\t', encoding = "latin1") # NOTE: tab-delimited
print(BCRcodedata.shape)
BCRcodedata.head()

(66, 2)


|  | BCR_CODE | BCR_NAME |
|---|---|---|
| 0 | 1 | ALEUTIAN/BERING_SEA_ISLANDS |
| 1 | 2 | WESTERN_ALASKA |
| 2 | 3 | ARCTIC_PLAINS_AND_MOUNTAINS |
| 3 | 4 | NORTHWESTERN_INTERIOR_FOREST |
| 4 | 5 | NORTHERN_PACIFIC_RAINFOREST |
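
The code tables are lookups for the coded columns of the basic dataset (`IBA CODE`, `BCR CODE`, `USFWS CODE`). A minimal sketch of attaching region names to observations with a left join, using the five lookup rows shown above and a toy observations frame whose codes are made up for illustration:

```python
import pandas as pd

# Lookup rows copied from the BCR table above.
bcr = pd.DataFrame({
    "BCR_CODE": [1, 2, 3, 4, 5],
    "BCR_NAME": [
        "ALEUTIAN/BERING_SEA_ISLANDS",
        "WESTERN_ALASKA",
        "ARCTIC_PLAINS_AND_MOUNTAINS",
        "NORTHWESTERN_INTERIOR_FOREST",
        "NORTHERN_PACIFIC_RAINFOREST",
    ],
})

# Toy observations; the code values are illustrative, not real records.
obs = pd.DataFrame({
    "COMMON NAME": ["American Crow", "American Crow", "American Crow"],
    "BCR CODE": [5, 2, 27],
})

# A left join keeps every observation; `indicator=True` adds a `_merge`
# column that flags codes with no match in the lookup.
labelled = obs.merge(
    bcr, how="left", left_on="BCR CODE", right_on="BCR_CODE", indicator=True
)
print(labelled[["COMMON NAME", "BCR CODE", "BCR_NAME", "_merge"]])
```

Code 27 is unmatched here only because the toy lookup holds five of the 66 rows; against the full table, the `_merge` column is a quick check for codes that are genuinely missing.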


In [27]:
checklistdata = pd.read_csv(refdir+refclfile, encoding = "latin1")
print(checklistdata.shape)
checklistdata.head()


(99, 4670)


Unnamed: 0,SAMPLING_EVENT_ID,LOC_ID,LATITUDE,LONGITUDE,YEAR,MONTH,DAY,TIME,COUNTRY,STATE_PROVINCE,COUNTY,COUNT_TYPE,EFFORT_HRS,EFFORT_DISTANCE_KM,EFFORT_AREA_HA,OBSERVER_ID,NUMBER_OBSERVERS,GROUP_ID,PRIMARY_CHECKLIST_FLAG,Abeillia_abeillei,Aburria_aburri,Acanthidops_bairdi,Acanthis_flammea,Acanthis_hornemanni,Accipiter_bicolor,Accipiter_collaris,Accipiter_cooperii,Accipiter_gentilis,Accipiter_gundlachi,Accipiter_nisus,Accipiter_poliogaster,Accipiter_rufitorques,Accipiter_striatus,Accipiter_superciliosus,Acridotheres_cristatellus,Acridotheres_fuscus,Acridotheres_grandis,Acridotheres_tristis,Acrobatornis_fonsecai,Acrocephalus_aequinoctialis,Acrocephalus_atyphus,Acrocephalus_caffer,Acrocephalus_dumetorum,Acrocephalus_familiaris,Acrocephalus_kerearako,Acrocephalus_mendanae,Acrocephalus_percernis,Acrocephalus_rimitarae,Acrocephalus_schoenobaenus,Acrocephalus_taiti,...,Xiphorhynchus_pardalotus,Xiphorhynchus_spixii,Xiphorhynchus_susurrans,Xiphorhynchus_triangularis,Xolmis_cinereus,Xolmis_coronatus,Xolmis_dominicanus,Xolmis_irupero,Xolmis_pyrope,Xolmis_rubetra,Xolmis_salinarum,Xolmis_velatus,Zapornia_atra,Zapornia_tabuensis,Zaratornis_stresemanni,Zebrilus_undulatus,Zeledonia_coronata,Zenaida_asiatica,Zenaida_auriculata,Zenaida_aurita,Zenaida_galapagoensis,Zenaida_macroura,Zenaida_meloda,Zentrygon_albifacies,Zentrygon_carrikeri,Zentrygon_chiriquensis,Zentrygon_costaricensis,Zentrygon_frenata,Zentrygon_goldmani,Zentrygon_lawrencii,Zentrygon_linearis,Zimmerius_acer,Zimmerius_albigularis,Zimmerius_bolivianus,Zimmerius_chicomendesi,Zimmerius_chrysops,Zimmerius_cinereicapilla,Zimmerius_gracilipes,Zimmerius_vilissimus,Zimmerius_villarejoi,Zimmerius_viridiflavus,Zonotrichia_albicollis,Zonotrichia_atricapilla,Zonotrichia_capensis,Zonotrichia_leucophrys,Zonotrichia_querula,Zosterops_explorator,Zosterops_japonicus,Zosterops_lateralis,Zosterops_samoensis
0,S12317271,L1841231,38.074717,-122.194061,2016,1,23,9.0,United States,California,Solano,P22,2.0,11.265,0,obs348216,20,?,1,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0,0,0
1,S19116985,L4280191,39.75868,-75.837537,2016,2,45,15.25,United States,Pennsylvania,Chester,P21,0.25,0.0,0,obs528321,2,?,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
2,S21877588,L4247643,34.923801,-80.766442,2016,2,43,12.0,United States,North Carolina,Union,P21,1.0,0.0,0,obs242790,2,?,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,6,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,7,0,0,0,0,0,0,0,0
3,S22169735,L845398,35.080786,-106.671796,2016,3,64,6.83,United States,New Mexico,Bernalillo,P22,1.167,1.851,0,obs19508,1,?,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,6,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,1,0,0,0,0,0
4,S23384804,L2732230,39.01186,-114.19696,2016,5,130,7.52,United States,Nevada,White Pine,P21,3.0,0.0,0,obs2970,1,?,1,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,...,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0,0
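
The reference checklists are wide: one row per checklist and one zero-filled count column per species. The sample rows also suggest that `DAY` is a day of year (for example `MONTH` 2 with `DAY` 45) and that `TIME` is in decimal hours; both readings are inferred from the values above, not taken from eBird documentation. A minimal sketch, on a toy frame with two checklists and two species columns as read from the sample rows, of rebuilding a calendar date and reshaping to one row per checklist and species:

```python
import pandas as pd

wide = pd.DataFrame({
    "SAMPLING_EVENT_ID": ["S19116985", "S21877588"],
    "YEAR": [2016, 2016],
    "DAY": [45, 43],
    "Zenaida_macroura": [0, 6],
    "Zonotrichia_albicollis": [0, 7],
})

# Treat DAY as day of year: January 1 of YEAR plus (DAY - 1) days.
year_start = pd.to_datetime(wide["YEAR"].astype(str), format="%Y")
wide["DATE"] = year_start + pd.to_timedelta(wide["DAY"] - 1, unit="D")

# Reshape from one column per species to one row per checklist and species.
id_cols = ["SAMPLING_EVENT_ID", "YEAR", "DAY", "DATE"]
long = wide.melt(id_vars=id_cols, var_name="SPECIES", value_name="COUNT")

# Zero-filled counts give detection / non-detection directly.
long["DETECTED"] = long["COUNT"] > 0
print(long)
```

With the real file, `id_cols` would be the non-species columns at the start of the table (through `PRIMARY_CHECKLIST_FLAG`); the long table then has one row per checklist for each species column, so it grows quickly.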


In [28]:
extendedcovdata = pd.read_csv(refdir+refcvfile, encoding = "latin1")
print(extendedcovdata.shape)
extendedcovdata.head()



Unnamed: 0,SAMPLING_EVENT_ID,EASTNESS,ELEV,LOC_ID,MODISWATER_FS_C0_1500_ED,MODISWATER_FS_C0_1500_LPI,MODISWATER_FS_C0_1500_PD,MODISWATER_FS_C0_1500_PLAND,MODISWATER_FS_C1_1500_ED,MODISWATER_FS_C1_1500_LPI,MODISWATER_FS_C1_1500_PD,MODISWATER_FS_C1_1500_PLAND,MODISWATER_FS_C2_1500_ED,MODISWATER_FS_C2_1500_LPI,MODISWATER_FS_C2_1500_PD,MODISWATER_FS_C2_1500_PLAND,MODISWATER_FS_C3_1500_ED,MODISWATER_FS_C3_1500_LPI,MODISWATER_FS_C3_1500_PD,MODISWATER_FS_C3_1500_PLAND,MODISWATER_FS_C5_1500_ED,MODISWATER_FS_C5_1500_LPI,MODISWATER_FS_C5_1500_PD,MODISWATER_FS_C5_1500_PLAND,MODISWATER_FS_C6_1500_ED,MODISWATER_FS_C6_1500_LPI,MODISWATER_FS_C6_1500_PD,MODISWATER_FS_C6_1500_PLAND,MODISWATER_FS_C7_1500_ED,MODISWATER_FS_C7_1500_LPI,MODISWATER_FS_C7_1500_PD,MODISWATER_FS_C7_1500_PLAND,NORTHNESS,UMD_FS_C0_1500_ED,UMD_FS_C0_1500_LPI,UMD_FS_C0_1500_PD,UMD_FS_C0_1500_PLAND,UMD_FS_C1_1500_ED,UMD_FS_C1_1500_LPI,UMD_FS_C1_1500_PD,UMD_FS_C1_1500_PLAND,UMD_FS_C10_1500_ED,UMD_FS_C10_1500_LPI,UMD_FS_C10_1500_PD,UMD_FS_C10_1500_PLAND,UMD_FS_C12_1500_ED,UMD_FS_C12_1500_LPI,UMD_FS_C12_1500_PD,UMD_FS_C12_1500_PLAND,UMD_FS_C13_1500_ED,UMD_FS_C13_1500_LPI,UMD_FS_C13_1500_PD,UMD_FS_C13_1500_PLAND,UMD_FS_C16_1500_ED,UMD_FS_C16_1500_LPI,UMD_FS_C16_1500_PD,UMD_FS_C16_1500_PLAND,UMD_FS_C2_1500_ED,UMD_FS_C2_1500_LPI,UMD_FS_C2_1500_PD,UMD_FS_C2_1500_PLAND,UMD_FS_C3_1500_ED,UMD_FS_C3_1500_LPI,UMD_FS_C3_1500_PD,UMD_FS_C3_1500_PLAND,UMD_FS_C4_1500_ED,UMD_FS_C4_1500_LPI,UMD_FS_C4_1500_PD,UMD_FS_C4_1500_PLAND,UMD_FS_C5_1500_ED,UMD_FS_C5_1500_LPI,UMD_FS_C5_1500_PD,UMD_FS_C5_1500_PLAND,UMD_FS_C6_1500_ED,UMD_FS_C6_1500_LPI,UMD_FS_C6_1500_PD,UMD_FS_C6_1500_PLAND,UMD_FS_C7_1500_ED,UMD_FS_C7_1500_LPI,UMD_FS_C7_1500_PD,UMD_FS_C7_1500_PLAND,UMD_FS_C8_1500_ED,UMD_FS_C8_1500_LPI,UMD_FS_C8_1500_PD,UMD_FS_C8_1500_PLAND,UMD_FS_C9_1500_ED,UMD_FS_C9_1500_LPI,UMD_FS_C9_1500_PD,UMD_FS_C9_1500_PLAND,UMD_LANDCOVER,UMD_WATERCOVER,YEAR
0,S12317271,-0.027166,51.925701,L1841231,0.0,0.0,0.0,0.0,0.001439,0.583333,1.294044e-07,58.333333,0.000959,0.416667,1.294044e-07,41.666667,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0.024475,0.00024,0.027778,1.294044e-07,2.777778,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.00024,0.027778,1.294044e-07,2.777778,0.002518,0.472222,3.882132e-07,63.888889,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.00048,0.027778,2.588088e-07,5.555556,0.001439,0.166667,2.588088e-07,25.0,0.0,0.0,0.0,0.0,8,2,2016
1,S19116985,0.12223,115.293665,L4280191,0.0,0.0,0.0,0.0,0.001439,1.0,1.294044e-07,100.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,-0.016961,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.00048,0.027778,2.588088e-07,5.555556,0.002398,0.555556,1.294044e-07,55.555556,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.001199,0.138889,2.588088e-07,19.444444,0.00024,0.027778,1.294044e-07,2.777778,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.001319,0.055556,6.47022e-07,16.666667,0.0,0.0,0.0,0.0,12,1,2016
2,S21877588,-0.506603,183.307414,L4247643,0.0,0.0,0.0,0.0,0.001336,1.0,1.109181e-07,100.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0.099173,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.000308,0.047619,1.109181e-07,4.761905,0.000514,0.119048,1.109181e-07,11.904762,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.001233,0.190476,2.218361e-07,23.809524,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.002158,0.595238,1.109181e-07,59.52381,0.0,0.0,0.0,0.0,8,1,2016
3,S22169735,-0.021029,1511.336739,L845398,0.0,0.0,0.0,0.0,0.001233,1.0,9.507262e-08,100.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,-0.094055,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.001233,1.0,9.507262e-08,100.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,13,1,2016
4,S23384804,0.074151,1951.474841,L2732230,0.0,0.0,0.0,0.0,0.001439,0.888889,1.294044e-07,88.888889,0.000719,0.055556,2.588088e-07,11.111111,0.0,0.0,0.0,0.0,0,0,0,0,0,0,0,0,0,0,0,0,0.017241,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.001919,0.916667,1.294044e-07,91.666667,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.000719,0.027778,3.882132e-07,8.333333,0.0,0.0,0.0,0.0,0.0,0.0,0.0,0.0,10,1,2016
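
Both reference tables carry `SAMPLING_EVENT_ID`, `LOC_ID`, and `YEAR`, so the covariates can be attached to the checklists by joining on those columns. A minimal sketch on toy frames holding a few values from the sample rows above, assuming each checklist has exactly one covariate row (the `validate` argument makes pandas raise an error if that assumption fails):

```python
import pandas as pd

checklists = pd.DataFrame({
    "SAMPLING_EVENT_ID": ["S12317271", "S19116985"],
    "LOC_ID": ["L1841231", "L4280191"],
    "YEAR": [2016, 2016],
    "EFFORT_HRS": [2.0, 0.25],
})
covariates = pd.DataFrame({
    "SAMPLING_EVENT_ID": ["S12317271", "S19116985"],
    "LOC_ID": ["L1841231", "L4280191"],
    "YEAR": [2016, 2016],
    "ELEV": [51.925701, 115.293665],
    "UMD_LANDCOVER": [8, 12],
})

# Join on all shared columns so they are not duplicated with _x/_y suffixes.
shared = ["SAMPLING_EVENT_ID", "LOC_ID", "YEAR"]
model_table = checklists.merge(
    covariates, on=shared, how="inner", validate="one_to_one"
)
print(model_table)
```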
