Skip to content

rivatal/subway

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

subway_research:

Both .sh files let you get gtfs data and turnstile data. ##1: join_gtfs.R Joins together the various gtfs datasets and gives stations unique ids.

##2: Get turnstile names for matching: ####python getfiles.py | cut -d, -f4 | sort | uniq > turnstile_names.txt ####cat station_ids_trips.csv | cut -d, -f4 | sort | uniq > gtfs_names.txt

(After cat station_ids_trips.csv | cut -d, -f9 | sort | uniq | wc) There are 440 unique station ids in the gtfs data. There are 385 individual station names in the turnstile data && 377 in the gtfs Why such a disparity? -Different stations with the same name: 3- "Van Siclen Av"'s 2 "Wall St"'s

Test: station_names_ids_disparity_test.R

##3: Match up the names from the above files. match_names.py

##4: Small, more manual edits: smalleredits.awk

##5: Merge it up! (gtfs and turnstile data) merge.R

##6: Get average hourly entries, exits by station: load_subway_trips.R

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published