Skip to content

Predicting total ride duration of taxi trips in New York City, the training set (contains 1458644 trip records)

License

Notifications You must be signed in to change notification settings

San13deep/NYC-Taxi-Trip-Time-Prediction

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

NYC-Taxi-Trip-Time-Prediction

Predicting total ride duration of taxi trips in New York City.

DOMAIN: Transportation

The dataset is based on the 2016 NYC Yellow Cab trip record data made available in Big Query on Google Cloud Platform. The data was originally published by the NYC Taxi and Limousine Commission (TLC). The data was sampled and cleaned for the purposes of this playground. Based on individual trip attributes, should predict the duration of each trip.

Source:

Details:

NYC Taxi Data.csv - the training set (contains 1458644 trip records).

Data fields

● id - a unique identifier for each trip

● vendor_id - a code indicating the provider associated with the trip record

● pickup_datetime - date and time when the meter was engaged

● dropoff_datetime - date and time when the meter was disengaged

● passenger_count - the number of passengers in the vehicle (driver entered value)

● pickup_longitude - the longitude where the meter was engaged

● pickup_latitude - the latitude where the meter was engaged

● dropoff_longitude - the longitude where the meter was disengaged

● dropoff_latitude - the latitude where the meter was disengaged

● store_and_fwd_flag - This flag indicates whether the trip record was held in vehicle memory before sending to the vendor because the vehicle did not have a connection to the server - Y=store and forward; N=not a store and forward trip

● trip_duration - duration of the trip in seconds

Objective:

Build a model that predicts the total trip duration of taxi trips in New York City.

Releases

No releases published

Packages

No packages published