(409th place - Top 28%) Deep Learning using Tensorflow for the "New York City Taxi Fare Prediction" Kaggle competition.

dimitreOliveira/NewYorkCityTaxiFare

Deep Learning regression with Tensorflow

About the repository

The goal here is to use the TensorFlow API to build an end-to-end project, from data loading to model predictions, using the Kaggle "New York City Taxi Fare Prediction" competition as the data source.

What you will find

  • Data preprocessing and cleaning using the "csv" library. [link]
  • Data batch loading using TensorFlow. [link]
  • Preprocess each batch by time with TensorFlow. [link]
  • Deep learning models using the Estimator API from TensorFlow. [link]
  • TensorFlow Estimator API wrapping Keras models. [link]
  • Model validation and analysis using TensorBoard. [link]
  • Model prediction by batch with TensorFlow. [link]
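As an illustration of the first item, here is a minimal sketch of row-level cleaning with Python's standard "csv" module. It is not the repository's actual code: the column names follow the competition's CSV header, while the bounding box limits and the helper names is_valid and clean_rows are assumptions made for this example.

```python
import csv
import io

# Hypothetical rough bounding box around New York City (an assumption).
LON_RANGE = (-75.0, -72.0)
LAT_RANGE = (40.0, 42.0)

def is_valid(row):
    """Return True if the row has a positive fare and plausible coordinates."""
    try:
        fare = float(row["fare_amount"])
        lons = [float(row["pickup_longitude"]), float(row["dropoff_longitude"])]
        lats = [float(row["pickup_latitude"]), float(row["dropoff_latitude"])]
    except (KeyError, TypeError, ValueError):
        return False  # missing or non-numeric field
    if fare <= 0:
        return False
    return (all(LON_RANGE[0] <= x <= LON_RANGE[1] for x in lons)
            and all(LAT_RANGE[0] <= y <= LAT_RANGE[1] for y in lats))

def clean_rows(lines):
    """Yield only the valid rows from an iterable of CSV lines."""
    for row in csv.DictReader(lines):
        if is_valid(row):
            yield row

# Tiny in-memory sample standing in for the real training file.
sample = io.StringIO(
    "fare_amount,pickup_longitude,pickup_latitude,dropoff_longitude,dropoff_latitude\n"
    "4.5,-73.84,40.72,-73.84,40.71\n"   # plausible ride, kept
    "-2.0,-73.98,40.75,-73.99,40.76\n"  # negative fare, dropped
    "7.0,0.0,0.0,-73.99,40.76\n"        # zero pickup coordinates, dropped
    "9.0,-73.98,,-73.99,40.76\n"        # missing latitude, dropped
)
kept = list(clean_rows(sample))
print(len(kept))
```

Because clean_rows is a generator, the same function can stream a large file opened with open(path, newline="") without loading it into memory.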

New York City Taxi Fare Prediction

Can you predict a rider's taxi fare?

Kaggle competition: https://www.kaggle.com/c/new-york-city-taxi-fare-prediction

Overview

In this playground competition, hosted in partnership with Google Cloud and Coursera, you are tasked with predicting the fare amount (inclusive of tolls) for a taxi ride in New York City given the pickup and dropoff locations. While you can get a basic estimate based on just the distance between the two points, this will result in an RMSE of $5-$8, depending on the model used (see the starter code for an example of this approach in Kernels). Your challenge is to do better than this using Machine Learning techniques!

To learn how to handle large datasets with ease and solve this problem using TensorFlow, consider taking the Machine Learning with TensorFlow on Google Cloud Platform specialization on Coursera -- the taxi fare problem is one of several real-world problems that are used as case studies in the series of courses.
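The distance-only baseline mentioned in the overview can be sketched as follows. This is a hedged illustration, not the competition's starter code: it uses the haversine great-circle distance between pickup and dropoff, and the base and per_km coefficients in baseline_fare are made-up placeholders, not fitted values. The rmse helper computes the competition's evaluation metric.

```python
import math

EARTH_RADIUS_KM = 6371.0  # mean Earth radius

def haversine_km(lat1, lon1, lat2, lon2):
    """Great-circle distance in kilometres between two (lat, lon) points in degrees."""
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = (math.sin(dphi / 2) ** 2
         + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(a))

def baseline_fare(distance_km, base=2.5, per_km=1.6):
    """Linear fare estimate; base and per_km are hypothetical placeholders."""
    return base + per_km * distance_km

def rmse(predicted, actual):
    """Root mean squared error between two equal-length sequences."""
    squared = [(p - a) ** 2 for p, a in zip(predicted, actual)]
    return math.sqrt(sum(squared) / len(squared))

# One degree of longitude along the equator, as a sanity check.
one_degree = haversine_km(0.0, 0.0, 0.0, 1.0)
print(round(one_degree, 2))
```

In practice the two coefficients would be fitted on the training data (for example by least squares) before computing the RMSE on held-out rides.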

Dependencies:

To-Do:

  • Fix prediction by batch (e.g. estimator.predict() using input_fn=read_dataset() as parameter).
  • Efficiently use Keras models wrapped by the TensorFlow Estimator API.
