Join GitHub today
GitHub is home to over 28 million developers working together to host and review code, manage projects, and build software together.Sign up
Date/time value is out of range in the Uber 2014 data #3
When I import the Uber data for 2014 I get an error message that the date/time field value is out of range for the value
It looks as though this value exists in a number of records:
$ grep -n "4/13/2014 0:01:00" data/uber-raw-data-apr14.csv
Just about every other month in 2014 for the Uber data is raising the same issue.
Apologies for jumping in to this without sufficient context, I was reading @marklit's blog post but haven't tried reproducing this myself.
@marklit Is it just the 13th that you get errors for, no other days of the month? It looks suspiciously like it's trying to import m/d/y dates as d/m/y. If it's using the machine's locale that may explain why it fails in London and works in NYC. Apologies if it's too basic a question, it's just that you haven't explicitly addressed it yet.
It's importing without any complaints now. Thanks @toddwschneider.
Here's my steps in testing out 0f08601:
This was all done on a fresh Ubuntu 15 installation.
$ sudo apt-get install postgresql-9.4-postgis-2.1 postgis git unzip $ git clone https://github.com/toddwschneider/nyc-taxi-data.git $ cd nyc-taxi-data $ git checkout 0f08601 $ vi minimal_downloads.txt
$ cat minimal_downloads.txt | \ xargs -n 1 -P 6 wget -P data/ & $ sudo su - postgres -c "psql -c 'CREATE USER mark; ALTER USER mark WITH SUPERUSER;'" $ ./initialize_database.sh $ ./import_uber_trip_data.sh
Relevant output from