When I import the Uber data for 2014 I get an error message that the date/time field value is out of range for the value 4/13/2014 0:01:00.
Tue Feb 9 14:32:16 EET 2016: beginning load for data/uber-raw-data-apr14.csv
ERROR: date/time field value out of range: "4/13/2014 0:01:00"
HINT: Perhaps you need a different "datestyle" setting.
CONTEXT: COPY uber_trips_staging, line 15076, column pickup_datetime: "4/13/2014 0:01:00"
Tue Feb 9 14:32:16 EET 2016: finished raw load for data/uber-raw-data-apr14.csv
It looks as though this value exists in a number of records:
Apologies for jumping in to this without sufficient context, I was reading @marklit's blog post but haven't tried reproducing this myself.
@marklit Is it just the 13th that you get errors for, no other days of the month? It looks suspiciously like it's trying to import m/d/y dates as d/m/y. If it's using the machine's locale that may explain why it fails in London and works in NYC. Apologies if it's too basic a question, it's just that you haven't explicitly addressed it yet.
$ cat minimal_downloads.txt | \
xargs -n 1 -P 6 wget -P data/ &
$ sudo su - postgres -c "psql -c 'CREATE USER mark; ALTER USER mark WITH SUPERUSER;'"
Relevant output from import_uber_trip_data.sh:
Mon Feb 15 09:40:33 PST 2016: beginning load for data/uber-raw-data-apr14.csv
Mon Feb 15 09:40:35 PST 2016: finished raw load for data/uber-raw-data-apr14.csv
INSERT 0 564516
Mon Feb 15 09:40:47 PST 2016: loaded trips for data/uber-raw-data-apr14.csv