Snowplow v0.8.8
Adds Postgres support, re-adds HiveQL support, and adds support for multiple storage targets.
Plus plenty of small improvements, bug fixes and simplifications.
JavaScript Tracker
- Moved into own repo (#277)
Hadoop ETL
- Bumped to 0.3.3
- URL-decodes "%3D" to "=" to allow Hive-style directory names as arguments (#305)
- Bumped referer-parser to 0.1.1 to fix java.lang.NullPointerException (#314)
EmrEtlRunner
- Bumped to 0.4.0
- Bumped Sluice to 0.0.7 (#299)
- Removed :snowplow: section from config.yml.sample (#289)
- Simplified EmrEtlRunner and its config (#287)
- Added run= to timestamped ETL folder names (#294)
- Updated "Jobflow started" stdout message to include jobflow ID (#315)
Hive ETL
- Removed folder 3-enrich/hive-etl as no longer supported (#286)
Hive storage
- Updated hive-storage scripts to work with current Redshift-format flatfile (#290)
Infobright storage
- Removed folder 4-storage/infobright as not currently supported (#285)
Postgres storage
- Added Postgres table definition in atomic schema (#160)
StorageLoader
- Bumped to 0.1.0
- Bumped Sluice to 0.0.7 (#300)
- Removed code to delete Hive ETL's empty event files (#306)
- Fixed bug where download path had to be set (even when using Redshift) (#280)
- Optimized ANALYZE and VACUUM commands (#283)
- Added MAXERROR as StorageLoader configuration value for Redshift (#273)
- Added support for loading Postgres (#161)
- Removed Infobright loading capability (#307)
- Added support for loading into multiple storage targets (#311)