Snowplow v0.8.8

alexanderdean released this 04 Aug 21:54

· 4786 commits to master since this release

Adding Postgres support, re-adding HiveQL support, and also adding support for multiple storage targets.

Plus plenty of small improvements, bug fixes and simplifications.

Blog post

JavaScript Tracker

Moved into own repo (#277)

Hadoop ETL

Bumped to 0.3.3
URL-decodes "%3D" to "=" to allow Hive-style directory names as arguments (#305)
Bumped referer-parser to 0.1.1 to fix java.lang.NullPointerException (#314)

EmrEtlRunner

Bumped to 0.4.0
Bumped Sluice to 0.0.7 (#299)
Removed :snowplow: section from config.yml.sample (#289)
Simplified EmrEtlRunner and its config (#287)
Added run= to timestamped ETL folder names (#294)
Updated "Jobflow started" stdout message to include jobflow ID (#315)

Hive ETL

Removed folder 3-enrich/hive-etl as no longer supported (#286)

Hive storage

Updated hive-storage scripts to work with current Redshift-format flatfile (#290)

Infobright storage

Rremoved folder 4-storage/infobright as not currently supported (#285)

Postgres storage

Added Postgres table definition in atomic schema (#160)

StorageLoader

Bumped to 0.1.0
Bumped Sluice 0.0.7 (#300)
Removed code to delete Hive ETL's empty event files (#306)
Fixed bug where download path has to be set (even when using Redshift) (#280)
Optimized ANALYZE and VACUUM commands (#283)
Added MAXERROR as StorageLoader configuration value for Redshift (#273)
Added support for loading Postgres (#161)
Removed Infobright loading capability (#307)
Added support for loading into multiple storage targets (#311)

Assets 2