Release 86 Petra
Brings in-batch synthetic deduplication and data-modeling improvements.
Common
- Add AWS credentials to .travis.yml (#2963)
- Add CI/CD for Scala Hadoop Enrich (#2982)
- Add CI/CD for Scala Hadoop Shred (#2928)
- Migrate Hadoop Event Recovery deployment to Release Manager (#2983)
- Remove short-hostname addon from travis.yml (#2674)
- Update script to sync us-east-2 (Ohio) Snowplow Hosted Assets bucket (#2986)
- Update script to sync ca-central-1 (Montreal) Snowplow Hosted Assets bucket (#3004)
- Update script to sync eu-west-2 (London) Snowplow Hosted Assets bucket (#3005)
- Use AWS environment variables to sync Snowplow Hosted Assets buckets (#2985)
Scala Hadoop Shred
- Bump to 0.10.0 (#2979)
- Add general top-level exception handling (#2071)
- Get the CustomPartitionSourceTest working with Hadoop 2.4 (#1960)
- Fix omitted string interpolation (#2562)
- Deduplicate event_ids with different event_fingerprints (synthetic duplicates) (#24)
- Stop catching fatal errors (#1456)
Data Modeling
- Add drill fields to web block (#2956)
- Resolve issues with web model (#2954)
- Restrict table scan on deduplication queries (#2929)
- Add web model (#2925)
- Delete example models (#2836)
- Remove outdated recipes (#2626)
EmrEtlRunner
- Update hadoop_shred version in config.yml.sample to 0.10.0 (#3003)