Setting up the Scala Stream Collector

Ben Fradet edited this page Mar 19, 2018 · 6 revisions

HOME » SNOWPLOW SETUP GUIDE » Step 1: setup a Collector » Setting up the Scala Stream Collector

Overview of the Scala Stream Collector

The Scala Stream Collector allows near-real-time processing (Enrichment, Storage, Analytics) of a Snowplow raw event stream. Snowplow raw events can be sunk to either Amazon Kinesis, Google PubSub, Apache Kafka, NSQ or to stdout for a custom stream collection process.

For more information on the architecture of the Scala Stream Collector, please see Scala Stream Collector.

Contents

Setting up the Scala Stream Collector is a 3 step process:

  1. Install the Scala Stream Collector
  2. Configure the Scala Stream Collector
  3. Run the Scala Stream Collector

Note: If you're running on AWS, we recommend running all Snowplow AWS operations through an IAM user with the bare minimum permissions required to run Snowplow. Please see our IAM user setup page for more information on doing this.

Clone this wiki locally
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.
Press h to open a hovercard with more details.