Recommended infrastructure #23

Lazmonster · 2014-02-19T16:21:54Z

Hi I'd welcome some advice. We are new to Boomerang but are considering using it to monitor a client site with circa 82m Page Views per month traffic.
Do we need to supply our own infrastructure for data storage etc, or is there some out there for general use? If the latter, how is it paid for, and if the former grateful if somebody could recommend a spec for an environment to support this.
In other words, can we just deploy the tag and capture the results, or do we need to setup an infrastructure to do this?
By all means point me to any docs on the subject.
Many thanks

bluesmoon · 2014-02-19T17:04:49Z

You have a few options.

Use something like Google Analytics or Piwik. There are howtos on the web about using boomerang with both of these. For Piwik you'll have to have your own hardware, but the software is already built for you. I'm not sure if you'll get histograms and percentiles with these though.
Use a commercial service. My company (SOASTA) has a commercial service built around boomerang (mPulse), and many web/performance shops use our service for their clients. If you're interested in this option, send me an offline message since I'd rather not push a commercial service on the opensource forum. There are other companies that offer commercial services around boomerang as well, like Neustar, Keynote, and more.
Build your own. It could be as easy as post-hoc log processing (I think Howto-0 covers some of this), or writing a php or jsp, or some other web endpoint to receive the data and insert it into a database. 82MM b/m < 2000 beacons/minute. This is very easy to handle on a single web server, but I'd suggest using 2 just for redundancy.

Will write more as I think of it.

Lazmonster · 2014-02-19T17:46:43Z

Thanks!

bluesmoon · 2014-04-15T22:09:09Z

Just wanted to add another opensource project I found called boomcatch that handles the backend for boomerang: https://github.com/nature/boomcatch/ and http://cruft.io/posts/introducing-boomcatch/

andreas-marschke · 2014-11-30T03:06:34Z

Btw. I'm currently in the process of writing and roll-out planning for my own boomerang backend server boomerang-express

Most of the ruleset is pretty solid by now. It's designed to scale-up to a multi-tenant system but also capable to scale down to a single user that collects data for his sites.

Is capable of serializing everything from the headers over cookies to everything else that might come with a beacon of any kind.

It also has a "pluggable"/"replaceable" backend where I plan to integrate it at first only with a locally running NeDB (developer setup) but also scale to the point of multiple mongodb instances. Most of these things are configured using the beacon url in the frontend and the datastore in the backend.

It also actively works at preventing the beacon from being abused by scoring incoming beacons or requests to beacon urls on its referral URL and url-parameters.

Current working tree here is capable of running with NeDB and all loggin deferred to scale to approx 3K Req/s on a single cpu and single node with concurrency of 100 requests in 2 threads over 60s.

I used wrk for these measurements. They aren't necessairly complete but a viable first start for benchmarking.

nicjansma · 2016-05-02T14:22:43Z

Since there aren't any open questions in this Issue, I'm going to close it for now.

Lazmonster closed this as completed Feb 19, 2014

Lazmonster reopened this Feb 19, 2014

alexanderdean mentioned this issue Jun 17, 2014

Create Boomerang-JS Tracker bridge snowplow/snowplow-javascript-tracker#226

Closed

nicjansma closed this as completed May 2, 2016

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Recommended infrastructure #23

Recommended infrastructure #23

Lazmonster commented Feb 19, 2014

bluesmoon commented Feb 19, 2014

Lazmonster commented Feb 19, 2014

bluesmoon commented Apr 15, 2014

andreas-marschke commented Nov 30, 2014

nicjansma commented May 2, 2016

Recommended infrastructure #23

Recommended infrastructure #23

Comments

Lazmonster commented Feb 19, 2014

bluesmoon commented Feb 19, 2014

Lazmonster commented Feb 19, 2014

bluesmoon commented Apr 15, 2014

andreas-marschke commented Nov 30, 2014

nicjansma commented May 2, 2016