Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Report: median number of sites a user logs into with Persona #1

Closed
nmalkin opened this issue Jun 29, 2012 · 2 comments
Closed

Report: median number of sites a user logs into with Persona #1

nmalkin opened this issue Jun 29, 2012 · 2 comments

Comments

@nmalkin
Copy link
Owner

nmalkin commented Jun 29, 2012

from Bugzilla #746231

KPI Dashboard's first Key Performance Indicator report will be:

Median # of sites a user logs into w Persona

Description: How well are we doing on Persona ID uptake? On average, how many sites does our user interact with over the day? This value should climb over time and stabilize at a healthy value.

A report should have the following common features:

  • Show current value and related information
  • Allow Segmenting across these axis:
    • buckets by number of emails
    • language code
    • buckets by screen size (mobile/tablet/desktop)
  • Express this number over time, so changes can be correlated with releases

Common features will show up in other KPI reports

@nmalkin
Copy link
Owner Author

nmalkin commented Jun 29, 2012

This is mostly implemented as of 0b9da9a.

The dashboard retrieves data, calculates the median number_sites_logged_in for each day and displays that over time.

Controls allow segmentation by OS, browser, and locale.

@nmalkin
Copy link
Owner Author

nmalkin commented Jul 17, 2012

Since this report's functionality has been implemented for a while and is largely stable, I'm closing this issue. New issues should be opened for any changes or improvements.

@nmalkin nmalkin closed this as completed Jul 17, 2012
nmalkin added a commit that referenced this issue Jul 18, 2012
The report is also converted to display the mean number of sites logged in
instead of the median (closes #29).

The reason behind it (from #29):

Report #1 is the median number of sites a user logs into with Persona.

As part of migrating to CouchDB as the backend (#27), finding the median
of the data series becomes a significantly harder technical challenge.
(To do it in a map/reduce framework requires a quick-select algorithm,
which there doesn't seem to be a good way to do in CouchDB.)

Alternately, the median value for each day could be precalculated when
data arrives and then stored in the database. However, this would
require either a new database (cumbersome) or a change to the data
format and code of the current one (very undesirable).

Calculating the mean of the dataset, however, is much easier.

While the median is a more sensible value to look at (it is less
sensitive to outliers), it has been agreed, before, that this entire
report is not hugely meaningful. The median value itself doesn't really
say anything. The only way we'd use it is to watch the number and hope
it trends up. In that case, however, the mean is just about as good: we
can look at it and watch its trend.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant