Skip to content
This repository has been archived by the owner on Mar 19, 2021. It is now read-only.

Total FxA Accounts #68

Closed
davismtl opened this issue Apr 3, 2017 · 18 comments
Closed

Total FxA Accounts #68

davismtl opened this issue Apr 3, 2017 · 18 comments
Assignees

Comments

@davismtl
Copy link

davismtl commented Apr 3, 2017

As a product manager, I should be able to know how many accounts have been created to date.

We currently don't have an easy way to easily measure the total number accounts that have been created to date.

As we work with marketing over 2017 to revive inactive FxA accounts, it would be nice to know how many accounts have ever been created.

It would also be nice to keep track of the proportion of active to active/inactive accounts.

As per how we roll-it up, we could have something like this (which doesn't load since it is too resource intensive in Re:Dash):
https://sql.telemetry.mozilla.org/queries/3916/source#7783

I'm open to other data formats though if we want more granularity.

@vladikoff
Copy link
Contributor

@jbuck @rfk pushing this to active

@rfk
Copy link
Contributor

rfk commented Apr 27, 2017

Somewhere we have a daily job that runs a "COUNT (*) FROM accounts" job on our read-replica db, and pushes the results into heka for display in this old metrics dashboard:

https://metrics.services.mozilla.com/accounts-detail-dashboard/

The simplest option here is probably to appropriate that and have it also pipe the data into S3 (if it's not there already) and then we can import it into redshift. @jbuck do you know the details of this daily job? I only have a vague memory of its existence.

@vladikoff vladikoff added the P2 label May 2, 2017
@whd
Copy link
Member

whd commented May 2, 2017

The database query that counted total accounts was disabled in September of last year, at @jrgm's request due to heavy database load. I believe there was a plan to move that to a read replica and re-enable it, but that may have never happened. Consequently I don't think there is a currently running job that counts total accounts, but I also have only vague memories of how this is set up.

@jbuck
Copy link
Member

jbuck commented May 2, 2017

I'll re-enable this job on the read replica in us-east-1 & dump data onto the same S3 bucket that we currently use for pulling data into redshift

@jbuck
Copy link
Member

jbuck commented May 10, 2017

I finally got this working - I'll check tomorrow morning to see if the job ran successfully overnight.

If we want more up-to-date account totals I think we could run this more frequently than once a day.

@jbuck
Copy link
Member

jbuck commented May 11, 2017

This is available in the same s3 bucket that we currently pull from for redshift but with a different path: /fxa-basic-metrics/fxa-basic-metrics-YYYY-MM-DD.txt. The first date available is 2017-05-11.

@jbuck jbuck closed this as completed May 11, 2017
@jaredhirsch jaredhirsch removed the P2 label May 11, 2017
@rfk
Copy link
Contributor

rfk commented May 11, 2017

Thansk @jbuck! Re-opening and assigning to phil & myself to figure out the import step.

@rfk rfk reopened this May 11, 2017
@rfk rfk assigned philbooth and unassigned jbuck May 11, 2017
@rfk
Copy link
Contributor

rfk commented May 11, 2017

For visibility, the contents of this file are currently formatted as:

count_time|total|verified
2017-05-11|999999999|8888888888

@jbuck how complex would it be to format this as a headerless CSV instead? I'm sure we can work with the current format, but if it's easy to reformat upstream than it might be simpler overall.

@jbuck
Copy link
Member

jbuck commented May 11, 2017

Easy to change it, but I used the pipes because it's the default separator. PMed you the file for editing. It'll require a redeploy of fxa-admin to push the change live. @jrgm when you do the redeploy, make sure to nuke the old stacks

@vladikoff
Copy link
Contributor

from mtg: put it on the graph

@vladikoff vladikoff added the P2 label May 30, 2017
@rfk
Copy link
Contributor

rfk commented Jun 1, 2017

@philbooth I think it should be possible for us to use your generalized metrics import script for this now, let's discuss next week once we're sure we've got to the bottom of the sendSms metrics.

@davismtl
Copy link
Author

davismtl commented Jun 1, 2017

Glad to see movement on this! :)
I was asked by my VP last night about how many accounts we have. I had a bit of trouble answering. :-/

@philbooth
Copy link
Contributor

@rfk, @jbuck, I just went to work on this but of course I can't SSH onto the redshift helper because I changed my SSH keys after losing my machine the other week. Any idea what I should do to fix this?

@rfk
Copy link
Contributor

rfk commented Jun 6, 2017

@philbooth IIRC the latest recommendation is for us to pull people's ssh pubkeys from github; are you keys up-to-date in github? I should be able to add you back to the boxes.

@philbooth
Copy link
Contributor

philbooth commented Jun 6, 2017

@rfk thanks! Yep, my GitHub keys are the updated ones.

@rfk
Copy link
Contributor

rfk commented Jun 7, 2017

@philbooth I've added your key from https://api.github.com/users/philbooth/keys to the redshift-helper box, LMK if you still don't have access.

@philbooth
Copy link
Contributor

Hey @jbuck, there's the odd day where no data is exported to S3, e.g. 2017-06-03, 2017-06-07. Any idea what's up with that?

@jbuck
Copy link
Member

jbuck commented Jun 8, 2017 via email

@rfk rfk removed their assignment Jun 14, 2017
@jaredhirsch jaredhirsch removed the P2 label Jun 15, 2017
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

7 participants