What additional request attributes should we capture? #7
Replies: 7 comments 15 replies
-
timezone could be nice for backing out questions like "what time of day do folks listen to my episode" without going through region gymnastics. |
Beta Was this translation helpful? Give feedback.
-
I’m not sure how accurate lon/lat and other values are anyhow. Cell traffic, corporate routers… so I’d be shy of data that’s not a certainty. |
Beta Was this translation helpful? Give feedback.
-
I don't have any hesitations about privacy with that dataset. Some of it may be redundant (e.g. a country will only ever be on one continent), but maybe you just want to make it nicer for the API to do some of those lookups for them? I’m not sure how accurate lon/lat and other values are anyhow. |
Beta Was this translation helpful? Give feedback.
-
Country, continent, timezone, and possibly regioncode for me (I wonder whether that's just a US thing). I'd be very wary of using anything more granular: while we understand that lat/lon or name of place is only approximate, the optics of it are that "OMG we are being tracked", and I'm not sure that helps anyone. For me, for example, I'd expect "Brisbane" would show up, but in reality, knowing the datasets, it'll be the small suburb I live in, Bardon, which has a very low population of just 9,500 people. (Let's not forget - the podcast hosting company can absolutely show that to their customers if they want, but I'm a little more wary of this being open data available everywhere. The fewer objections people have to joining OP3, the better, as far as I'm concerned.) asn may be useful to help combat fraud, and I think it worth including that. That could directly lead back to the (erroneous, in my view) "disregard everything from Amazon AWS" rule that the IAB has. Other suggestions: |
Beta Was this translation helpful? Give feedback.
-
I agree that storing |
Beta Was this translation helpful? Give feedback.
-
I would be careful about including |
Beta Was this translation helpful? Give feedback.
-
Alright, just pushed a release yesterday that captures more of these non-listener-identifying attributes on every request served via OP3. https://op3.dev/releases#2022-10-20 Thanks to everyone for chiming in! |
Beta Was this translation helpful? Give feedback.
-
Right now, OP3 follows the principle of Data minimization, and only captures/stores the small subset of the request attributes that it needs to serve back in the API, no more.
We have many other request-level attributes available to us, but now are completely ignoring/dropping them. Some of these attributes should probably be considered safe to process as they have little to no listener privacy impact, and would make our eventual show/episode-level analytics much more useful.
I'd like to arrive at a consensus as to which of the following attributes are "no-brainers" to start storing/returning, those that are "on-the-line", could go either way, and those we should always avoid storing. The goal is to provide maximum utility without going over the creepy line wrt listeners' expectations.
Here are the attributes we are not currently capturing for each redirect request, but could in the future:
asn
: (number) the ASN of the incoming request (e.g.395747
)asOrganization
: (string) The organization which owns the ASN of the incoming request (e.g.Google Cloud
)country
: (string) two-letter country code (e.g.US
)isEUCountry
: (boolean) Whether the country is in the EUcity
: (string) (e.g.Austin
)continent
: (string) (e.g.NA
for North America)latitude
: (string) (e.g.30.27130
)longitude
: (string) (e.g.-97.74260
)postalCode
: (string) (e.g.78701
)metroCode
: (string) Metro code (DMA) (e.g.635
)region
: (string) If known, the ISO 3166-2 name for the first level region (e.g.Texas
)regionCode
: (string) If known, the ISO 3166-2 code for the first-level region (e.g.TX
)timezone
: (string) (e.g.America/Chicago
)To me:
country
,continent
,region/regionCode
, andtimezone
seem like no-brainers.I think
metroCode
would also be broad enough to be non-identifying but still very useful for folks building on top of OP3.asn
would be potentially useful for tagging traffic coming from cloud services in the future.What does everyone think?
Beta Was this translation helpful? Give feedback.
All reactions