Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create summary tables of the main page table #5

Closed
vipulnaik opened this issue Nov 28, 2019 · 6 comments
Closed

Create summary tables of the main page table #5

vipulnaik opened this issue Nov 28, 2019 · 6 comments

Comments

@vipulnaik
Copy link

I'm interested in grouped summaries such as:

  • Summary of pageview counts on timelines wiki and Wikipedia based on principal contributor(s)
  • Summary of pageview counts on timelines wiki and Wikipedia based on topic
  • Summary of pageview counts on timelines wiki and Wikipedia based on whether the article is on Wikipedia
@riceissa
Copy link
Owner

What summary statistics do you want (e.g. just a sum of pageviews)?

I'm guessing just sorting the existing table by the appropriate column (once we add principal contributor) is inadequate, but I don't understand why.

@vipulnaik
Copy link
Author

e.g. just a sum of pageviews

Yes, exactly. Sum of pageviews on TW, sum of pageviews on Wikipedia, and total number of rows.

It would be hard for me to mentally sum up dozens of rows even after the sorting capability is added. It's hard for me to sum up more than 3-4 numbers by eyeballing. I'm guessing others will have similar limitations.

@riceissa
Copy link
Owner

How do you want to treat the case where a single timeline has multiple principal contributors?

e.g. let's say Issa Rice is principal contributor on timelines A and B, and Sebastian Sanchez is principal contributor on timelines B and C.

We could do a thing where multiple authors get to "claim" a timeline and get it included into their view counts like this:

Issa Rice (A,B): _ views
Sebastian Sanchez (B,C): _ views

or each person's name could stand for timelines where they are the sole principal contributor, like this:

Issa Rice (A): _ views
Sebastian Sanchez (C): _ views
Issa Rice and Sebastian Sanchez (B): _ views

@riceissa
Copy link
Owner

Also for pageviews I'm assuming you only want the past 30-31 days (just like the current main page table).

@vipulnaik
Copy link
Author

@riceissa Great question! How hard would it be to break out number of views to which that person was the sole principal contributor, and number of views for which that person was one of multiple principal contributors? From this level of broken-out data, we should be able to recover both the summary views you mentioned.

@riceissa
Copy link
Owner

riceissa commented Dec 1, 2019

Here's what I have so far: https://timelines.issarice.com/wiki/User:Issa/test

what I still need to do:

  • check if the numbers make sense
  • add explanations (e.g. the date window for pageviews)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants