Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documentation needed for record counts #753

Open
bobpersing opened this issue May 2, 2022 · 1 comment
Open

Documentation needed for record counts #753

bobpersing opened this issue May 2, 2022 · 1 comment
Labels

Comments

@bobpersing
Copy link

bobpersing commented May 2, 2022

Princeton has asked for clarification on the record counts associated with their organization. I couldn't find anything in the wiki, so I think we should add something. We might also want to consider giving the counts unique and more descriptive names.

There are currently four record counts displayed for each organization. Two are visible on the organization index page:
https://pod.stanford.edu/organizations
and two more on the individual page for the organization. Using Princeton as the example, that would be:
https://pod.stanford.edu/organizations/princeton

Here are Princeton's counts as of 4/29/22:

  1. Index page, unique records=7,273,095
  2. Index page, total records=12,754,251
  3. Individual org page, unique records=6,034,298
  4. Individual org page, records=6,058,976

The source of count #4 is self-evident: it's the gross number of records submitted in the current default stream.

I assume count #2 is the total of count #4 for all the organization's streams. This is hard to prove, though, because POD doesn't currently display counts for non-default streams.

Is count #3 a "net" count of records in the default stream: i.e., the gross count, minus any records submitted more than once? If so, organizations should expect the difference between counts 3 and 4 to increase the longer the default stream remains in use.

Is count #1 a deduplicated total of count #3 plus the unique record count from the earlier, non-default streams? For example: if an organization had 2 streams:

  • stream 1 having 500 unique records
  • stream 2 (the default stream) having 1,000 records, 200 of which are newer versions of records in stream 1

would count #1 for that organization be 1,300?

@ggeisler
Copy link
Contributor

ggeisler commented May 2, 2022

I assume count #2 is the total of count #4 for all the organization's streams. This is hard to prove, though, because POD doesn't currently display counts for non-default streams.

Sort of a side issue @bobpersing, but note that this is an oversight we'll hopefully fix soon in #755.

@corylown corylown self-assigned this May 5, 2022
@corylown corylown removed their assignment Jun 1, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Development

No branches or pull requests

3 participants