New aggregation function #99
Comments
Overall, this makes a LOT of sense to me, thank you so much for scoping!!
I agree about the external sources, but I'm curious which ones you had in mind. I'd love to integrate MP staff population data, because even imperfect staff rates still feel really valuable to me. One open question for me is whether the user should specify just one source, or whether there should be some coalesce hierarchy across sources.
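To make the "coalesce hierarchy" idea concrete, here is a minimal sketch, assuming each record carries one field per population source. The field names `Staff.Population` and `MP.Staff.Population` and the helper `coalesce_population` are hypothetical, and the numbers are made up for illustration.

```python
def coalesce_population(record, source_order):
    """Return (value, source) for the first source in `source_order`
    that has a non-missing value, or (None, None) if all are missing."""
    for source in source_order:
        value = record.get(source)
        if value is not None:
            return value, source
    return None, None


# Hypothetical record: our own staff count is missing, the MP count is present.
record = {"Staff.Population": None, "MP.Staff.Population": 250}

# Hierarchy: prefer our own number, fall back to the MP number.
value, source = coalesce_population(
    record, ["Staff.Population", "MP.Staff.Population"]
)
```

Returning the source alongside the value keeps it visible which denominator was actually used. Letting the user pick "just one" is then the special case of a one-element `source_order`.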
I also think yes. The vast majority of our data will inevitably be from state and federal prisons, and I think there's a lot of value in making that as comprehensive as possible. Beyond filling in gaps, I think that's also really useful for places like Ohio or Texas.
I personally don't think a facility number count is super useful. One thing that I think might be nice as a (non-default!) option in the long table would be an explicit column for the various population options (so the explicit denominators in addition to the rate column). A few other initial thoughts:
I'll probably have more thoughts when we chat tomorrow, but yay thank you again!!
Currently we have one aggregation function whose primary purpose is to aggregate prison data and compare it to the Marshall Project's reported numbers for the variables we both report on. After reviewing it, I think we should keep that function and add a new function that calculates aggregates by geographic region and by the jurisdiction that reports on, or is responsible for, each facility.
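A rough sketch of what grouping by region and jurisdiction could look like, assuming facility-level rows with `State` and `Jurisdiction` fields. The field names, the `Residents.Confirmed` measure, the helper `aggregate_by_region`, and all values are hypothetical stand-ins, not the project's actual schema or data.

```python
from collections import defaultdict

# Toy facility-level rows (made-up values, hypothetical column names).
facilities = [
    {"State": "Ohio", "Jurisdiction": "state", "Residents.Confirmed": 10},
    {"State": "Ohio", "Jurisdiction": "federal", "Residents.Confirmed": 4},
    {"State": "Ohio", "Jurisdiction": "state", "Residents.Confirmed": None},
    {"State": "Texas", "Jurisdiction": "county", "Residents.Confirmed": 7},
]


def aggregate_by_region(rows, measure):
    """Sum `measure` within each (State, Jurisdiction) group.

    Rows where the measure is missing (None) are skipped rather than
    treated as zero.
    """
    totals = defaultdict(int)
    for row in rows:
        value = row.get(measure)
        if value is not None:
            totals[(row["State"], row["Jurisdiction"])] += value
    return dict(totals)


totals = aggregate_by_region(facilities, "Residents.Confirmed")
```

Keying the result on the (region, jurisdiction) pair keeps state, federal, and county facilities in the same state as separate rows, which is the distinction the new function is meant to expose.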
An example
Additionally, there should be a rates option so that we also calculate rates. The trouble is that there seem to be several options we could use for calculating rates: Population.Feb20, the most recent Residents.Population, or some external source that reports aggregated population totals. I think there should be an option in the function to choose which one is used to calculate rates.
Here is what I'm thinking the function documentation looks like.
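As a rough illustration of the denominator option described above (a sketch only, not the project's function documentation or implementation): the function name `calc_rate`, the `denominator` parameter, and the `External.Population` field are all hypothetical, and the numbers are made up.

```python
# Denominator choices named in the discussion, plus a hypothetical field
# standing in for an external source of aggregated population totals.
DENOMINATORS = ("Population.Feb20", "Residents.Population", "External.Population")


def calc_rate(count, record, denominator="Residents.Population"):
    """Divide `count` by the population field selected by `denominator`.

    Returns None when the count is missing or the chosen population is
    missing or zero, so an unusable denominator never yields a rate.
    """
    if denominator not in DENOMINATORS:
        raise ValueError(f"denominator must be one of {DENOMINATORS}")
    population = record.get(denominator)
    if count is None or not population:
        return None
    return count / population


# Toy aggregated row (made-up values).
row = {"Population.Feb20": 100, "Residents.Population": 80}

rate_feb20 = calc_rate(5, row, denominator="Population.Feb20")
rate_recent = calc_rate(5, row)  # defaults to Residents.Population
rate_external = calc_rate(5, row, denominator="External.Population")
```

With this shape, the same count gives a different rate depending on which population is chosen, and a missing denominator yields `None` instead of silently falling back to another source.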
Some outstanding issues I have.