Create categorical contingency table data structure #17

Closed
fernando-aristizabal opened this issue Feb 23, 2023 · 0 comments
Assignees
Labels
enhancement New feature or request

Comments

fernando-aristizabal commented Feb 23, 2023

Implement a standard schema for contingency table structure.

This issue depends on #38.

Current behavior

  • Some post-processing is implemented to index cross-tabulated counts by the unique values in the candidate map (rows) and the benchmark map (columns).
  • There is no way to keep track of candidate attributes, benchmark attributes, or process parameters.
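
For reference, the current row/column layout can be sketched as follows. This is a minimal illustration that assumes pandas; the arrays and their values are made up and are not taken from the project.

```python
import numpy as np
import pandas as pd

# Hypothetical flattened categorical maps (values invented for illustration).
candidate = np.array([1, 1, 2, 2, 2, 1])
benchmark = np.array([1, 2, 2, 2, 1, 1])

# Cross-tabulate counts: candidate values index the rows,
# benchmark values index the columns.
crosstab = pd.crosstab(
    pd.Series(candidate, name="candidate"),
    pd.Series(benchmark, name="benchmark"),
)
print(crosstab)
```

A table like this is easy to read but has no place to store attributes of the candidate, the benchmark, or the process that produced it.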

Expected behavior

  • The above structure is great for readability but does not account for the various hierarchies of samples, including sub-samples, bands/features, maps, and catalogs.

  • A contingency table structure is required that reports cross-tabulated counts for each sample as well as their associated attributes.

  • Methods to aggregate (group by) crosstab counts by hierarchy level or by associated attributes are important.

  • Ways of melting and pivoting this structure into something more human-readable, such as a cross-tabulation table indexed by the unique values in the candidate map (rows) and the benchmark map (columns), would be nice to have as well.

  • Some method of tagging each column with metadata is necessary. The metadata currently envisioned are source (candidate, benchmark, or process) and hierarchy level (catalog, map, band/feature, sub-sample).

  • What data structures are necessary to account for this?

  • What classes should be created for this?
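
One possible shape for such a structure is a long-format table with one row per sample and value pair, plus a separate mapping that tags each attribute column. The sketch below assumes pandas; every column name, tag, and count is hypothetical and only illustrates the aggregate, pivot, and melt operations listed above. It is not a proposed final schema.

```python
import pandas as pd

# Hypothetical long-format contingency table: one row per
# (candidate map, benchmark map, candidate value, benchmark value).
columns = ["candidate_map", "benchmark_map", "resampling",
           "candidate_value", "benchmark_value", "counts"]
records = [
    ("map_1", "bench_1", "nearest", 1, 1, 10),
    ("map_1", "bench_1", "nearest", 1, 2, 3),
    ("map_1", "bench_1", "nearest", 2, 1, 4),
    ("map_1", "bench_1", "nearest", 2, 2, 8),
    ("map_2", "bench_1", "nearest", 1, 1, 5),
    ("map_2", "bench_1", "nearest", 1, 2, 1),
    ("map_2", "bench_1", "nearest", 2, 1, 2),
    ("map_2", "bench_1", "nearest", 2, 2, 7),
]
table = pd.DataFrame(records, columns=columns)

# Hypothetical column tags: column name -> (source, hierarchy level).
column_metadata = {
    "candidate_map": ("candidate", "map"),
    "benchmark_map": ("benchmark", "map"),
    "resampling": ("process", "catalog"),
}

def columns_from(source):
    """Return the attribute columns tagged with the given source."""
    return [col for col, (src, _) in column_metadata.items() if src == source]

# Aggregate counts across all maps for each value pair.
value_pair = ["candidate_value", "benchmark_value"]
aggregated = table.groupby(value_pair, as_index=False)["counts"].sum()

# Aggregate by an attribute instead: total counts per candidate map.
per_map = table.groupby(columns_from("candidate"))["counts"].sum()

# Pivot to the human-readable layout: candidate rows, benchmark columns.
readable = aggregated.pivot(
    index="candidate_value", columns="benchmark_value", values="counts"
)

# Melt back to the long format.
melted = readable.reset_index().melt(
    id_vars="candidate_value", var_name="benchmark_value", value_name="counts"
)
```

Keeping the tags in a plain dictionary next to the table is only one option; a dedicated class wrapping the DataFrame, or a column MultiIndex carrying source and level, would address the same requirement.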

@fernando-aristizabal fernando-aristizabal added the enhancement New feature or request label Feb 23, 2023
@fernando-aristizabal fernando-aristizabal self-assigned this Feb 23, 2023
@fernando-aristizabal fernando-aristizabal changed the title Implement categorical contingency statistics table. Allow for attributes within categorical contingency tables. Mar 7, 2023
@fernando-aristizabal fernando-aristizabal changed the title Allow for attributes within categorical contingency tables. Create categorical contingency table data structure Mar 7, 2023