Skip to content

Conversation

@sof202
Copy link
Owner

@sof202 sof202 commented Oct 7, 2024

Description

This pull request will add a new metric for assessing redundant states. The metric is the median length of contiguous state assignments for each state. The reason we are using this is outlined in the updated documentation. Put simply, there's an edge case that isolation score cannot pick up, so this metric is in place to identify such edge cases and give even more information to the user.

Type of change

  • Bug fix
  • New feature
  • Code refactor
  • Documentation update

Checklist:

  • My code is consistent in style with the rest of ChromOptimise
  • I have performed a self-review of my own code
  • I have made corresponding changes to the documentation

sof202 added 22 commits October 7, 2024 09:19
This is because the distribution of sizes is very skewed. To avoid the
metric being influenced by this skew, we use the median instead. I'd
love to use the mode here, but the sizes are too variable for that to
give meaningful values.
This should help with interpretability when changing between bin sizes.
Especially considering you can't have 2.5 bins (so an optimal number
might not translate etc).
@sof202 sof202 merged commit e4d16bd into main Oct 7, 2024
@sof202 sof202 deleted the add-contiguous-sequences branch October 7, 2024 14:04
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant