Warning messages from the model #26

Open
3 tasks
stephaniereinders opened this issue Oct 3, 2022 · 0 comments
Labels
bug Something isn't working

Several things can go wrong with the pipeline.

Too many outliers

If "too many" graphs are assigned to the outlier cluster for the cluster template docs, the model training docs, or the questioned docs, the code should issue a warning.

To Do:

  • Ask Alicia and Danica how we should define too many outliers
  • Add warning to make_clustering_templates()
  • Add warning to get_clusterassignment()
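
A minimal sketch of what such a check could look like, written in Python purely for illustration. Everything here is an assumption: the helper name `warn_if_too_many_outliers`, the `-1` outlier label, and the `max_outlier_fraction` parameter are hypothetical, and the threshold is deliberately left to the caller because "too many" has not been defined yet.

```python
import warnings

# Assumption: outliers are coded with the label -1. The real coding may differ.
OUTLIER_LABEL = -1


def warn_if_too_many_outliers(cluster_assignments, max_outlier_fraction,
                              doc_type="documents"):
    """Warn if the share of graphs in the outlier cluster exceeds a threshold.

    cluster_assignments: list with one cluster label per graph.
    max_outlier_fraction: placeholder threshold, still to be decided.
    Returns the observed outlier fraction.
    """
    if len(cluster_assignments) == 0:
        raise ValueError("cluster_assignments is empty")

    num_outliers = sum(1 for c in cluster_assignments if c == OUTLIER_LABEL)
    fraction = num_outliers / len(cluster_assignments)

    # Warn rather than stop, so the pipeline can continue.
    if fraction > max_outlier_fraction:
        warnings.warn(
            f"{fraction:.0%} of graphs in the {doc_type} were assigned to the "
            f"outlier cluster (threshold: {max_outlier_fraction:.0%}).",
            UserWarning,
        )
    return fraction
```

The same helper could be called from both functions in the To Do list, with `doc_type` naming which set of documents triggered the warning.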

Empty clusters

examples_empty_cluster.pdf
In the attached example, the cluster template has 5 clusters, but none of the graphs from the model training data are assigned to cluster 5. I asked Danica how we should handle this, and she said that handwriter should return an error suggesting that the user create a new template with fewer clusters.

Need to decide which function(s) should throw the error. Maybe the format_model_data() function? Then format_questioned_data() should throw an error if the questioned data doesn't use all of the clusters.
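
One possible shape for the empty-cluster check, sketched in Python for illustration only. The helper names `find_empty_clusters` and `check_no_empty_clusters` are hypothetical, and the sketch assumes clusters are labeled 1 through `num_clusters`. Which function should actually call it is still the open question above.

```python
def find_empty_clusters(cluster_assignments, num_clusters):
    """Return the template clusters that no graph was assigned to.

    Assumption: clusters are labeled 1..num_clusters.
    """
    used = set(cluster_assignments)
    return [k for k in range(1, num_clusters + 1) if k not in used]


def check_no_empty_clusters(cluster_assignments, num_clusters,
                            doc_type="model training data"):
    """Raise an error if any template cluster is unused."""
    empty = find_empty_clusters(cluster_assignments, num_clusters)
    if empty:
        raise ValueError(
            f"No graphs from the {doc_type} were assigned to cluster(s) "
            f"{empty}. Consider creating a new template with fewer clusters."
        )
```

With a 5-cluster template and assignments that only use clusters 1 to 4, `find_empty_clusters` returns `[5]` and `check_no_empty_clusters` raises the error.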

Model docs and questioned docs use different clusters

I didn't record an example, but the questioned docs occasionally use different clusters from the model docs, which throws an error in analyze_questioned_docs().

Need to decide how to handle this. Maybe format_questioned_data() should throw an error if it uses different clusters from the model docs?
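
If the decision is to fail early, the comparison could look something like the Python sketch below. This is an illustration under assumptions, not existing code: `check_same_clusters` is a hypothetical helper, and it takes plain lists of cluster labels rather than whatever the formatted data actually looks like.

```python
def check_same_clusters(model_clusters, questioned_clusters):
    """Raise an error if the two sets of documents use different clusters.

    Both arguments are lists with one cluster label per graph.
    """
    model_set = set(model_clusters)
    questioned_set = set(questioned_clusters)

    # Clusters that appear on one side only.
    only_model = sorted(model_set - questioned_set)
    only_questioned = sorted(questioned_set - model_set)

    if only_model or only_questioned:
        raise ValueError(
            "Model docs and questioned docs use different clusters. "
            f"Only in model docs: {only_model}. "
            f"Only in questioned docs: {only_questioned}."
        )
```

Reporting which clusters are missing on each side would give a clearer message than the error that currently surfaces later in analyze_questioned_docs().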

@stephaniereinders stephaniereinders added the bug Something isn't working label Oct 3, 2022
@stephaniereinders stephaniereinders changed the title What if model training data doesn't use one of the clusters? Warning messages from the model Oct 26, 2022