Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Organization of clustering and correlation analytics #13

Closed
bentsherman opened this issue Dec 19, 2017 · 3 comments
Closed

Organization of clustering and correlation analytics #13

bentsherman opened this issue Dec 19, 2017 · 3 comments

Comments

@bentsherman
Copy link
Member

bentsherman commented Dec 19, 2017

As I'm looking at how to implement mixture model clustering, I'm beginning to see a multi-stage pipeline with options at several points:

*.emx ---> clustering [---> ???] ---> correlation ---> *.cmx

clustering:
- none
- k-means
- GMM

correlation:
- Pearson
- Spearman
- ...

So I'm trying to figure out how to best implement this pipeline for the long-term. It looks like KINCv1 can combine clustering with any correlation method, with minimal duplication. Perhaps we will need to create a new data type for the "augmented" expression matrix? It would parallel the PairWiseClusterList from KINCv1. Then the clustering and correlation analytics could be kept separate and the user could simply use the pipeline illustrated above.

@4ctrl-alt-del
Copy link
Member

4ctrl-alt-del commented Dec 19, 2017 via email

@bentsherman
Copy link
Member Author

Ah, so that's how the *.ccm fits into all of this. Excellent!

@feltus
Copy link

feltus commented Dec 21, 2017 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants