Create method=agc and method=dgc for cluster/cluster.split command #169
@pschloss Is step 4 supposed to have different settings? How does vsearch know which method you are requesting? Does it rely on the file extension? Do you want to use vsearch to run steps 1-3 or mothur?
mothur-westcott added 5 commits that referenced this issue on Oct 12, 2015
Sorry, I had a typo above. DGC isn't supposed to have the --sizeorder flag, while AGC is. I've corrected the code above.
mothur-westcott added 9 commits that referenced this issue: 3 on Jan 5, 2016; 1 on Feb 7, 2016; 2 on Feb 8, 2016; 2 on Feb 9, 2016; 1 on Feb 29, 2016
Both of these would be wrappers for the agc (abundance-based greedy clustering) and dgc (distance-based greedy clustering) methods in the cluster and cluster.split commands.
The algorithm would go something like this...
I currently have a hack to do this using bash and R. Here's how it goes using unaligned sequences...
For method=agc...
For method=dgc...
The R code...
Options to include...
--id should be settable by the user, with 0.97 as the default. In mothur terms the default cutoff would be 0.03, so we need to compute 1-cutoff to get the value for --id.
--threads option, which takes an int
userLabel. This should be 0.03 (i.e. the value of cutoff).
Also, the R code gets pretty slow, so if there's a way to do it better that would be great. Again, this was a hack :)
I'll email some example input and output that was generated without running the rm command at the end of each script for both methods.
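To make the option mapping above concrete, here is a minimal Python sketch of how mothur-style settings could be translated into the vsearch flags named in this thread. It covers only what is stated here: --id is 1-cutoff, --threads takes an int, and --sizeorder is passed for agc but not for dgc. The function name vsearch_options and the parameter names (method, cutoff, processors) are hypothetical, and this is not the full vsearch command line, since the rest of the original scripts is not shown.

```python
def vsearch_options(method, cutoff=0.03, processors=1):
    """Build the vsearch flags discussed in this issue (hypothetical helper).

    Only --id, --threads, and --sizeorder are handled; any other flags the
    real scripts use are intentionally left out because they are not shown.
    """
    method = method.lower()
    if method not in ("agc", "dgc"):
        raise ValueError("method must be 'agc' or 'dgc'")
    if not 0.0 <= cutoff <= 1.0:
        raise ValueError("cutoff must be between 0 and 1")

    # mothur expresses the threshold as a distance (cutoff); vsearch expects
    # a similarity, so --id is 1 - cutoff. Rounding hides float noise.
    identity = round(1.0 - cutoff, 6)

    args = ["--id", f"{identity:g}", "--threads", str(int(processors))]

    # Per the correction in the thread: AGC uses --sizeorder, DGC does not.
    if method == "agc":
        args.append("--sizeorder")
    return args


if __name__ == "__main__":
    print(vsearch_options("agc"))
    print(vsearch_options("dgc", cutoff=0.03, processors=4))
```

Keeping the cutoff-to-identity conversion in one place means the user can keep thinking in mothur's cutoff terms while the wrapper supplies the similarity value that vsearch expects.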