Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Already on GitHub? Sign in to your account

sens.spec should be able to use a count file #225

Closed
pschloss opened this Issue Apr 14, 2016 · 7 comments

Comments

Projects
None yet
2 participants
Contributor

pschloss commented Apr 14, 2016

Currently it takes in a names file, but not a count file. Also, to run this the user will need a distance file. If they run cluster.split, a composite distance file is not generated so we need to concatenate the distance files for each split.

@pschloss pschloss added this to the Version 1.38.0 milestone Apr 14, 2016

@pschloss pschloss changed the title from sen.spec should be able to use a count file to sens.spec should be able to use a count file Apr 14, 2016

Contributor

mothur-westcott commented Apr 15, 2016

I don't think we need the count file, do we? The name file is used to preprocess the list file to remove the dups, but when you create a list file with a count file it doesn't contain dups.

Contributor

pschloss commented Apr 15, 2016

That's right - then I guess we need to turn off the requirement that a names file be provided? Can you double check that we get the same result generating a list file with and without using a count file?

Contributor

mothur-westcott commented Apr 15, 2016

I think we want to leave the names file. So right now you can cluster in 2 ways:

  1. list and names
  2. list and count - the count file is needed for average neighbor to have the right abundances

If you clustered with a names file then you would need to provide it so we can remove the duplicate sequences. If you clustered with a count file then the list file is already unique.

The list files are different based on whether you cluster with a name or count file, but the shared file generated by the list and group or list and count are the same. Make sense?

Contributor

pschloss commented Apr 15, 2016

But this command doesn't use a shared file - it uses a list file, dist file, and currently a count file

Contributor

mothur-westcott commented Apr 15, 2016 edited

I thought it used a list, dist and names?? What the command is looking for is a unique list, correct?

Contributor

pschloss commented Apr 15, 2016

right sorry - list, dist, and names

Sent frim my iPhone, expext more typos then nirmal

On Apr 15, 2016, at 15:48, Sarah Westcott notifications@github.com wrote:

I thought it used a list, dist and names??


You are receiving this because you authored the thread.
Reply to this email directly or view it on GitHub

Contributor

mothur-westcott commented Jul 12, 2016

Added count file to work with opti method results

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment