Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

IUC Contribution Fest - Metagenomics and mothur #419

Closed
8 tasks
bgruening opened this issue Nov 26, 2015 · 16 comments
Closed
8 tasks

IUC Contribution Fest - Metagenomics and mothur #419

bgruening opened this issue Nov 26, 2015 · 16 comments

Comments

@bgruening
Copy link
Member

I will start this issue to list all idea specifically for all people that are interested in working on mothur during our Metagenomic Hackathon (#299).

I have created an initial branch with wrappers from @IyadKandalaft and team: https://github.com/galaxyproject/tools-iuc/tree/mothur

My idea is to use this issue to collect ideas specific to the mothur wrapper, coordinate efforts and create PR against this branch.

Thanks everyone! This will be awesome!

@oxyko
Copy link

oxyko commented Nov 27, 2015

We have existing issue reported for this wrapper, so I'm pasting it here for everyone to be aware:

Problem with 2 of the Chimera tools (both in galaxy tool version 1.22 and 1.23) Chimera.slayer and Chimera.uchime.

In spite of the fact that in the drop down menu for the tool it states that a groups file is optional and a count file is optional instead of leaving these parameters out of the mothur command it uses "groups=None" and then the tool fails due to not being able to open file "None"

mothur > chimera.uchime(fasta=dataset_37011.dat,group=None,count=None,reference=self,abskew=1.9,minh=0.3,mindiv=0.5,xn=8.0,dn=1.4,xa=1.0,chunks=4,minchunk=64,idsmoothwindow=32,maxp=2,processors=8)
Unable to open None. Trying output directory None
Unable to open None. It will be disregarded.
Unable to open None. Trying output directory None
Unable to open None. It will be disregarded.
[ERROR]: no valid group files.
[ERROR]: The number of groupfiles does not match the number of fastafiles, please correct.

Using 8 processors.
[ERROR]: did not complete chimera.uchime.
mothur > chimera.slayer(fasta=dataset_37011.dat,reference=self,group=None,count=None,processors=8)
Unable to open None. Trying output directory None
Unable to open None. It will be disregarded.
Unable to open None. Trying output directory None
Unable to open None. It will be disregarded.
[ERROR]: no valid group files.
[ERROR]: The number of groupfiles does not match the number of fastafiles, please correct.

Using 8 processors.
[ERROR]: did not complete chimera.slayer.

screen shot 2015-06-10 at 1 09 31 pm
screen shot 2015-06-10 at 1 11 34 pm

@jmchilton
Copy link
Member

I think the mothur datatypes should be modified so the short extension includes a prefix mothur. like mothur.otu.

@oxyko
Copy link

oxyko commented Nov 30, 2015

Will be installing and figuring out planemo and then working on the chimera tools issue and tests for them.

@jj-umn
Copy link
Contributor

jj-umn commented Nov 30, 2015

@jmchilton Agree on prefixing datatypes. Also, should see if any of these could just be subclassed, rather than python class. I guess that would depend on whether there is any useful metadata for the datatype.

@IyadKandalaft
Copy link

I'm looking at patching to support v1.36.1 of mothur

@bgruening
Copy link
Member Author

Anyone up for a discussion round about mothur and the current state?
I have made an initial review of #429 and we can talk about it if you like.

@oxyko
Copy link

oxyko commented Nov 30, 2015

I've refactored all wrappers to use macros, as discussed earlier today. Currently macros takes care of requirements, citations and some common text from help. Wanted to extract the wrapper version as well, but the versions are all over the place: from 1.20.0 to 1.27.0. So not sure if it is better to use one version for all the wrappers, and if yes, then which one. Any thoughts?

@bgruening
Copy link
Member Author

@oxyko this is a great work!

@shiltemann
Copy link
Member

@oxyko awesome! :)

Also, since all files pretty consistently use 1-space indentation, I could do something like sed -i 's/^(\ *)/\1\1\1\1/g ' to all xml files to quadruple all leading spaces? ..or shall we we not worry about this for now?

@bgruening
Copy link
Member Author

@shiltemann I think this is a good idea. I would wait before we have merged @oxyko PR to prevent more merge conflicts. But yes this is the way to go.

@bgruening
Copy link
Member Author

@oxyko I think we should aim to target the latest release, right. So fix the latest release available in the requirement tag :)

@bgruening
Copy link
Member Author

@shiltemann did a huge cleanup PR in #447
Please rebase your work to the latest changes. Thanks!
@oxyko you also need to rebase

@oxyko
Copy link

oxyko commented Dec 1, 2015

@shilteman Done! Thanks for clean up!

Also refactored the wrapper versions (all, but TreeVector.xml, which is another tool) to use one version from macros. The version is now set to 1.27.0. Same pull request (#448). Please merge.

@blankenberg
Copy link
Member

Just a drive-by to +1 making Mothur paired dataset collection aware, especially for e.g. Make.contigs (and possibly others).

@shiltemann
Copy link
Member

@blankenberg thanks, see #853

@martenson
Copy link
Member

Closing this issue since this codefest is long finished.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

8 participants