New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
skip trimming option #124
skip trimming option #124
Conversation
thanks! there are some failures btw cc @antgonza |
A fix for the file path is coming in - its passing on my system. But this is ready for review. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One comment.
input_seqs=sequence_generator(seqs_fp), trim_len=trim_length): | ||
out_f.write(">%s\n%s\n" % (label, seq)) | ||
if skip_trim: | ||
for label, seq in sequence_generator(seqs_fp): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is basically duplicating the file, right? Can we do something smarter?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
What about creating a simple symlink?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That's cool! I didn't know about that.
Ok. I think this is ready to merge! |
Thanks! |
This adds an option to ignore trimming.
So you should be able to specify a flag to completely ignore trimming.
Here a test case is added. In addition, I have tested out the CLI on some of the test datasets.
Note that this is a little tricky to test, so any suggestions on improving this will be welcome.
But I believe that this should cover the gist of it.