New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[MRG] Update sbt_gather output formats #175
Conversation
The new output format is:
and then 'xxx' is a CSV file containing:
@taylorreiter does this look OK? |
It's everything I ever hoped and dreamed it would be. |
I don't know how to react to @taylorreiter optimism, this never happens on open source projects... |
Codecov Report
@@ Coverage Diff @@
## master #175 +/- ##
==========================================
+ Coverage 85.78% 85.78% +<.01%
==========================================
Files 13 13
Lines 1934 1942 +8
Branches 52 52
==========================================
+ Hits 1659 1666 +7
- Misses 265 266 +1
Partials 10 10
Continue to review full report at Codecov.
|
Note to self: I think this just needs some tests, is all. |
Note that 'total' no longer reflects what is printed out, sigh. |
@taylorreiter @brooksph @halexand how does this look?
|
Very readable. The Mbp makes the interpretation of the percentage more intuitive. |
On Mon, Apr 24, 2017 at 10:13:19AM -0700, Taylor Reiter wrote:
Will the comment about 0.0% identification only be present when a microbes is identified as 0.0%, or will that be a later addition?
I think the bp addition in column 1 is the solution to that, no? It may
be 0.0% if you only get 100kb, of a 10 Mbp genome, but that's still a
significant match, so if we show "100kb" then life is good.
…--titus
|
also note new argument |
oddity in output:
|
above oddity fixed in this branch. |
Ready for review @luizirber @betatim |
Hmm, might be a good idea to sort by bp or something. |
sourmash_lib/__init__.py
Outdated
@@ -15,3 +15,11 @@ | |||
|
|||
DEFAULT_SEED = get_minhash_default_seed() | |||
MAX_HASH = get_minhash_max_hash() | |||
|
|||
def scaled_to_max_hash(scaled): |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
not used in this PR?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
good catch - removed!
👍 maybe a second look from @luizirber |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
👍
Fixes #169 (align stdout and --output for sbt_gather) and #170 (label columns for sbt_gather).
Removes
--csv
and replaces it with--output
, which now outputs CSV format.make test
Did it pass the tests?make coverage
Is the new code covered?without a major version increment. Changing file formats also requires a
major version number increment.
changes were made?