Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: disable non-a-number filtering in clade file parsing #1293

Merged
merged 4 commits into from Aug 29, 2023

Conversation

rneher
Copy link
Member

@rneher rneher commented Aug 25, 2023

trival PR to allow gene names 'NA' in clades.tsv

Discussions

  • This change exposed issues with empty values in the clades file. (thread)

@rneher rneher requested a review from victorlin August 25, 2023 12:54
@codecov
Copy link

codecov bot commented Aug 25, 2023

Codecov Report

Patch coverage: 100.00% and no project coverage change.

Comparison is base (435bd29) 69.92% compared to head (f5f9590) 69.92%.

❗ Current head f5f9590 differs from pull request most recent head 11d414d. Consider uploading reports for the commit 11d414d to get more accurate results

Additional details and impacted files
@@           Coverage Diff           @@
##           master    #1293   +/-   ##
=======================================
  Coverage   69.92%   69.92%           
=======================================
  Files          67       67           
  Lines        7161     7162    +1     
  Branches     1745     1746    +1     
=======================================
+ Hits         5007     5008    +1     
  Misses       1848     1848           
  Partials      306      306           
Files Changed Coverage Δ
augur/clades.py 91.06% <100.00%> (+0.05%) ⬆️

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

augur/clades.py Outdated Show resolved Hide resolved
@victorlin victorlin self-assigned this Aug 25, 2023
@rneher
Copy link
Member Author

rneher commented Aug 29, 2023

@victorlin do you think this can go in? The empty line handling is still bit wonky. But I think it is better than before...

victorlin and others added 3 commits August 29, 2023 11:17
@victorlin
Copy link
Member

@rneher I force-pushed a new set of commits to make changes more clear, and added a changelog entry.

Will merge after checks pass.

@victorlin victorlin merged commit 3ce0373 into master Aug 29, 2023
26 checks passed
@victorlin victorlin deleted the fix/clades-na branch August 29, 2023 18:44
joverlee521 added a commit to nextstrain/rsv that referenced this pull request Aug 29, 2023
Discovered as part of nextstrain/augur#1293 that the
clade TSVs for this repo has excess whitespace which causes it to bypass
the default `skip_blank_lines=True` behavior for `pandas.read_csv`.

The issue has been fixed in Augur, but still nice to remove the excess
whitespace here to keep things uniform with other pathogen clade TSVs.

Note that the trailing whitespace after inherited clade definitions
are required.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
No open projects
Development

Successfully merging this pull request may close these issues.

None yet

3 participants