Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Output invariant sites and nucleotide frequencies #62

Open
EpiDemos82 opened this issue Sep 23, 2017 · 5 comments
Open

Output invariant sites and nucleotide frequencies #62

EpiDemos82 opened this issue Sep 23, 2017 · 5 comments

Comments

@EpiDemos82
Copy link

EpiDemos82 commented Sep 23, 2017

In general, phylogenetic programs use invariant sites for likelihood calculations. However, a number of programs, such as RAxML and BEAST, can perform ascertainment bias corrections given the number of invariant sites and the frequencies of nucleotides in the alignment. If SNP-sites output these values, they could be used as direct inputs for RAxML, for example.

@tseemann
Copy link
Contributor

I second this suggestion.

Either add a -s (stats?) option to report all sorts of columnar statistics, characters used etc.

OR

Always output this to stderr as part of the logs.

@andrewjpage
Copy link
Member

andrewjpage commented Sep 27, 2017 via email

@EpiDemos82
Copy link
Author

EpiDemos82 commented Sep 27, 2017

From your example in the README:

sample1
AGACACAGTCAC
sample2
AGACAC----AC
sample3
AAACGCATTCAN

-s (or stderr) would produce:

Input stats:
Alignment length: 12
Proportion Ns: 0.03
Proportion Gap sites: 0.11
Nucleotide frequencies (A,G,C,T): 45.2,12.9,32.3,9.7

Output stats:
SNP alignment length: 3
Number Gap sites (-) introduced: 1
Proportion gap sites: 0.11

Thinking about this more, I obviously came up with a couple other useful stats. seqtk comp can produce similar stats, but having them in one tool with the speed of SNP-sites would be great.

@tseemann
Copy link
Contributor

tseemann commented Oct 4, 2017

But output in a machine readable format so we can parse or JSON-ate.

@slvrshot
Copy link

slvrshot commented Apr 1, 2021

@andrewjpage

Hello. I was curious if this was ever implemented?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants