Nucleotide alignment with invalid nucleotide "X" #13

Closed
Changwanseo opened this issue May 8, 2023 · 1 comment
Labels
bug Something isn't working

Comments

@Changwanseo

  • Environment: UFCG 1.0.3, conda installed version, ubuntu 20.04
  • Steps that I've done:
  1. Prepare input file
    /tmp
    GCA_024703775.1.fna
    .... (many other genome fna files)

  2. Generate UCG profiles with "profile"
    ufcg profile -i ./tmp -o ./101_ufcg_profile -t 32 -v --developer --force 1

  3. Align with ufcg
    ufcg align -i ./101_ufcg_profile -o ./201_ufcg_align_nuc -l acc -a nucleotide -n align -l strain -t 24

  • Problems
    The resulting alignment contains the character "X", which causes many other programs (such as MAFFT without an explicit sequence type, or Geneious) to treat the alignment file as a protein alignment. It would be better to write "N" instead of "X" when the "-a nucleotide" flag is given.
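
Until a fixed release is available, one possible workaround is to post-process the alignment. The sketch below is not part of UFCG; `mask_invalid_bases` is a hypothetical helper that assumes the alignment is FASTA text whose header lines start with ">" and that every "X" in a sequence line should become "N".

```python
def mask_invalid_bases(fasta_text: str) -> str:
    """Replace X/x with N/n in sequence lines; header lines are left untouched."""
    # Assumption: the input is a nucleotide alignment, so X never carries meaning.
    table = str.maketrans({"X": "N", "x": "n"})
    cleaned = []
    for line in fasta_text.splitlines():
        if line.startswith(">"):
            cleaned.append(line)  # keep labels such as ">geneX" as they are
        else:
            cleaned.append(line.translate(table))
    return "\n".join(cleaned)
```

Gap characters and all other bases pass through unchanged, so the alignment columns stay intact.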
endixk added the bug label May 9, 2023
@endixk
Member

endixk commented May 9, 2023

The current version has a flawed nucleotide sequence extraction process in which all lowercase bases from the reverse strand are converted into the letter X.

Commit fe699d1 will not only fix this issue, but also improve the speed of the reverse complement sequence computation in general. I will soon make a new release including this update.
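
For illustration only, here is a minimal Python sketch of a case-preserving reverse complement. It is not the code in commit fe699d1 (UFCG itself is not written in Python); the function name and the choice of a single translation table are assumptions made for this example.

```python
# Translation table covering both uppercase and lowercase bases, so lowercase
# (for example soft-masked) bases are complemented instead of being replaced.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of seq, preserving letter case."""
    # translate() complements every base in one pass; the slice reverses it.
    # Characters outside the table (gaps, other IUPAC codes) pass through as-is.
    return seq.translate(_COMPLEMENT)[::-1]
```

Using one table lookup per base avoids per-character branching, which is one common way to keep reverse complement computation fast.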
