Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix flat file genotype/population calculation #1141

Merged
merged 11 commits into from
Feb 4, 2025

Conversation

nakib103
Copy link
Contributor

@nakib103 nakib103 commented Jan 2, 2025

  • utils.nf
    Have a common place for all EVA related function. As we were replicating them in multiple script it was getting cumbersome to update.

    • sub get_vcf_content_types:
      • Improve checking of population type - query database to check sub is_freq_from_gts is set.
      • Fix checking of genotype type - sub genotype_samples_exists was missing some cases where sample name had population prefix from database and not vcf_config.json.
  • Species list

    • (species_list.pl) Fixed missing genotype ticks for some database species.
    • (species_list_vcf.pl) Added population column for VCF species (before it was only genotypes).
    • (species_list_vcf.pl) Fixed count difference in the VCF species table (before they were missing).
  • Species detailed list (TBD)

    • Add species with population data (American mink and Nile tilapia).
    • Instead of having same count for population and genotype, irrespective of what data the VCF file have, we now have the count in appropriate column and leave the other column blank if applicable.
  • Sources list

    • Fix mismatch between "Species list" (top-right) and actual table of the species; by preventing adding hidden empty row -
    • Fix no color change in alternating rows in EVA species
    • For new version of EVA script now support adding info in "Species list" (top-right) and color change in table.

@nakib103 nakib103 marked this pull request as draft January 2, 2025 17:01
@nakib103 nakib103 marked this pull request as ready for review January 18, 2025 23:14
@dglemos dglemos added the e114 label Feb 3, 2025
Copy link
Contributor

@jamie-m-a jamie-m-a left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm going to assume this is an improvement as the current set up is messy and repetitive as you say.

One minor thing it would be nice to have it highlighted in the table (VCF species) when a species is new. So for example for Domestic yak, it would say 30 million, then the second column could say "NEW for eXXX", rather than just +30 million

Sorry sent my comment too soon! Can you create a ticket for adding the functionality above (for 115) and I'll go ahead an approve this - thanks!

Tagging @nakib103

@nakib103
Copy link
Contributor Author

nakib103 commented Feb 4, 2025

Hi @jamie-m-a , created the ticket ENSVAR-6689 with the requirement.

@jamie-m-a jamie-m-a merged commit 9b46540 into Ensembl:postreleasefix/114 Feb 4, 2025
1 check passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants