Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Nextflow pipeline to create GO/Phenotypes annotations for pangenomes #1070

Merged
merged 7 commits into from
Mar 5, 2024

Conversation

nuno-agostinho
Copy link
Contributor

ENSVAR-5823: GO plugin annotations for pangenomes
ENSVAR-4744: Phenotypes plugin annotations for pangenomes

Steps

This Nextflow pipeline performs the following steps:

  1. Download pangenomes data (FASTA and GTF files)
  2. Generate GO/Phenotypes plugin annotations for GRCh38
  3. Phenotypes only:
    • Filter Phenotypes plugin annotation for 'Genes'
    • Fetch lookup table between gene symbols and Ensembl identifiers from HGNC
  4. Create pangenomes annotation (one for each assembly and each plugin):
    • Get the GO/Phenotype info by comparing gene symbols between the GRCh38 and the pangenomes annotation
    • Append info to the pangenome-specific GTFs
  5. Test if plugin annotation works for each assembly using VEP

Testing

nextflow run -resume -profile lsf \
  ${ENSEMBL_ROOT_DIR}/ensembl-variation/nextflow/pangenomes \
  $(st1 details script) \
  --version 112

@nakib103 nakib103 self-requested a review February 19, 2024 09:34
@nakib103 nakib103 self-assigned this Feb 19, 2024
nextflow/pangenomes/modules/download.nf Outdated Show resolved Hide resolved
nextflow/pangenomes/modules/download.nf Show resolved Hide resolved
nextflow/pangenomes/modules/annotation.nf Outdated Show resolved Hide resolved
nextflow/pangenomes/nextflow.config Show resolved Hide resolved
nextflow/pangenomes/modules/test.nf Outdated Show resolved Hide resolved
nextflow/pangenomes/modules/download.nf Outdated Show resolved Hide resolved
Co-authored-by: Syed Nakib Hossain <snhossain@ebi.ac.uk>
Copy link
Contributor

@nakib103 nakib103 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!~
thanks Nuno!

@nakib103 nakib103 merged commit 07d4398 into Ensembl:postreleasefix/112 Mar 5, 2024
1 check passed
@nuno-agostinho nuno-agostinho deleted the add/pangenomes branch March 5, 2024 15:39
@nakib103
Copy link
Contributor

nakib103 commented Mar 5, 2024

merged to main and release/112

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants