Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Import SV script fix - header label corruption #1025

Merged
merged 1 commit into from
Aug 4, 2023

Conversation

nakib103
Copy link
Contributor

@nakib103 nakib103 commented Aug 3, 2023

The header parser function get_header_info was doing greedy regex match until : found for getting label. It results in corrupted label if the header line includes a second (or third ...) :. For example in nstd207 study the $study_desc is coming from -

# Study_description: Long-read and strand-specific sequencing technologies together facilitate the de novo assembly of high-quality haplotype-resolved human genomes without parent-child trio data. We present 64 assembled haplotypes from 32 diverse human genomes. These highly contiguous haplotype assemblies (average contig N50: 26 Mbp) ...

We should do a lazy match instead.

@dglemos dglemos merged commit 4a7358d into Ensembl:postreleasefix/111 Aug 4, 2023
1 check passed
@dglemos
Copy link
Contributor

dglemos commented Aug 4, 2023

Merged into release/111 and main

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
2 participants