Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Does IGV support block bgzip compressed and indexed genomes? #321

Closed
sjackman opened this issue Oct 3, 2016 · 7 comments
Closed

Does IGV support block bgzip compressed and indexed genomes? #321

sjackman opened this issue Oct 3, 2016 · 7 comments

Comments

@sjackman
Copy link
Contributor

sjackman commented Oct 3, 2016

Hi, Jim. Does IGV support block gzip compressed and indexed genomes? For example

bgzip -i genome.fa
samtools faidx genome.fa.gz

Creates the files

  • genome.fa.gz Block gzip compressed FASTA file
  • genome.fa.gz.gzi Block gzip index file
  • genome.fa.gz.fai FASTA index
@jrobinso
Copy link
Contributor

jrobinso commented Oct 3, 2016

No it doesn't.

On Mon, Oct 3, 2016 at 11:22 AM, Shaun Jackman notifications@github.com
wrote:

Hi, Jim. Does IGV support block gzip compressed and indexed genomes? For
example

bgzip -i genome.fa
samtools faidx genome.fa.gz

Creates the files

  • genome.fa.gz: Block gzip compressed FASTA file
  • genome.fa.gz.gzi: Block gzip index file
  • genome.fa.gz.fai: FASTA index


You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
#321, or mute the thread
https://github.com/notifications/unsubscribe-auth/AA49HGFMxOmqgEw67xQNXOzAYGfgPyARks5qwUfkgaJpZM4KM4cQ
.

@sjackman sjackman changed the title Does IGV support block bgzip compressed and indexex genomes Does IGV support block bgzip compressed and indexed genomes? Oct 3, 2016
@sjackman
Copy link
Contributor Author

sjackman commented Oct 3, 2016

Thanks for the quick response, Jim. Feel free to close this issue as answered, or convert it to a feature request if it's a feature that you'd like to support.

@jrobinso
Copy link
Contributor

jrobinso commented Oct 3, 2016

I'll leave it open, I didn't realize samtools did that.

@jrobinso
Copy link
Contributor

@sjackman I'm trying to add support for this, but I can't find a description anywhere of the ".gzi" file. I've looked in all the usual places. Do you have a description of this file? Its essential, the .fai file is not granular enough.

@sjackman
Copy link
Contributor Author

sjackman commented Jun 22, 2017

I haven't found a spec. I believe bgzf_index_load_hfile is the code to read the .gzi file format:
https://github.com/samtools/htslib/blob/master/bgzf.c#L1760
It looks like the first 8 bytes (uint64_t) are the number of blocks, n, followed by n pairs of uint64_t offsets, the first offset is the offset into the uncompressed file, and the second offset is the corresponding offset into the compressed file.

jrobinso added a commit that referenced this issue Jun 24, 2017
Also optimizations and bug fix for cram reference support.
@sjackman
Copy link
Contributor Author

sjackman commented Jun 24, 2017

Cool! Thanks, Jim! I look forward to testing it out.

@jrobinso
Copy link
Contributor

It helps a lot with cram files, where the entire sequence for a chromosome is needed.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants