Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

2bit-encoded reference reader #31

Merged
merged 1 commit into from Mar 7, 2017
Merged

2bit-encoded reference reader #31

merged 1 commit into from Mar 7, 2017

Conversation

alumi
Copy link
Member

@alumi alumi commented Mar 3, 2017

Added a reader for 2bit reference file format.
Also added test files converted by faToTwoBit

.2bit is a compact randomly-accessible format of DNA sequences.

(with-open [r (cljam.twobit/reader "PATH/TO/2BIT")]
  (cljam.twobit/read-sequence r {:chr "chr1" :start 1 :end 1000}))
=> "ATGC..."

Maybe we should have some protocols or some abstractions for range-based querying (read-sequence) for FASTA and 2bit.

Copy link
Member

@totakke totakke left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@totakke totakke merged commit 14d4d9c into master Mar 7, 2017
totakke added a commit that referenced this pull request Mar 7, 2017
@totakke totakke deleted the feature/two-bit-reader branch March 7, 2017 03:01
@totakke
Copy link
Member

totakke commented Mar 7, 2017

Thanks!

Maybe we should have some protocols or some abstractions for range-based querying (read-sequence) for FASTA and 2bit.

I think so, too. SeqReader or RandomSeqReader looks good.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants