Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

R9.4 formatting issue #8

Closed
yk-tanigawa opened this issue Oct 18, 2016 · 7 comments
Closed

R9.4 formatting issue #8

yk-tanigawa opened this issue Oct 18, 2016 · 7 comments

Comments

@yk-tanigawa
Copy link
Collaborator

  • R9.4 is not fully supported by the following tools:
    • poretools
    • porekit
@yk-tanigawa
Copy link
Collaborator Author

One of the possible approach we can take is convert to fastq format and map to the reference (avoid handling nanopore data directly).

Or, we can spend time to understand how they changed their format (and fix tools).

@yk-tanigawa
Copy link
Collaborator Author

We decided to go for fastq approach 👍

@yk-tanigawa
Copy link
Collaborator Author

  • I have observed the data structures inside two fast5 files:
    • one file from our cDNA data
    • the other from sample file in poretools

@yk-tanigawa
Copy link
Collaborator Author

@yk-tanigawa
Copy link
Collaborator Author

Difference of data structure

FAST5 file from cDNA

screenshot 2016-10-19 22 46 22

FAST5 file what poretools expect to have

screenshot 2016-10-19 22 48 39

@yk-tanigawa
Copy link
Collaborator Author

what I found from this observation

  • FASTQ file is stored in FAST5 file as string
    • poretools (and other software) just retrieve this information by using h5py package
  • metadata is also stored in FAST5 file
    • poretools format:
      • Analysis/Basecall_1D_000/BaseCalled_template/Events etc.
    • cDNA format:
      • 'Analysis/Basecall_1D_000/Summary/basecall_1d_template
  • Our FAST5 file still contains Raw signal
    • It might be interesting to see what kind of output file we will have after feeding this file to a base caller

@yk-tanigawa
Copy link
Collaborator Author

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant