Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Duplex basecalling is impractically slow #98

Closed
hasindu2008 opened this issue Feb 15, 2023 · 5 comments
Closed

Duplex basecalling is impractically slow #98

hasindu2008 opened this issue Feb 15, 2023 · 5 comments

Comments

@hasindu2008
Copy link

The command I ran is: dorado duplex dna_r10.4.1_e8.2_400bps_sup@v4.0.0 reads/ --pairs pairs_from_bam_hg2/pair_ids_filtered.txt
Now 19 hours, but only 120,000 reads are done, on 4xTesla V100. For 1.2M pairs, it will be a few days. GPUs are consistently running at 80-95% utilisation in nvidia-smi. Is it normal to be this slow?

@vellamike
Copy link
Collaborator

vellamike commented Feb 16, 2023 via email

@hasindu2008
Copy link
Author

hasindu2008 commented Feb 17, 2023

Keeps getting this with the newest dorado

 /install/dorado-0.2.1/bin/dorado duplex /install/dorado-0.2.1/models/dna_r10.4.1_e8.2_400bps_sup@v4.1.0  reads/  --pairs pairs_from_bam_hg2/pair_ids_filtered.txt > sup_duplex_hg2_pod.sam
[2023-02-17 14:32:33.612] [info] > Loading pairs file
[2023-02-17 14:32:35.223] [info] > Pairs file loaded
[2023-02-17 14:33:00.545] [info] > Reads basecalled: 0
[2023-02-17 14:33:00.545] [info] > Bases/s: 0.000000e+00
[2023-02-17 14:33:01.040] [error] toml::parse: file open error -> /install/dorado-0.2.1/models/dna_r10.4.1_e8.2_4khz_stereo@v1.1/config.toml

@vellamike
Copy link
Collaborator

Hi @hasindu2008 , is the model dna_r10.4.1_e8.2_4khz_stereo@v1.1 in the directory /install/dorado-0.2.1/models ? It is necessary to download this model with dorado download --model all

@hasindu2008
Copy link
Author

Now the time for duplex is good with 0.2.1! Took only around 6 hours.

@vellamike
Copy link
Collaborator

That's great! We are working on further performance improvements so expect that 6 hours to go down further.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants