Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

SLURM submit error #67

Open
stubrownGal opened this issue Sep 27, 2018 · 12 comments
Open

SLURM submit error #67

stubrownGal opened this issue Sep 27, 2018 · 12 comments

Comments

@stubrownGal
Copy link

I am trying to run Masurca 3.2.8 on our new SLURM based cluster. I am assembling 40 Gb of Nanopore reads plus Illumina PE and Jump libraries. Masurca ran for a few hours and completed the read correction phase, but failed when trying to submit mega-read pass-1 jobs to the grid.

The STD_error message is below. I noticed that there is no specific grid information in the assemble.sh script, but it calls another script masurca/3.2.8/bin//mega_reads_assemble_cluster.sh which does contain grid commands that are wrong for our system (it is set up for SGE grid, not SLURM - see second section below). I did include some SLURM information in the config.txt.

How do I correctly set up Masurca for our SLURM based cluster?

[browns02@bigpurple-ln1 Masurca]$ ./assemble.sh
[Tue Sep 25 15:44:38 EDT 2018] Processing pe library reads
[Tue Sep 25 15:52:49 EDT 2018] Processing sj library reads
[Tue Sep 25 15:59:39 EDT 2018] Average PE read length 101
[Tue Sep 25 15:59:40 EDT 2018] Using kmer size of 67 for the graph
[Tue Sep 25 15:59:40 EDT 2018] MIN_Q_CHAR: 33
[Tue Sep 25 15:59:40 EDT 2018] Creating mer database for Quorum
[Tue Sep 25 16:22:05 EDT 2018] Error correct PE
[Tue Sep 25 16:48:36 EDT 2018] Error correct JUMP
[Tue Sep 25 17:07:41 EDT 2018] Estimating genome size
[Tue Sep 25 17:21:07 EDT 2018] Estimated genome size: 2360747251
[Tue Sep 25 17:21:07 EDT 2018] Creating k-unitigs with k=67
[Tue Sep 25 17:56:30 EDT 2018] Creating k-unitigs with k=31
[Tue Sep 25 18:47:29 EDT 2018] Filtering mate pairs
Assuming outtie orientation
Assuming outtie orientation
Chimeric/Redundant jump reads:
52378114 chimeric_sj.txt
38148468 redundant_sj.txt
90526582 total
[Tue Sep 25 23:57:14 EDT 2018] Creating FRG files
[Wed Sep 26 00:20:47 EDT 2018] Computing super reads from PE
[Wed Sep 26 01:43:09 EDT 2018] Using CABOG from /gpfs/share/apps/masurca/3.2.8/bin/../CA8/Linux-amd64/bin
[Wed Sep 26 01:43:09 EDT 2018] Running mega-reads correction/assembly
[Wed Sep 26 01:43:09 EDT 2018] Using mer size 15 for mapping, B=15, d=0.02
[Wed Sep 26 01:43:09 EDT 2018] Estimated Genome Size 2360747251
[Wed Sep 26 01:43:09 EDT 2018] Estimated Ploidy 1
[Wed Sep 26 01:43:09 EDT 2018] Using 32 threads
[Wed Sep 26 01:43:09 EDT 2018] Output prefix mr.41.15.15.0.02
[Wed Sep 26 01:43:09 EDT 2018] Using 25x of the longest ONT reads
[Wed Sep 26 01:46:29 EDT 2018] Reducing super-read k-mer size
[Wed Sep 26 02:21:04 EDT 2018] Mega-reads pass 1
[Wed Sep 26 02:21:04 EDT 2018] Running on the grid in 89 batches
[Wed Sep 26 02:29:01 EDT 2018] submitting SGE create_mega_reads jobs to the grid
[Wed Sep 26 02:29:01 EDT 2018] create_mega_reads failed on the grid
[Wed Sep 26 02:29:01 EDT 2018] mega-reads pass 1 on the grid failed or stopped, please re-run assemble.sh
[Wed Sep 26 02:29:01 EDT 2018] Assembly stopped or failed, see CA.mr.41.15.15.0.02.log

#this is the batch size for grid execution
PBATCH_SIZE=2000000000
GRID_ENGINE="SGE"
QUEUE=""
USE_SGE=0
PACBIO=""
NANOPORE=""
ONEPASS=0
GC=
RC=
NC=
if tty -s < /dev/fd/1 2> /dev/null; then
GC='\e[0;32m'
RC='\e[0;31m'
NC='\e[0m'
fi

@zrlewis
Copy link

zrlewis commented Oct 22, 2018

@stubrownGal Any luck getting this running on SLURM? I just ran into a similar problem.

@stubrownGal
Copy link
Author

stubrownGal commented Oct 22, 2018 via email

@zrlewis
Copy link

zrlewis commented Oct 23, 2018

@stubrownGal Okay. It looks like SLURM support might not be fully integrated yet, judging from the mega_reads_assemble_cluster.sh script. I'll try running it locally, setting USE_GRID=0

@zrlewis
Copy link

zrlewis commented Nov 20, 2018

@alekseyzimin Any status update for SLURM support in v 3.2.9? Running mega-reads locally on a large genome is very slow in my case.

@sunnycqcn
Copy link

sunnycqcn commented Nov 20, 2018 via email

@alekseyzimin
Copy link
Owner

alekseyzimin commented Nov 20, 2018 via email

@stubrownGal
Copy link
Author

stubrownGal commented Nov 22, 2018 via email

@alekseyzimin
Copy link
Owner

alekseyzimin commented Nov 26, 2018 via email

@zrlewis
Copy link

zrlewis commented Nov 27, 2018

@alekseyzimin Thanks for the update. Keep us posted on SLURM support. I'm limited to nodes with 20 CPUs, so I wouldn't be able to take the approach that @sunnycqcn did, unfortunately.

@alekseyzimin
Copy link
Owner

alekseyzimin commented Nov 29, 2018 via email

@zrlewis
Copy link

zrlewis commented Nov 29, 2018

@alekseyzimin Fantastic! I look forward to trying it out.

If I have already started a run, which has been having trouble getting past mega-reads, then do I need to restart or can I resume with the new version?

@alekseyzimin
Copy link
Owner

alekseyzimin commented Nov 29, 2018 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants