-
Notifications
You must be signed in to change notification settings - Fork 44
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error in rule AberrantSplicing_pipeline_Counting_00_define_datasets_from_anno_R #492
Comments
Hi Frank,
|
Hi Vicente, Thank you so much for your response. 1.The drop version is 1.3.3. Here are the command responses: $ mamba create -n drop3 -c conda-forge -c bioconda drop --override-channels Preparing transaction: done done To activate this environment, use
To deactivate an active environment, use
$drop demo Archive: /tmp/main.zip And then, I input command snakemake --cores 1 -n. As expected, this program does not work properly. So I tried to run the following modules directly. Surprisingly, both the subsequent Expression and Splicing could proceed normally, respectively. I wonder if you could tell me that whether this problem will affect the Expression or Splicing module in some way? On the original question: After my testing, I found why the problem appears. I deleted some contents in the Splicing module of the config file by mistake when I tried to run the Expression module. Now, it worked! The mistake deletion content: FRASER1 configuration
Many thanks! Frank |
Great that it worked! |
Dear DROP team,
Thank you for developing DROP. Recently, I raise an error while running snakemake aberrantSplicing with my own and external RNA-seq data:
rule AberrantSplicing_pipeline_Counting_00_define_datasets_from_anno_R:
input: /home/ngs/rnaseq/03align_out_5/drop_1/sampleAnnotation_s.tsv, Scripts/AberrantSplicing/pipeline/Counting/00_define_datasets_from_anno.R
output: /home/ngs/rnaseq/03align_out_5/drop_1/project3/Output/processed_data/aberrant_splicing/annotations/fraser.tsv, /home/ngs/rnaseq/03align_out_5/drop_1/project3/htmlOutput/AberrantSplicing/annotations/fraser.html
log: /home/ngs/rnaseq/03align_out_5/drop_1/.drop/tmp/AS/fraser/00_defineDataset.Rds
jobid: 11
reason: Missing output files: /home/ngs/rnaseq/03align_out_5/drop_1/project3/Output/processed_data/aberrant_splicing/annotations/fraser.tsv
wildcards: dataset=fraser
resources: tmpdir=/tmp
Config
projectTitle: "DROP: RNAseq"
root: /home/ngs/rnaseq/03align_out_5/drop_1/project3/Output # root directory of all output objects and tables
htmlOutputPath: /home/ngs/rnaseq/03align_out_5/drop_1/project3/htmlOutput # path for HTML rendered reports
indexWithFolderName: true # whether the root base name should be part of the index name
hpoFile: null # if null, downloads it from webserver
sampleAnnotation: /home/ngs/rnaseq/03align_out_5/drop_1/sampleAnnotation_s.tsv # path to sample annotation (see documentation on how to create it)
geneAnnotation:
v29: /home/ngs/rnaseq/03align_out_5/drop_1/gencode.v29.gtf
genomeAssembly: hg19
genome: /home/ngs/rnaseq/03align_out_5/drop_1/hg19_ucsc.fa # path to reference genome sequence in fasta format.
# You can define multiple reference genomes in yaml format, ncbi: path/to/ncbi, ucsc: path/to/ucsc
# the keywords that define the path should be in the GENOME column of the sample annotation table
random_seed: false # just for demo data, remove for analysis
exportCounts:
# specify which gene annotations to include and which
# groups to exclude when exporting counts
geneAnnotations:
- v29
excludeGroups:
- null
aberrantExpression:
run: fasle
groups:
- outrider
fpkmCutoff: 1
implementation: autoencoder
padjCutoff: 0.05
zScoreCutoff: 0
genesToTest: null
maxTestedDimensionProportion: 3
yieldSize: 2000000
aberrantSplicing:
run: true
groups:
- fraser
recount: false
longRead: false
keepNonStandardChrs: false
filter: true
minExpressionInOneSample: 20
quantileMinExpression: 10
minDeltaPsi: 0.05
implementation: PCA
padjCutoff: 0.1
maxTestedDimensionProportion: 6
mae:
run: false
groups:
- group1
- group2
- group3
gatkIgnoreHeaderCheck: true
padjCutoff: 0.05
allelicRatioCutoff: 0.8
addAF: true
maxAF: 0.001
maxVarFreqCohort: 0.05
# VCF-BAM matching
qcVcf: Data/qc_vcf_1000G.vcf.gz
qcGroups:
- mae
dnaRnaMatchCutoff: 0.85
rnaVariantCalling:
run: false
groups:
- batch_0
highQualityVCFs:
- Data/Mills_and_1000G_gold_standard.indels.hg19.sites.chrPrefix.vcf.gz
- Data/1000G_phase1.snps.high_confidence.hg19.sites.chrPrefix.vcf.gz
dbSNP: Data/00-All.vcf.gz
repeat_mask: Data/hg19_repeatMasker_sorted.chrPrefix.bed
createSingleVCF: true
addAF: true
maxAF: 0.001
maxVarFreqCohort: 0.05
hcArgs: ""
minAlt: 3
yieldSize: 100000
tools:
gatkCmd: gatk
bcftoolsCmd: bcftools
samtoolsCmd: samtools
Command: snakemake aberrantSplicing --cores 10
(94GB memory)
I wonder if you could provide a solution to this problem? Thank you!
Best regards,
Frank
2023-09-12.snakemake.log
sampleAnnotation_s.csv
The text was updated successfully, but these errors were encountered: