different reference genome #13

Open
wnddl111 opened this issue Apr 25, 2023 · 5 comments
Labels
bug Something isn't working

Comments

@wnddl111

Hello,

My reference genome is Ensembl v41 hg38, but FIREVAT uses UCSC hg38.

So I renamed the chromosomes in my VCF file to match the UCSC naming, for example GL383518.1 -> chr1_GL383518v1_alt. The corresponding row of my lookup table (UCSC name, length, accession) is:

chr1_GL383518v1_alt 182,439 GL383518.1

However, I still get the error below and don't know how to resolve it.

My code:
results <- RunFIREVAT(vcf.file = sample.vcf.file,
                      vcf.file.genome = 'hg38', # for mouse variants: 'mm10'
                      config.file = mutect2.config.file,
                      df.ref.mut.sigs = GetPCAWGMutSigs(),
                      target.mut.sigs = GetPCAWGMutSigsNames(),
                      sequencing.artifact.mut.sigs = PCAWG.All.Sequencing.Artifact.Signatures,
                      output.dir = output.dir,
                      objective.fn = Default.Obj.Fn,
                      num.cores = 2,
                      ga.pop.size = 100,
                      ga.max.iter = 5,
                      ga.run = 5,
                      perform.strand.bias.analysis = TRUE,
                      ref.forward.strand.var = "TumorDPRefForward",
                      ref.reverse.strand.var = "TumorDPRefReverse",
                      alt.forward.strand.var = "TumorDPAltForward",
                      alt.reverse.strand.var = "TumorDPAltReverse",
                      annotate = FALSE)

error:
Error in .getOneSeqFromBSgenomeMultipleSequences(x, names[i], start[i], :
sequence chr not found
In addition: Warning messages:
1: In file.remove(paste0(output.dir, existing.firevat.optimization.log.tsv.file)) :
cannot remove file 'C:/Users/User/Desktop/', reason 'Permission denied'
2: In scan(text = x, what = "character", quiet = TRUE, sep = split.char) :
EOF within quoted string
3: In scan(text = x, what = "character", quiet = TRUE, sep = split.char) :
EOF within quoted string
4: In rbind(c(ID = "FAIL", Description = "Fail the site if all alleles fail but for different reasons." :
number of columns of result is not a multiple of vector length (arg 1)
5: In rbind(c(ID = "AD", Number = "R", Type = "Integer", Description = "Allelic depths for the ref and alt alleles in the order listed" :
number of columns of result is not a multiple of vector length (arg 1)
6: In rbind(c(ID = "AS_FilterStatus", Number = "A", Type = "String", :
number of columns of result is not a multiple of vector length (arg 2)
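
For readers doing the same renaming, here is a minimal base-R sketch of applying an accession-to-UCSC lookup table to the CHROM column of a VCF. The helper name `rename_vcf_contigs` and the `name_map` vector are hypothetical and not part of FIREVAT; the only mapping taken from this thread is GL383518.1 -> chr1_GL383518v1_alt. The sketch leaves header lines (including any `##contig` entries) untouched, so those may need separate handling.

```r
# Hypothetical helper (not part of FIREVAT): rename the CHROM field of VCF
# body lines using a named character vector (old name -> new name).
rename_vcf_contigs <- function(vcf_lines, name_map) {
  is_header <- startsWith(vcf_lines, "#")
  body <- vcf_lines[!is_header]

  chrom <- sub("\t.*$", "", body)    # text before the first tab
  rest  <- sub("^[^\t]*", "", body)  # first tab and everything after it

  # Only rename contigs present in the lookup table; leave others as-is.
  known <- chrom %in% names(name_map)
  chrom[known] <- unname(name_map[chrom[known]])

  vcf_lines[!is_header] <- paste0(chrom, rest)
  vcf_lines
}

# Usage with the single mapping mentioned in this issue.
name_map <- c("GL383518.1" = "chr1_GL383518v1_alt")
example_lines <- c("##fileformat=VCFv4.2",
                   "#CHROM\tPOS\tID\tREF\tALT",
                   "GL383518.1\t100\t.\tA\tT",
                   "chr1\t200\t.\tG\tC")
renamed <- rename_vcf_contigs(example_lines, name_map)
```

Contigs that are absent from the lookup table pass through unchanged, which makes it easy to spot leftovers afterwards.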

@khb7840
Collaborator

khb7840 commented Apr 25, 2023

If the chromosome names in the VCF and in the BSgenome match, try running RunFIREVAT with check.chromosome.name = FALSE.
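
One way to confirm that the names really do match before disabling the check is to list every contig in the VCF that is missing from the reference. The following is a sketch only: `find_unmatched_contigs` is a hypothetical helper, and `ref_names` here is a small stand-in vector. In practice it could be filled from the BSgenome object, for example with `seqnames()` on the loaded hg38 BSgenome, which is an assumption about your setup rather than something shown in this thread.

```r
# Hypothetical helper (not part of FIREVAT): report VCF contig names that
# do not appear in a reference set of sequence names.
find_unmatched_contigs <- function(vcf_lines, ref_names) {
  body <- vcf_lines[!startsWith(vcf_lines, "#")]
  chroms <- unique(sub("\t.*$", "", body))  # CHROM is the first tab-separated field
  setdiff(chroms, ref_names)
}

# Stand-in reference names; assumed, for illustration only.
ref_names <- c("chr1", "chr2", "chr1_GL383518v1_alt")
vcf_lines <- c("#CHROM\tPOS\tID\tREF\tALT",
               "chr1\t100\t.\tA\tT",
               "GL383518.1\t200\t.\tG\tC",
               "chr1_GL383518v1_alt\t300\t.\tC\tG")
unmatched <- find_unmatched_contigs(vcf_lines, ref_names)
```

An empty result means every contig in the VCF body has a counterpart in the reference name set.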

@wnddl111
Author

wnddl111 commented Apr 26, 2023

@khb7840
Thank you very much for your reply. I ran FIREVAT again after matching the chromosome names, but a new error occurred in the strand bias analysis step.

[1] "INFO [2023-04-26 10:38:42] Step 02-4. Filter VCF based on optmized filter parameters."
[1] "INFO [2023-04-26 10:38:42] * Before applying filter: 6662 rows in VCF object"
[1] "INFO [2023-04-26 10:38:42] * After applying filter: "
[1] "INFO [2023-04-26 10:38:42] ** 6532 rows in vcf.data.filtered VCF object"
[1] "INFO [2023-04-26 10:38:42] ** 130 rows in vcf.data.artifact VCF object"
[1] "INFO [2023-04-26 10:38:42] Step 03. Additional analysis."
[1] "INFO [2023-04-26 10:38:42] Step 03-1. Perform strand bias analysis [firevat_strand_bias::PerformStrandBiasAnalysis]"
Error in fisher.test(test.mat) :
All entries in "x" must be nonnegative and finite
In addition: Warning messages:
1: In file.remove(paste0(output.dir, existing.firevat.optimization.log.tsv.file)) :
cannot remove file 'C:/Users/User/Desktop/firevat', reason 'Permission denied'
2: In scan(text = x, what = "character", quiet = TRUE, sep = split.char) :
EOF within quoted string
3: In scan(text = x, what = "character", quiet = TRUE, sep = split.char) :
EOF within quoted string
4: In rbind(c(ID = "FAIL", Description = "Fail the site if all alleles fail but for different reasons." :
number of columns of result is not a multiple of vector length (arg 1)
5: In rbind(c(ID = "AD", Number = "R", Type = "Integer", Description = "Allelic depths for the ref and alt alleles in the order listed" :
number of columns of result is not a multiple of vector length (arg 1)
6: In rbind(c(ID = "AS_FilterStatus", Number = "A", Type = "String", :
number of columns of result is not a multiple of vector length (arg 2)

@khb7840
Collaborator

khb7840 commented Apr 26, 2023

Strand bias analysis requires strand-specific read counts supporting each allele in the input.
If these counts are present in the input, they should be read correctly through the config file.
Otherwise, you can skip strand bias analysis by passing perform.strand.bias.analysis = FALSE and filter.by.strand.bias.analysis = FALSE to RunFIREVAT.
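
To illustrate why missing counts trigger this error: `fisher.test` rejects any table that contains NA or negative entries. The sketch below is a hypothetical guard, not FIREVAT's implementation. It builds the 2x2 table from the four strand counts and returns NA instead of failing when a count is missing or invalid. The argument names are assumptions chosen to mirror the forward/reverse, ref/alt variables in the call above.

```r
# Hypothetical helper (not part of FIREVAT): Fisher's exact test on strand
# counts, returning NA_real_ when any count is missing, negative or non-numeric.
safe_strand_fisher <- function(ref_fwd, ref_rev, alt_fwd, alt_rev) {
  counts <- c(ref_fwd, ref_rev, alt_fwd, alt_rev)
  if (length(counts) != 4 || anyNA(counts) ||
      any(!is.finite(counts)) || any(counts < 0)) {
    return(NA_real_)
  }
  test_mat <- matrix(counts, nrow = 2, byrow = TRUE,
                     dimnames = list(c("ref", "alt"), c("forward", "reverse")))
  fisher.test(test_mat)$p.value
}

p_ok      <- safe_strand_fisher(10, 12, 9, 11)  # valid counts give a p-value
p_missing <- safe_strand_fisher(10, NA, 9, 11)  # a missing count gives NA
```

If many variants come back as NA, that suggests the strand-specific fields are absent from the VCF or not mapped in the config file.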

@wnddl111
Author

Thank you very much for your reply. I ran FIREVAT again after setting perform.strand.bias.analysis = FALSE, but a new error occurred in the plotting step of the report.

[1] "INFO [2023-04-27 01:04:36] Step 05. Generate FIREVAT report"
[1] "INFO [2023-04-27 01:04:36] * Started generating FIREVAT report"
[1] "INFO [2023-04-27 01:04:36] ** Started plotting optimization iterations"
Error in grid.Call(C_textBounds, as.graphicsAnnot(x$label), x$x, x$y, :
polygon edge not found

Thank you so much for your quick reply!!!

@khb7840
Collaborator

khb7840 commented Aug 30, 2023

I'm not sure, but this may be a font-related problem (Stack Overflow link).

khb7840 added the bug label Aug 30, 2023