-
Notifications
You must be signed in to change notification settings - Fork 219
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
missing required fields from MAF: Hugo_Symbol #397
Comments
Hello, |
Hi, the sample file is attached. I left the header as well, for you to see the gatk commands I used to generate it. Thanks |
Hello, |
Hi, thanks. I will do that and let you know. Best |
Dear all,
Use Example codes below;
you can use this to merge multiple
Best of luck |
Okay found a workaround. |
@PoisonAlien Could you please elaborate how to fix this using data.table? I am having the same issue after annotating mutect2.vcf with the funcotator. I also tried suggestions from @sahuno which did not work in my hands. Thanks so much for making this tool available to the field. |
Hi @jungminchoilab , |
Hi @PoisonAlien, thanks for your prompt response. Please see below and let me know if you are having issues downloading a file. |
Thanks for sharing your file. It seems to work for me. What version of the package are you using? |
Thanks again for checking @PoisonAlien . The version that I installed is maftools_2.2.10. Which one am I supposed to use? Apparently, funcotator does not annotate tumor sample ID. I will re-annotate it properly. Is normal sample ID also required? |
Could you please install the tool from |
As suggested, I re-installed the tool from GitHub and the version is now maftools_2.3.10 . I also ran funcotator again and properly annotated a tumor_sample_barcode column (please see below). Unfortunately, I am re-producing the same error message and wonder what might have gone wrong.
I do appreciate your kind help @PoisonAlien . |
You dont have to run e.g; x = maftools::read.maf(maf = "~/Downloads/YUL0207P.combined.mutect2.strelka2.funcotated (1).maf") P.S; Maftools would be of little help in case of single sample, you would need two or more samples to take advantage of all the functions. |
@PoisonAlien hmm, same error... and yes, I have multiple cases for the analysis but wanted to troubleshoot with one sample first.
|
I can not think of any reasons why it should give an error. I works for me with the same file. > mymaf <-
data.table::fread(
file = "/Users/jungminchoi/mutect2/YUL0207P.combined.mutect2.strelka2.funcotated.maf",
sep = "\t",
stringsAsFactors = FALSE,
verbose = FALSE,
data.table = TRUE,
showProgress = TRUE,
header = TRUE,
fill = TRUE,
skip = "Hugo_Symbol",
quote = ""
)
> test.maf = maftools::read.maf(maf = mymaf) |
Thanks for your patience, @PoisonAlien ! Now getting another error...
|
No problem. I am sorry that you are getting these errors. |
Will do. Thanks so much again for solving this for me. |
i have problem running this code mymaf <-
data.table::fread(
file = "data_mutations.maf",
sep = "\t",
stringsAsFactors = FALSE,
verbose = FALSE,
data.table = TRUE,
showProgress = TRUE,
header = TRUE,
fill = TRUE,
skip = "Hugo_Symbol",
quote = ""
)
test.maf = maftools::read.maf(maf = mymaf)
#Get the sample information
sample_info = read.table(file="clinicaldata_tcga_bc_f.tsv", sep="\t", header=T)
laml_tcga = read.maf(maf = test.maf, clinicalData = sample_info )
####333################################
test.maf = maftools::read.maf(maf = mymaf)
-Validating
--Removed 5396 duplicated variants
-Silent variants: 51626
-Summarizing
--Possible FLAGS among top ten genes:
TTN
MUC16
SYNE1
-Processing clinical data
--Missing clinical data
-Finished in 3.300s elapsed (3.090s cpu)
> #Get the sample information
> sample_info = read.table(file="clinicaldata_tcga_bc_f.tsv", sep="\t", header=T)
laml_tcga = read.maf(maf = test.maf, clinicalData = sample_info )
-Reading Error in file.info(file) : invalid filename argument |
Hi, Note that |
Thank you. The code worked, but another issue has popped up. After running .....all things mymaf <-
data.table::fread(
file = "data_mutations.maf", sep = "\t",stringsAsFactors = FALSE,verbose = FALSE,data.table = TRUE,showProgress = TRUE,
header = TRUE,
fill = TRUE,
skip = "Hugo_Symbol",
quote = "")
##Get the sample information
sample_info = read.table(file="clinicaldata_tcga_bc_f.tsv", sep="\t", header=T)
laml_tcga = read.maf(maf = mymaf, clinicalData = sample_info,verbose = TRUE )
#################
issue ; Error in summarizeMaf(maf = nonSyn, anno = clinicalData, chatty = verbose) :
Tumor_Sample_Barcode column not found in provided clinical data. Rename column containing sample names to Tumor_Sample_Barcode if necessary.
###############
some people said It has an issue when (mymaf file) here can't recognize the nonsynonymous mutation . so added
#################
mymaf <-
data.table::fread(
file = "data_mutations.maf", sep = "\t",stringsAsFactors = FALSE,verbose = FALSE,data.table = TRUE,showProgress = TRUE,
header = TRUE,
fill = TRUE,
skip = "Hugo_Symbol",
quote = "",vc_nonSyn=c("Missense_Mutation", "Nonsense_Mutation"))
##### but
Error in data.table::fread(file = "data_mutations.maf", sep = "\t", stringsAsFactors = FALSE, :
unused argument (vc_nonSyn = c("Missense_Mutation", "Nonsense_Mutation"))
#########
The argument here remains unrecognized |
Hi, You could directly use my_maf <- "data_mutations.maf"
# You need to make sure that this tsv has a column with the column name Tumor_Sample_Barcode containing sample names
clinical_data <- "clinicaldata_tcga_bc_f.tsv"
maftools::fread(maf = my_maf, clinicalData = clinical_data, vc_nonSyn = c("Missense_Mutation", "Nonsense_Mutation")) You should only use |
Hi,
I have a gatk generated MAF file and I am trying to read it with read.maf. It fails at the validation stage with the following:
-Reading
-Validating
Error in validateMaf(maf = maf, isTCGA = isTCGA, rdup = removeDuplicatedVariants, :
missing required fields from MAF: Hugo_Symbol
Do you have any suggestions ?
Thanks
The text was updated successfully, but these errors were encountered: