-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error in read.table #2
Comments
Ok, I think I found what is the error. In readme for the gtf file you ask the following format: #gtf file: but actually in example folder for gtf file you have this format: #bin name chrom strand txStart txEnd cdsStart cdsEnd exonCount exonStarts exonEnds score name2 cdsStartStat cdsEndStat exonFrames and the code is written for the the second format, so the classical gtf file does not suit it. Am I missing something, may be I should gtf download from somewhere else? Thank you in advance! Dasha |
I experience the same error. @dashainparis did you had any success trying to run the program with a differently formatted gtf file as you suggested above? UPDATE: I've tried to run ExonDel with a differently formatted .gtf file and I still get the same error. |
Well, actually I did not succeeded to get to the end, but I had different type of error and I guess it is in my bam files. I am not sure where can be downloaded exactly this format, but thought the uscs you can get this: #name chrom strand txStart txEnd cdsStart cdsEnd exonCount exonStarts exonEnds proteinID alignID I added the first column #bin just manually something like 0 everywhere (I've checked it is not used in the code). |
I've run the program with a gtf file formatted the same as in the example and I get the same error. https://www.dropbox.com/s/59n4q8evtqwaicz/refGene.hg19.24feb2015ucsc.txt?dl=0 |
Thanks! Then I actually do not know, because it worked for me. But perl code is well written and very readable (well as it could be for perl), so good luck! |
@dashainparis Did you manage to get it working? |
Hello Dasha and Koen, You are right. The format of gft file in README is not correct. It was For the gtf files, You can download it from UCSC ( To Koen: Thank you! Best, 2015-03-25 9:32 GMT-05:00 Dasha notifications@github.com:
|
Dear Shilin, I've checked the files you mentioned. "genesPassQCwithGC.bed" is empty and "genesPassQCwithGC.bed.depth" contains the following: DNA03-03099-BRCA1 These are the sample names in my case. Koen |
Dear Shilin, Thank you for the answer. I actually managed to run the tool, but I had several obstacles on my way. First is due the chromosomes names. Actually if you work with any human data you will have chr in the chromosome names, and at the same time you always have just number in other files, so easier you manually change all you data (which is a lot, and also it ruins other tools workflow), easier you change the code. And second thing which took me a while to catch. Is that in bamList it is very strict according to the number of tubs. So I had two and it did not work at all. But after all tool seems to work really good with my data, thank you for it! Dasha |
Dear Shilin, Do you have an estimation how much time it will take to fix ExonDel? Kind regards, Koen Hendriks |
Hello Dasha, For the first thing, do you mean I can change the code so that it can be Please let me know if you have other questions. Best, 2015-03-30 3:43 GMT-05:00 Dasha notifications@github.com:
|
Hello Koen, What about the genesPassQC.bed and covered_percentage file? Can you send me Best, 2015-04-02 3:36 GMT-05:00 Koen Hendriks notifications@github.com:
|
Hello Shilin, I've run ExonDel again and this is in my covered_percentage file: #bin name om strand txStart txEnd cdsStart cdsEnd exonCount exonStarts exonEnds score name2 cdsStartStat cdsEndStat exonFrames coveredBP coveredExon genesPassQC.bed contains the following± 14 52456227 52456418 C14orf166 NM_016039 If you need other information or more data, please don't hesitate to contact me. Bests, Koen |
Hello Koen, I checked your results and found that: the "genesPassQC.bed" file was right I've updated ExonDel to version 1.01 which can handle different Please let me know if you have any other questions. Thank you! Best, 2015-04-07 4:06 GMT-05:00 Koen Hendriks notifications@github.com:
|
Hi Shilin, Today I've tried running ExonDel v1.04. The ExonDel.rLog file contains the following: Error in read.table(file = file, header = header, sep = sep, quote = quote, : Hopefully you can shed some light on this. Bests, Koen |
Hello Koen, I think we can fix the "Error in read.table ~~" problem with the following
Thank you! Best, 2015-05-28 5:53 GMT-05:00 Koen Hendriks notifications@github.com:
|
Hi Shilin, Here's some info to help you solve it.
DNA03-03099-BRCA1_test genesPassWCwithGC.bed is empty. The problem might be in my gtf file. Bests, Koen |
Hi Koen, Yes. I think it is related to the format of gtf file. Would you please The .gtf and .bed files could be downloaded at UCSC table browser #gtf file: I've also updated ExonDel to 1.05 to add a warning message if gtf file with Best, 2015-05-29 7:12 GMT-05:00 Koen Hendriks notifications@github.com:
|
Hi Shilin, Thanks for the quick reply. I've run v1.05 and checked the gtf file and it seems mine is correct. Unfortunately, the read.table error still persists. Koen |
Hi Koen, So the "genesPassQCwithGC.bed" file is still empty? What about the Best, 2015-06-03 6:28 GMT-05:00 Koen Hendriks notifications@github.com:
|
Hi Shilin, Yes, the "genesPassQCwithGC.bed" file is still empty, as is the "genesPassQC.bed" file. [Tue Jun 9 10:15:03 2015] All genes will be used Best regards, Koen |
Thanks Koen. Best, 2015-06-09 3:44 GMT-05:00 Koen Hendriks notifications@github.com:
|
Hi Shilin, No, the covered percentage file contains the following± #bin name chrom strand txStart txEnd cdsStart cdsEnd exonCount exonStarts exonEnds score name2 cdsStartStat cdsEndStat exonFrames c My bed file looks like this: chr17 41197659 41197893 BRCA1_Ex23 And my GTF file starts with this: #bin name chrom strand txStart txEnd cdsStart cdsEnd exonCount exonStarts exonEnds score name2 cdsStartStat cdsEndStat exonFrames Best, Koen . |
Hello Koen, I think here is the issue: Here overall_exon_count_threshold=1 means only when all exons for one gene I will update it in a newer version.Thank you! Best, 2015-06-10 6:07 GMT-05:00 Koen Hendriks notifications@github.com:
|
Hello that it is me again! Thank you for the previous answer!
I have another question, I ran ExomeDel on my data and keep getting this error:
Error in read.table(file = file, header = header, sep = sep, quote = quote, :
no lines available in input
Calls: read.bigFile -> read.delim -> read.table
Execution halted
I definitely have a file to read it, I really do not get what am I doing wrong.
Thank you in advance!
Dasha
The text was updated successfully, but these errors were encountered: