-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue during htseq_cnt job: [E::sam_parse1] hex field does not have an even number of digits #3
Comments
Hi Volker, Thank you very much for your interest in bakR! I will address your minor comments first. In regards to the error you are running into, it is not one I have personally every seen but I'll do my best to try and help you diagnose the problem.
In regards to your HTSeq error:
|
Thank you very much for the super fast response and for willing to help me troubleshoot, much appreciated! Concerning all the minor points: makes sense, thanks for the explanation and
I have run both commands on one of the bam files, please find the output here:
I have pasted the command in this text file, I hope the most relevant parameters are understandable:
I had to subsample 0.5% of the reads to pass the size limit, please find the bam file here: Once again: thanks for your support |
No problem, I'm sorry that you have to deal with errors in someone else's tool, I know how annoying that can be. Thank you for providing all of the requested files, nothing looks out of the ordinary so I will try running your downsampled bam file. Can you also send me the content of your |
Sure, here you go (just renamed to txt since GitHub would not allow otherwise): |
Thank you, I was able to reproduce, diagnose, and (I think) fix the error. The fix is now implemented on the main branch, though before running it you will have to add one parameter to your config file: The problem is that HTseq is misprocessing some of the non-standard tags in your bam files, namely the jI and jM tags. I guess this is a common problem, as the STAR manual states: "jM jI attributes require samtools 0.1.18 or later, and were reported to be incompatible with some downstream tools such as Cufflinks." It might be possible for me to edit the HTSeq python script to get around this issue, but for now I just wrote a python script to remove those tags from bam files, and added the tag removal step as an optional step that can be included by setting the new config parameter |
Hi Isaac, quick feedback: using the I am currently looking at the output from bakR and although I probably need to read you mansucript more carefully, it looks really promising. Along those lines: I very much appreciate your efforts to a) create a tool like that and b) for the extensive documentation. I know how much pain that can be, but except for this "tag" problem, everything else was perfect and very nicely documented. |
Hi Isaac,
first of all: thanks for providing your cool bakR package, I was just looking for such an approach when I saw your publication in RNA and promptly wanted to check it out.
Disclaimer: I am by far not an expert in bioinformatics or snakemake, so please forgive me if some questions/comments are "stupid".
I have a few minor comments and one major problem right now.:
Consequently, samtools sort stops and no final "results/htseq/{sample}_tl.bam" file is produced. Do you have any idea what the problem could be?
The log file for this job is attached below.
Thanks in advance and best wishes,
Volker
Nter_24h_IAA_4SU_2.log
The text was updated successfully, but these errors were encountered: