-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error when checking input: wrong input shape #8
Comments
Hi Valentina, |
Dear Hardip, Thanks a lot for the quick reply! I attached the log file: it is the complete stderr/stdout of the executed commend, I only truncated some file paths and changed the sample ID. Unfortunately, I cannot share the FASTQ file as this is an unpublished dataset and we are not allowed to share that data yet. Attachment: pacific.log Best, |
HI Valentina However, it feels like for one of the sequences, instead of 142 items, it is receiving only 1 mer. For further troubleshooting, could you please isolate the reads in question.
Could you please check if the new Cheers |
Hi Hardip, I created a FASTQ file as suggested and looked at some sequences and also computed the min/max sequence length: awk 'NR%4==2' debug.fq | sort -R | head -n 1000 | less
awk 'NR%4==2' debug.fq | awk '{ print length($0) }' | sort -n | head -n 1 # 40
awk 'NR%4==2' debug.fq | awk '{ print length($0) }' | sort -n | tail -n 1 # 150 I did not see anything suspicious. So, I ran the tool on the read subset using different values for
For awk 'NR%4==2' debug.fq | head -n 30 | tail -n 20
Maybe the issue is with how the chunks are being created and processed? Best, Edit: typos |
Hi Valentina, |
Hi Valentina, thanks for using our software. Indeed there was a bug that originated when all reads in a chunk were discarded. I have fixed the bug so you may want to download PACIFIC.py and give it a go. I have also noticed that your reads have very variable lengths. So far we are not predicting the origin of reads smaller than 150bp, just for you to know. |
Dear @pabloacera, Thank you for the quick fix! I will try the updated version today. Regarding the length of the reads: yes, they have different length because they have been preprocessed (quality and adapter trimming). I did not know about the minimal length constraint - thank you for pointing that out! |
@pabloacera : Thank you for the fix. Is it possible to extend reads with Ns up to 150bp and run the prediction? This can be done at the user end or within PACIFIC perhaps. |
Hi, that is possible to do but we will have to see how that affect the predictions of the model. |
I can confirm that the bug has been fixed - the test sample runs through. Thanks again for fixing the issue so quickly! |
Hello,
First of all, thanks for the great tool!
Unfortunately, I have encountered an issue when trying to run
PACIFIC
on one of our samples:Error message:
Command line:
The input FASTQ file contains paired reads and single-end reads (all three FASTQ files were just concatenated).
Version:
075fb55
conda
env. YAML:Thanks in advance!
Best,
Valentina
The text was updated successfully, but these errors were encountered: