New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
reduced_genome.sh error on #get only sequences from FASTA file #31
Comments
Hi, 1)You are missing an underscore here - grep -xF -f - -B 1 ${genome}${enzyme}flanking_sequences${fl}unique_2.fa. It should be : grep -xF -f - -B 1 ${genome}${enzyme}flanking_sequences${fl}_unique_2.fa. Please try that and let me now.
|
I got the same problem when running reduced_genome.sh. Any suggestion? |
Do you have oligoMatch and bedtools installed? |
Hi, |
Okey... Now it generated a final file which is about 143 MB. By the way, I also edit the code at line 40 and 41 due to that the "sed" code in my MAC machine requires a backup file name for creating backup file. I changed it into: sed -i .bak 's/>//g' ${genome}_${enzyme}flanking_sites${fl}unique.bed |
Hi,
there seem to be 2 issues regarding the following step:
1st:
while the removing of the ">" lines, sorting and getting the unique files works and produces something like the following: (grep -v '^>' ${genome}_${enzyme}flanking_sequences${fl}_unique_2.fa | sort | uniq -i -u)
the following command ( grep -xF -f - -B 1 ${genome}_${enzyme}flanking_sequences${fl}_unique_2.fa ) produces a zero bytes file.
Could you please see if there is an error?
2nd:
why you remove the duplicates? this is a PCR based technique and it is expected for the file to be full of PCR duplicates.
Thank you for your answer in advance
The text was updated successfully, but these errors were encountered: