-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Vicaller.pl 1164 line is strange. #2
Comments
Thanks for running the VIcaller_v1.1.
With those changes, you should be able to run the validation function now. Please let me know if you found any other bugs. Thanks. |
I appreciate your help. I download revised scripts and running validate function and calculate function. My data is targeted sequencing data. validate cmd: nohup perl /data/program/VIcaller_v1.1/VIcaller.pl validate -i 11FCD -c /data/program/VIcaller_v1.1/VIcaller.config -t 4 -S 11FCD_14_75126278_75126928_abelson_murine_leukemia_virus_61487 -G 61487 -V murine_leukemia_virus > validate.out calculate cmd: nohup perl /data/program/VIcaller_v1.1/VIcaller.pl calculate -i 11FCD -c /data/program/VIcaller_v1.1/VIcaller.config -t 4 -F .fastq.gz -C 14 -P 75126936 -B 2 -N 147 > cal.out |
A small correction was made for the VIcaller.pl script, can you try the latest one. May I ask if you enriched specific viruses for target sequencing? What is the read length? |
I'm sorry that the answer was delayed. Anyway, why does it change to 1 even if I put another number in the c parameter? cmd: nohup perl /data/program/VIcaller_v1.1/VIcaller.pl calculate -i PYH024 -t 4 -F _s_h.bam -C 4 -I -P 112387642 -B 2 -N 96 > cal.out |
If the input file is FASTQ format, you can use "" for -F parameter. Or you can change to "-F _h.bam" if you have the PYH024_h.bam file indexed. For the chromosome ID issue, because the "chr" for the chromosome IDs are deleted in the output file in the current VIcaller version, thus you can add the "chr" in the script to extract corresponding reads from the BAM file for now. I will update the VIcaller.pl in the next version to support any chromosome ID format soon. |
My previous question is even if i take parameter -C 4, VIcaller receive chr1. Sorry but i have another question. Please let me know how to modify viral genome library in detail. |
Sorry for my late reply. Really appreciate for your feedback, i have correct the bug in the VIcaller main script. For your second question: virus_list file: only have two column, 1) GI #, and 2) virus_name without space; As an example of HBV, you can modify the corresponding columns, and you should be able to find all the info from the NCBI NT database using the GI: 110225259$AB246317.1$3185$Hepatitis B virus; Viruses; Retro-transcribing viruses; Hepadnaviridae; Orthohepadnavirus.$Hepatitis B virus DNA, complete genome, isolate: TA112.$Hepatitis B virus$TA112$0$Hepatitis B virus$hepatitis_b_virus$0 |
thank you, but, I have another question, so, I did not use parameter -r. but, I'm worried that this might be big difference. |
For the repeatmasker, you need to apply for Rebase database and then use it with repeatmasker follow their online manuscript. You can show some example of TRF output, I can check it out for you. I would suggest you to disable Repeatmasker function in the script, and then only use TRF and DUST to remove repeat regions. I would be happy to help you with it if you want. |
Hi, I have three issue.
I find a strange line at Vicaller.pl
![image](https://user-images.githubusercontent.com/49423291/55931847-0e6bb180-5c62-11e9-842a-22defff44deb.png)
$virus_genome >$ {directory}Database/GI/${GI}.fa");
I don't know perl well but line 1164 seems to something wrong.
system ("perl ${directory}Scripts/Extract_fasta.pl $GI
and file name is incorrect: "Scripts/Extract_read_informationpl "--> "Scripts/Extract_read_information.pl"
Are the two changes correct?
I have trouble running validate function.
I get the following warning:
Warning: [blastn] Query is Empty!
Warning: [blastn] Query is Empty!
Warning: [blastn] Query is Empty!
but my config and blastn probably have no problem.
so, I take a look at Vicaller.pl
blastn run using query "${input_sampleID2}_aligned_both.fas"
That means "${input_sampleID2}_aligned_both.fas" is not created.
${input_sampleID2}_aligned_both.fas is made this function.
validate function
sub v_obtain_seq {
system ("perl ${directory}Scripts/Extract_specific_loci_final_reads.pl ${input_sampleID}_f2 ${input_sampleID2}.virus_f2 >${input_sampleID2}.information");
my $cmd=q(awk '{print$8}');
system ("$cmd ${input_sampleID2}.information |sort |uniq >${input_sampleID2}.id");
system ("perl ${directory}Scripts/Extract_fastq.pl -f ${input_sampleID}_1.1fuq -b ${input_sampleID2}.id -o ${input_sampleID2}_aligned_1.fuq");
system ("perl ${directory}Scripts/Extract_fastq.pl -f ${input_sampleID}_2.1fuq -b ${input_sampleID2}.id -o ${input_sampleID2}_aligned_2.fuq");
system ("perl ${directory}Scripts/Extract_fuq_split.pl -f ${input_sampleID}_1sf.fuq -b ${input_sampleID2}.id -o ${input_sampleID2}_aligned_sf.fuq");
system ("perl ${directory}Scripts/Convert_fastq_to_fasta.pl ${input_sampleID2}");
system ("perl ${directory}Scripts/Convert_fastq_to_fasta_for_split_read.pl ${input_sampleID2}");
system ("perl ${directory}Scripts/Convert_fastq_to_fasta.pl ${input_sampleID2}");
system ("perl ${directory}Scripts/Convert_fastq_to_fasta_for_split_read.pl $input_sampleID2");
system ("cat ${input_sampleID2}_aligned.fas ${input_sampleID2}_aligned_sf.fas >${input_sampleID2}_aligned_both.fas");
}
In conclusion, i wonder ${input_sampleID2}.virus_f2 is a previously generated file.
Or is there something else wrong?
Should not the full name in the output file 'Candidate_virus' column is possible?
The text was updated successfully, but these errors were encountered: