Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Need information related to example given in NextPolish Software under the folder test_data #106

Closed
DrNavi opened this issue Feb 15, 2023 · 4 comments

Comments

@DrNavi
Copy link

DrNavi commented Feb 15, 2023

Hi all,

I am trying to use NextPolish tool to polish my draft assembly. First I wanted to understand about the example data provided in the tool under the folder test_data. The folder contain these files:
~/NextPolish/test_data/hifi.fasta.gz
~/NextPolish/test_data/hifi.fofn
~/NextPolish/NextPolish/test_data/hifi.run.cfg
~/NextPolish/NextPolish/test_data/lgs.fofn
~/NextPolish/NextPolish/test_data/lreads.fasta.gz
~/NextPolish/NextPolish/test_data/raw.genome.fasta
~/NextPolish/NextPolish/test_data/run.cfg
~/NextPolish/NextPolish/test_data/sgs.fofn
~/NextPolish/NextPolish/test_data/sreads.R1.fastq.gz
~/NextPolish/NextPolish/test_data/sreads.R2.fastq.gz

Here, In this file list, I am confused what is raw.genome.fasta file, Is this is the reference genome of the target specie from any database like NCBI, Ensemble etc ? this raw.genome.fasta file is used in run.cfg. Here is the run.cfg file content....

[General]
job_type = local
job_prefix = nextPolish
task = default
rewrite = yes
deltmp = yes
rerun = 3
parallel_jobs = 2
multithread_jobs = 3
genome = ./raw.genome.fasta
genome_size = auto
workdir = ./01_rundir
polish_options = -p {multithread_jobs}

[sgs_option]
sgs_fofn = ./sgs.fofn
sgs_options = -max_depth 100

[lgs_option]
lgs_fofn = ./lgs.fofn
lgs_options = -min_read_len 5k -max_depth 100
lgs_minimap2_options = -x map-ont

Please, help me to understand this......

Another question is if I have only long read then do we only need to avoid/eliminate the sgs_option commands???

Looking forward to hearing from you

Regards,
Dr. Naveed

@moold
Copy link
Member

moold commented Feb 15, 2023

raw.genome.fasta is the assembly file to be polished.
for Another question is if I have only long read then do we only need to avoid/eliminate the sgs_option commands??? Yes.

@moold
Copy link
Member

moold commented Feb 15, 2023

@DrNavi
Copy link
Author

DrNavi commented Feb 15, 2023 via email

@moold
Copy link
Member

moold commented Feb 15, 2023

Yes, but you need to decompress KI.fasta.tar.gz first.

@moold moold closed this as completed Feb 16, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants