polyploid-potato-assembly

Code for assembly approach presented in "Haplotype-resolved assembly of a tetraploid potato genome using long reads and low-depth offspring data"

1. Dosage estimation of the nodes in the hifiasm assembly graph:

run snakemake in the coverage-analysis directory. Requires minimap2 and samtools.

2. K-mer analysis:

Installation:

Note: Requires an installation of the jellyfish package. If not installed yet, you can install it via conda install jellyfish

git clone git@github.com:rebeccaserramari/polyploid-potato-assembly.git

cd polyploid-potato-assembly/kmer-counting

mkdir build; cd build; cmake ..; make

Running k-mer counting procedure

To run the full procedure, including finding unique k-mers in <targetfile>, counting the found unique k-mers in a set of sequences samples, and merging the resulting files:

run snakemake within the kmer-counting directory.

Make sure to update the config files accordingly!
2. To run the first step individually, i.e. find k-mers of length <len> that are uniquely present in <targetfile> and not in <comparisonfile>:

`./polyassembly_findkmers find_kmers -r <targetfile> -s <comparisonfile> -k <kmerfile> -l <len>`

The resulting k-mers are stored in <kmerfile>.

To run the second step individually, i.e. count the unique k-mers in <samplefile>:

/polyassembly_findkmers count_kmers -s <samplefile> -k <kmerfile> -c <output> -l <len>

3. Phased clustering

To run the full clustering procedure:

run snakemake in the cluster-phasing directory.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
clustering-phasing		clustering-phasing
coverage-analysis		coverage-analysis
kmer-counting		kmer-counting
LICENSE		LICENSE
README.md		README.md
filter_and_replace.py		filter_and_replace.py
merge_countfiles_allnodes.py		merge_countfiles_allnodes.py
sum_node_cov_5kb.py~		sum_node_cov_5kb.py~

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

polyploid-potato-assembly

1. Dosage estimation of the nodes in the hifiasm assembly graph:

2. K-mer analysis:

Installation:

Running k-mer counting procedure

3. Phased clustering

About

Releases 1

Packages

Languages

License

rebeccaserramari/polyploid-potato-assembly

Folders and files

Latest commit

History

Repository files navigation

polyploid-potato-assembly

1. Dosage estimation of the nodes in the hifiasm assembly graph:

2. K-mer analysis:

Installation:

Running k-mer counting procedure

3. Phased clustering

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages