Penguin

Penguin: A Tool for Predicting Pseudouridine Sites in Direct RNA Nanopore Sequencing Data

Getting Started and Prerequisites

The following software and modules should be installed before using Penguin:

Python 3.6.10

minimap2 (https://github.com/lh3/minimap2)

Nanopolish (https://github.com/jts/nanopolish)

samtools (http://www.htslib.org/)

numpy 1.18.1

pandas 1.0.1

sklearn 0.22.2.post1

tensorflow 2.0.0

keras 2.3.1 (using Tensorflow backend)
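The prerequisite list above can be checked with a short script before running Penguin. What follows is a minimal sketch, not part of Penguin itself. It assumes that the Python modules expose a `__version__` attribute and that the external tools are installed on the PATH under the executable names `minimap2`, `nanopolish` and `samtools`. The function names `installed_version` and `check_environment` are hypothetical.

```python
import importlib
import shutil

# Versions copied from the prerequisite list above.
REQUIRED_MODULES = {
    "numpy": "1.18.1",
    "pandas": "1.0.1",
    "sklearn": "0.22.2.post1",
    "tensorflow": "2.0.0",
    "keras": "2.3.1",
}

# Assumption: the tools are on PATH under these executable names.
REQUIRED_TOOLS = ["minimap2", "nanopolish", "samtools"]


def installed_version(module_name):
    """Return the module's __version__ string, or None if it cannot be imported."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        return None
    return getattr(module, "__version__", "unknown")


def check_environment(modules=REQUIRED_MODULES, tools=REQUIRED_TOOLS,
                      get_version=installed_version, find_tool=shutil.which):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    for name, wanted in modules.items():
        found = get_version(name)
        if found is None:
            problems.append(f"{name}: not installed (README lists {wanted})")
        elif found != wanted:
            problems.append(f"{name}: found {found}, README lists {wanted}")
    for tool in tools:
        if find_tool(tool) is None:
            problems.append(f"{tool}: not found on PATH")
    return problems


# Typical use:
#   for problem in check_environment():
#       print(problem)
```

A version mismatch is reported rather than treated as fatal, because the README does not say whether other versions also work.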

Running Penguin:

To run Penguin, do the following:

1- Ensure that the BED file is in the same directory as Penguin's main.py file.
2- Run the following Python command:

python main.py -r ref.fa -f reads.fastq

Penguin needs the following two input files when it runs:

A reference genome file (ref.fa)
The FASTQ reads file (reads.fastq)

Note:

The user should enter the BED file name with its absolute path and extension.
The user should place the fast5 files folder (fast5_files), from which the reads.fastq file was generated, in the same directory as main.py.
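The layout requirements above (BED file next to main.py and given by absolute path, the fast5_files folder next to main.py, and the two input files) can be verified before launching a run. The sketch below is illustrative only and is not taken from Penguin's code. The helper names `preflight` and `build_command` are hypothetical, and the only command-line flags used are the `-r` and `-f` options shown in the command above.

```python
import os


def preflight(workdir, ref, reads, bed, fast5_dir="fast5_files"):
    """Return a list of layout problems; an empty list means the checks passed.

    workdir is the directory holding main.py. fast5_dir is the folder name
    used in the note above; adjust it if your folder is named differently.
    """
    problems = []

    if not os.path.isfile(os.path.join(workdir, "main.py")):
        problems.append("main.py not found in " + workdir)

    # Relative input paths are resolved against workdir, because the
    # command is run from there; absolute paths pass through unchanged.
    for label, path in (("reference genome", ref), ("FASTQ reads", reads)):
        if not os.path.isfile(os.path.join(workdir, path)):
            problems.append(label + " file not found: " + path)

    if not os.path.isabs(bed):
        problems.append("BED file path is not absolute: " + bed)
    elif not os.path.isfile(bed):
        problems.append("BED file not found: " + bed)
    elif os.path.realpath(os.path.dirname(bed)) != os.path.realpath(workdir):
        problems.append("BED file is not in the same directory as main.py: " + bed)

    fast5_path = os.path.join(workdir, fast5_dir)
    if not os.path.isdir(fast5_path):
        problems.append("fast5 folder not found: " + fast5_path)

    return problems


def build_command(ref, reads):
    """Build the documented command line as an argument list."""
    return ["python", "main.py", "-r", ref, "-f", reads]
```

If `preflight` returns an empty list, the argument list from `build_command` can be passed to `subprocess.run` with `cwd` set to the directory that contains main.py.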
