GitHub - pawelosipowski/fasta_file_processing_scripts: Script to randomly delete sequences from files in fasta or fastq format.

##### These scripts are free to use for processing files in FASTA or FASTQ format. I use them in genome assembly and comparative genomics workflows.

fasta_random_trimming.py

Randomly deletes a defined number of records from a pair of paired-end files and writes the remaining records to the output files. The entire file content is loaded into memory, so whether a run is feasible depends on the file size. Processing whole sequencing libraries is usually not manageable on a regular PC.

USAGE:

fasta_random_trimming.py paired_ends_1.fastq paired_ends_2.fastq output_1.fastq output_2.fastq <number of seq to delete=integer>
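
The following is a minimal sketch of the idea, not the code of the script itself. It assumes unwrapped FASTQ input (exactly four lines per record) and assumes that the same record positions are deleted from both files so that the pairs stay in sync. The names `read_fastq_records`, `trim_pairs` and the `seed` parameter are hypothetical and used only for illustration.

```python
import random


def read_fastq_records(path):
    """Load a whole FASTQ file into memory as a list of 4-line records.

    Assumption: every record occupies exactly four lines (no wrapped sequences).
    """
    with open(path) as handle:
        lines = handle.read().splitlines()
    return [lines[i:i + 4] for i in range(0, len(lines), 4)]


def trim_pairs(in1, in2, out1, out2, n_delete, seed=None):
    """Delete n_delete randomly chosen read pairs and write the rest.

    Returns the number of pairs that were kept.
    """
    reads1 = read_fastq_records(in1)
    reads2 = read_fastq_records(in2)
    if len(reads1) != len(reads2):
        raise ValueError("paired files contain different numbers of records")
    if not 0 <= n_delete <= len(reads1):
        raise ValueError("number of records to delete is out of range")

    # Pick the positions to drop once, then apply them to both files
    # (assumption: mates are stored at the same position in each file).
    rng = random.Random(seed)
    drop = set(rng.sample(range(len(reads1)), n_delete))

    for records, out_path in ((reads1, out1), (reads2, out2)):
        with open(out_path, "w") as out:
            for index, record in enumerate(records):
                if index not in drop:
                    out.write("\n".join(record) + "\n")
    return len(reads1) - n_delete
```

Because both inputs are held in memory as lists, memory use grows with the size of the files, which matches the limitation described above.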

fasta_sequence_oneliner.py

Sometimes, mostly from web tools, you get FASTA files in which the sequence of each record is split across several lines. This script writes each sequence on a single line.

USAGE:

fasta_sequence_oneliner.py input.fasta output.fasta
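
A minimal sketch of how such unwrapping can be done is shown here. It is an illustration under stated assumptions, not the code of the script itself: header lines are assumed to start with `>`, blank lines are skipped, and the function name `unwrap_fasta` is hypothetical.

```python
def unwrap_fasta(in_path, out_path):
    """Rewrite a FASTA file so that each sequence occupies a single line."""
    with open(in_path) as src, open(out_path, "w") as dst:
        header = None
        chunks = []
        for line in src:
            line = line.strip()
            if not line:
                continue  # skip blank lines
            if line.startswith(">"):
                # A new record starts: flush the previous one, if any.
                if header is not None:
                    dst.write(header + "\n" + "".join(chunks) + "\n")
                header = line
                chunks = []
            else:
                # Sequence fragment; any text before the first header
                # is discarded when that header is reached.
                chunks.append(line)
        # Flush the last record.
        if header is not None:
            dst.write(header + "\n" + "".join(chunks) + "\n")
```

Unlike the trimming sketch, this reads the input line by line and keeps only one record in memory at a time.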

