Skip to content

pawelosipowski/fasta_file_processing_scripts

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 

Repository files navigation

#####These are some scripts to use freely, to process files in fasta or fastq format. They are used by me in genome assembly and comparative genomics workflows.

fasta_file_random_trimming.py

From range of fasta records in paired end files randomly deletes defined number of records and puts left records into output file. All the fasta file content is loaded into memory so much depend on a file size. Dealing with whole libraries will be usually not manageable for regular PCs.

USAGE:

fasta_random_trimming.py paired_ends_1.fastq paired_ends_2.fastq output_1.fastq output_2.fastq <number of seq to delete=integer>
fasta_sequence_oneliner.py

Sometimes, mostly from web tools, you get fasta files where sequences in fasta records are fragmented by lines. This script makes them written in one line.

USAGE:

fasta_sequence_oneliner.py input.fasta output.fasta

About

Script to randomly delete sequences from files in fasta or fastq format.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages