This repository hosts a Python script designed to facilitate the conversion of protein sequences stored in NumPy files (.npy) into the FASTA format (.fasta). The script serves as a convenient tool, particularly useful for bioinformatics applications, and specifically intended for use with peptide protein data.

kylwein/Protein-Sequence-Converter-NPY-to-FASTA

Protein Sequence Converter: NPY to FASTA

This Python script converts protein sequences stored in a NumPy file (.npy) into FASTA format (.fasta). It is aimed at bioinformatics workflows and is specifically intended for preparing input for AlphaFold-Multimer and CAMP.

Usage

  1. Prepare Your NumPy File: Place your NumPy file containing protein sequences in the CONVERTER folder.

  2. Update Input File Path: Open the npy-to-fasta.py script and locate the convert_to_fasta() function. Inside this function, set the input_npy_file variable to the path of your NumPy file. If your file is not named protein_sequences.npy, change the file name to match.

  3. Run the Script: Execute the npy-to-fasta.py script. It converts the protein sequences in your NumPy file to FASTA format and saves the output in the same directory.

python npy-to-fasta.py
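
The conversion step can be sketched as follows. This is a minimal illustration, not the code in npy-to-fasta.py: it assumes the .npy file holds a plain array of sequence strings, and the names sequences_to_fasta, convert_npy_to_fasta, id_prefix, and line_width, as well as the seq_1, seq_2, ... header scheme, are hypothetical choices made for the example.

```python
from pathlib import Path

import numpy as np


def sequences_to_fasta(sequences, id_prefix="seq", line_width=60):
    """Format an iterable of sequence strings as FASTA text.

    Headers are numbered seq_1, seq_2, ... (a hypothetical naming scheme).
    """
    parts = []
    for index, sequence in enumerate(sequences, start=1):
        sequence = str(sequence).strip()
        # Wrap long sequences so each line has at most line_width residues.
        wrapped = [
            sequence[start:start + line_width]
            for start in range(0, len(sequence), line_width)
        ]
        parts.append(f">{id_prefix}_{index}\n")
        parts.extend(line + "\n" for line in wrapped)
    return "".join(parts)


def convert_npy_to_fasta(input_npy_file, output_fasta_file=None):
    """Write the sequences in a .npy file to a .fasta file next to it."""
    input_path = Path(input_npy_file)
    output_path = (
        Path(output_fasta_file)
        if output_fasta_file
        else input_path.with_suffix(".fasta")
    )
    # Assumption: the array has a string dtype. Object arrays would need
    # allow_pickle=True, which should only be used with trusted files.
    sequences = np.load(input_path, allow_pickle=False)
    output_path.write_text(sequences_to_fasta(np.atleast_1d(sequences).ravel()))
    return output_path


if __name__ == "__main__":
    # Only runs when a file with the name used in the steps above is present.
    if Path("protein_sequences.npy").exists():
        print(convert_npy_to_fasta("protein_sequences.npy"))
```

If your array stores sequences in another form, such as integer-encoded residues, a decoding step would be needed before formatting.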

Multiple Sequence Separator

This repository also includes a folder called SEPARATOR. It contains a script that splits FASTA files with multiple sequences into individual files, one sequence per file. The script is used in the same way as the CONVERTER script.

Usage

  1. Prepare Your FASTA Files: Place your FASTA files with multiple sequences in the SEPARATOR folder.

  2. Run the Script: Execute the fasta-separator.py script in the SEPARATOR folder. It will automatically separate each sequence into individual files and save them in the same directory.
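
The splitting step can be sketched as follows. This is an illustrative sketch under stated assumptions, not the code in fasta-separator.py: the function names read_fasta_records and split_fasta and the output naming pattern (input name plus a running number) are hypothetical.

```python
from pathlib import Path


def read_fasta_records(text):
    """Group FASTA text into records: each is a header line plus its sequence lines."""
    records = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank lines between records
        if line.startswith(">"):
            records.append([line])  # a header starts a new record
        elif records:
            records[-1].append(line)  # sequence line of the current record
    return records


def split_fasta(input_fasta_file, output_dir=None):
    """Write each record of a multi-sequence FASTA file to its own file."""
    input_path = Path(input_fasta_file)
    out_dir = Path(output_dir) if output_dir else input_path.parent
    out_dir.mkdir(parents=True, exist_ok=True)

    written = []
    records = read_fasta_records(input_path.read_text())
    for index, record in enumerate(records, start=1):
        # Hypothetical naming: <input name>_<n>.fasta in the output directory.
        out_path = out_dir / f"{input_path.stem}_{index}.fasta"
        out_path.write_text("\n".join(record) + "\n")
        written.append(out_path)
    return written
```

Numbering the output files avoids relying on header text, which may contain characters that are not valid in file names.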
