Biohackathon-2022
Prerequisites to run the code:
- Have python and gcc/mingw installed and have them in PATH
- Have biopython and blast installed and have them in PATH(Have the bin folder itself if possible)*at your own risk
steps to run the pipeline
- Choose the complete genome of your choice in FASTA format and keep it in the project directory as "input.fasta"
- run the source executable file
- Check the blast_output3 for all data, blast_output4 for titles of the related toxin and blast_output5 for evalues in order of identical nature
How to try ?
- there are complete genome sequence of a few bacteria in data-genome folder which can be copied outside as "input.fasta"
- follow the steps of pipeline to get the desired result
Major libraries used
- Blast Manual for Blast - https://www.ncbi.nlm.nih.gov/books/NBK279690/
- pandas for preparing database pandas documentation - https://pandas.pydata.org/docs/