# Using BLAST to Diagnose Genetic Disorders
This notebook demonstrates how to use BLAST (Basic Local Alignment Search Tool) to compare a patient-derived DNA sequence against publicly available sequence data, as a worked example of how similarity search can inform the diagnosis of genetic disorders.

**Learning Objectives:**
- Understand how to input a DNA sequence and perform a BLAST search
- Analyze the output and identify potential genetic disorders
- Explore a real dataset from NCBI or GenBank
- Interpret BLAST results in a clinical context

## Setup
We'll use Biopython to interact with NCBI's BLAST API (online BLAST search).

In [7]:
!pip install biopython -q

In [8]:
from Bio.Blast import NCBIWWW, NCBIXML
from Bio import SeqIO
from Bio.Seq import Seq
from Bio.SeqRecord import SeqRecord

# Example: SMN1 gene mutation from a patient suspected of Spinal Muscular Atrophy
sequence_data = SeqRecord(Seq("ATGGCGGAGACCCGAAATGAGCAGCGGCGGCAGCGGCGG"), id="PatientSMN1")
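
Before submitting a query, it can help to sanity-check it. The following is a minimal sketch in plain Python (no Biopython required). The helper name `check_dna` is hypothetical, and treating A, C, G, T and N as the only acceptable characters is an assumption made for this example.

```python
from collections import Counter


def check_dna(seq: str, allowed: str = "ACGTN") -> dict:
    """Summarise a nucleotide string and flag unexpected characters."""
    seq = seq.upper()
    counts = Counter(seq)
    # Any character outside the assumed alphabet is reported, not silently dropped.
    unexpected = sorted(set(seq) - set(allowed))
    gc_fraction = (counts["G"] + counts["C"]) / len(seq) if seq else 0.0
    return {"length": len(seq), "gc_fraction": gc_fraction, "unexpected": unexpected}


# Same sequence as the query defined in the cell above
query = "ATGGCGGAGACCCGAAATGAGCAGCGGCGGCAGCGGCGG"
summary = check_dna(query)
print(summary["length"], round(summary["gc_fraction"], 2), summary["unexpected"])
```

A query this short may return few hits below a strict E-value cutoff, so checking the length up front gives some context for reading the results later.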

## Run BLAST Search
We'll submit the sequence to NCBI BLAST and get the results (requires internet access).

In [10]:
# Warning: this query requires internet access and can take several minutes.
# The result is saved to blast_result.xml so later cells can reuse it offline.

result_handle = NCBIWWW.qblast("blastn", "nt", sequence_data.format("fasta"))
with open("blast_result.xml", "w") as out_handle:
    out_handle.write(result_handle.read())
result_handle.close()
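
Because each remote search is slow, one option is to skip it when a saved result already exists. The sketch below shows that idea with the standard library only. `cached_search` is a hypothetical helper, and the search itself is passed in as a callable so the example runs without network access. In this notebook the callable could wrap the `qblast` call above, assuming its handle's `.read()` returns text.

```python
import tempfile
from pathlib import Path
from typing import Callable


def cached_search(path: str, run_search: Callable[[], str]) -> str:
    """Return saved results if `path` exists; otherwise run the search and save it."""
    out = Path(path)
    if out.exists():
        return out.read_text()
    xml_text = run_search()
    out.write_text(xml_text)
    return xml_text


def fake_search() -> str:
    # Stand-in for the real search; in the notebook this might be
    # lambda: NCBIWWW.qblast("blastn", "nt", sequence_data.format("fasta")).read()
    return "<BlastOutput/>"


with tempfile.TemporaryDirectory() as tmp:
    target = str(Path(tmp) / "blast_result.xml")
    first = cached_search(target, fake_search)   # runs the stand-in search, writes the file
    second = cached_search(target, fake_search)  # served from the file, no search
    print(first == second)
```

Deleting the saved file forces a fresh search, which matters because the public databases change over time.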

## Parse BLAST Output
Let's load and parse the saved XML file from the BLAST results.

In [19]:
# Load the BLAST XML output saved in the previous step
with open("blast_result.xml") as result_handle:
    blast_record = NCBIXML.read(result_handle)

# Report every HSP (high-scoring segment pair) below the E-value cutoff
E_VALUE_THRESH = 0.01
for alignment in blast_record.alignments:
    for hsp in alignment.hsps:
        if hsp.expect < E_VALUE_THRESH:
            print("****Alignment****")
            print(f"sequence: {alignment.title}")
            print(f"length: {alignment.length}")
            print(f"e value: {hsp.expect}")
            # Aligned query, match line, and aligned subject
            print(hsp.query)
            print(hsp.match)
            print(hsp.sbjct)
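
The E-value alone does not say how closely two sequences agree, so percent identity is a useful companion figure. The sketch below computes it from BLAST XML using only the standard library. The embedded XML is a trimmed toy document with made-up values, and `hsp_summaries` is a hypothetical helper. With Biopython, the same quantities should be available on each HSP as `hsp.identities` and `hsp.align_length`.

```python
import xml.etree.ElementTree as ET

# Toy BLAST XML with made-up values, trimmed to the elements used below.
TOY_XML = """<BlastOutput>
  <BlastOutput_iterations>
    <Iteration>
      <Iteration_hits>
        <Hit>
          <Hit_def>hypothetical subject sequence</Hit_def>
          <Hit_hsps>
            <Hsp>
              <Hsp_evalue>1e-10</Hsp_evalue>
              <Hsp_identity>38</Hsp_identity>
              <Hsp_align-len>39</Hsp_align-len>
            </Hsp>
          </Hit_hsps>
        </Hit>
      </Iteration_hits>
    </Iteration>
  </BlastOutput_iterations>
</BlastOutput>"""


def hsp_summaries(xml_text: str, max_evalue: float = 0.01) -> list:
    """Return (title, e-value, percent identity) for HSPs below the cutoff."""
    root = ET.fromstring(xml_text)
    rows = []
    for hit in root.iter("Hit"):
        title = hit.findtext("Hit_def")
        for hsp in hit.iter("Hsp"):
            evalue = float(hsp.findtext("Hsp_evalue"))
            if evalue >= max_evalue:
                continue
            identical = int(hsp.findtext("Hsp_identity"))
            align_len = int(hsp.findtext("Hsp_align-len"))
            rows.append((title, evalue, 100.0 * identical / align_len))
    return rows


for title, evalue, pct in hsp_summaries(TOY_XML):
    print(f"{title}: e={evalue:g}, identity={pct:.1f}%")
```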

## Clinical Interpretation
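A significant match to SMN1 indicates that the query derives from the SMN locus, but a match alone does not establish a diagnosis of Spinal Muscular Atrophy. SMA is typically caused by homozygous deletion or disruption of SMN1, and the nearly identical paralog SMN2 is likely to appear among the top hits as well. Interpretation therefore depends on where the patient sequence differs from the reference, and any finding needs confirmation by dedicated clinical genetic testing.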
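
Interpreting a hit usually means looking at where the query differs from the matched sequence, not only at whether it matched. As a minimal sketch, the hypothetical helper `list_differences` below walks two aligned strings of equal length (such as `hsp.query` and `hsp.sbjct`) and reports each differing column. The example strings are made up for illustration, and treating `-` as the gap character is an assumption about the alignment format.

```python
def list_differences(query_aln: str, sbjct_aln: str, query_start: int = 1) -> list:
    """Return (query_position, query_base, subject_base) for each differing column.

    For a gap in the query, the reported position is the last query base
    before the gap.
    """
    if len(query_aln) != len(sbjct_aln):
        raise ValueError("aligned strings must have equal length")
    diffs = []
    pos = query_start - 1  # last query position consumed so far
    for q, s in zip(query_aln, sbjct_aln):
        if q != "-":
            pos += 1
        if q != s:
            diffs.append((pos, q, s))
    return diffs


# Made-up aligned strings: one substitution at query position 6
print(list_differences("ATGGCGGAGA", "ATGGCAGAGA"))  # → [(6, 'G', 'A')]
```

Passing `hsp.query_start` as `query_start` would report positions in the coordinates of the original query instead of the aligned fragment.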

**Example Resources:**
- https://www.ncbi.nlm.nih.gov/gene/6606
- https://blast.ncbi.nlm.nih.gov/Blast.cgi

**Reference:** Lefebvre et al., Cell, 1995