Code for CS229 project:
generate_gb1_fasta.ipynb: generate fasta sequences for GB1 double mutants
embed_fasta_sequences.ipynb: generate ProtTrans embeddings for GB1 double mutants
cs229_epistasis_pipeline_gb1.ipynb: primary pipeline for running model, including TE and EP components
visualize_embeddings.ipynb: eigendecomposition and clustering of KG embeddings for comparison with biophysical features