# **Submodule 1.2 - Protein Structure Analysis**

## **Learning Objectives:**
1. Understand dihedral and torsion angles
2. Understand bond angles
3. Understand bond energy and distances
4. Understand homology analysis

## **Prerequisites:**
**Conceptual Knowledge:**
- **Protein Basics:** Familiarity with amino acids, peptide bonds, and the four levels of protein structure (primary, secondary, tertiary, quaternary).
- **Dihedral Angles:** Understanding of dihedral angles (phi, psi, chi) and their role in protein conformation and stability.

**Software Proficiency:**
- **PyMOL Basics:** Basic proficiency in using PyMOL for protein visualization, including loading PDB files, changing display modes, and navigating the interface.
- **Python Fundamentals:** Basic understanding of Python programming, including syntax, variables, loops, and functions, for script-based analysis.

## **Introduction**
This submodule explores the fundamental aspects of protein geometry and evolutionary relationships that govern protein structure and function. Using PyMOL as our visualization platform, we'll investigate how the backbone dihedral angles (φ and ψ) and side chain torsion angles define protein conformation, and how these parameters are visualized and analyzed using Ramachandran plots. Through hands-on exercises, you'll learn to distinguish between conformational and configurational changes, understand the allowed regions of protein geometry, and recognize how these geometric constraints influence protein folding and structure. Building on these geometric principles, we'll examine how protein sequences and structures are related through evolution, using homology analysis to understand conservation patterns and structural similarities across protein families. You'll learn to analyze sequence alignments, identify conserved structural motifs, and appreciate how this information can be applied to predict protein structures and guide drug design.

## **Protein Conformation and Dihedral Angles:**
Protein conformation refers to the spatial arrangement of atoms in a protein, primarily determined by rotation around single bonds in the polypeptide backbone. This flexibility allows proteins to adopt different shapes that are crucial for their biological functions. The backbone dihedral angles Φ (phi) and Ψ (psi) are the key rotational parameters in proteins: Φ describes rotation about the bond between the nitrogen and the alpha carbon (N-Cα), and Ψ describes rotation about the bond between the alpha carbon and the carbonyl carbon (Cα-C). Each angle is defined by four consecutive backbone atoms, and the IUPAC definition of dihedral angles standardizes how these rotations are measured and signed in protein structures.

<center><img src="images/dihedral_angles.png" width=800 /></center><br><br>

The specific values of Φ and Ψ angles influence the secondary structure that a protein can adopt (e.g., alpha-helix, beta-sheet). Dihedral angles also affect the energy states of proteins, where unfavorable rotations can lead to steric clashes and high-energy conformations, while favorable angles result in stable structures.
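
As a minimal sketch of how a dihedral angle such as Φ or Ψ can be computed from four atomic positions, the function below uses only the Python standard library. The name `dihedral_deg` and the toy coordinates are illustrative assumptions and are not part of PyMOL. The sign follows the IUPAC convention: looking along the central bond, a clockwise rotation of the far bond relative to the near bond is positive.

```python
import math

def _sub(a, b):
    return tuple(x - y for x, y in zip(a, b))

def _dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def dihedral_deg(p0, p1, p2, p3):
    """Signed dihedral angle in degrees for four (x, y, z) points."""
    b0 = _sub(p0, p1)  # from the second atom back to the first
    b1 = _sub(p2, p1)  # central bond, which acts as the rotation axis
    b2 = _sub(p3, p2)  # from the third atom on to the fourth
    length = math.sqrt(_dot(b1, b1))
    axis = tuple(c / length for c in b1)
    # Remove the components along the axis so only the rotation remains.
    v = _sub(b0, tuple(_dot(b0, axis) * c for c in axis))
    w = _sub(b2, tuple(_dot(b2, axis) * c for c in axis))
    cos_part = _dot(v, w)
    sin_part = _dot(_cross(axis, v), w)
    return math.degrees(math.atan2(sin_part, cos_part))

# Toy coordinates (not from a real structure): a quarter turn about the z axis.
print(dihedral_deg((1, 0, 0), (0, 0, 0), (0, 0, 1), (0, 1, 1)))
```

For these toy points the call prints `90.0`. For a real residue, Φ would use the coordinates of C(i-1), N(i), Cα(i), C(i), and Ψ would use N(i), Cα(i), C(i), N(i+1).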

### **Ramachandran Plot:**

The Ramachandran plot, developed by Dr. G.N. Ramachandran, is a graphical representation of the allowed and disallowed regions of Φ and Ψ angles in proteins. It helps visualize which combinations of dihedral angles are energetically favorable for a given protein. The plot maps the distribution of these angles, showing regions where secondary structures like alpha helices and beta sheets are found.

<center><img src="images/Ramachandran.png" width=400 /></center><br><br>

#### **How does it help?**
By visualizing the allowed regions, scientists can predict or confirm the secondary structure of a protein based on its dihedral angles. Alpha helices typically fall within a specific region, while beta sheets occupy a different area. Disallowed regions correspond to conformations with steric clashes or unfavorable energy states, making the Ramachandran plot an essential tool in validating protein models.
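
The idea of reading secondary structure from a (Φ, Ψ) pair can be sketched as a simple lookup. The rectangular boundaries below are rough, illustrative assumptions chosen for this example; real validation tools use contoured regions derived from large sets of high-quality structures, so treat the labels as a teaching aid only.

```python
def ramachandran_region(phi, psi):
    """Return a coarse region label for a (phi, psi) pair given in degrees.

    The box boundaries are illustrative assumptions, not an official standard.
    """
    if -160 <= phi <= -20 and -120 <= psi <= 50:
        return "alpha (right-handed helix)"
    if -180 <= phi <= -45 and (psi >= 90 or psi <= -150):
        return "beta"
    if 20 <= phi <= 125 and -45 <= psi <= 90:
        return "left-handed helix"
    return "other / less favoured"

# Typical textbook values for a helix and a strand.
for phi, psi in [(-60, -40), (-120, 120)]:
    print(phi, psi, ramachandran_region(phi, psi))
```

Residues that fall into the last category are not necessarily errors (glycine, for example, is often found there), but many such residues in a model are a reason to inspect it more closely.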

## **Side Chain Torsion Angles (Chi Angles)**
To better understand side chain orientations, we need to explore torsion angles, also known as chi (χ) angles. These angles represent the rotation around specific bonds within the side chain of an amino acid, which allows different conformations:
* Chi 1 (χ1): This is the first and most common torsion angle, measuring rotation around the C-alpha (Cα) to C-beta (Cβ) bond.
* Chi 2 (χ2), Chi 3 (χ3), Chi 4 (χ4): These angles are measured for amino acids with more complex side chains that contain additional rotatable bonds.
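
To make the chi definitions concrete, the sketch below lists the four atoms that define each chi angle for a few residues, using standard PDB atom names. The dictionary is deliberately incomplete, and the names `CHI_ATOMS` and `chi_atoms` are hypothetical helpers introduced only for this illustration.

```python
# Four atoms per chi angle, in order, using standard PDB atom names.
# Only a handful of residues are listed; this is not a complete table.
CHI_ATOMS = {
    "SER": [("N", "CA", "CB", "OG")],
    "VAL": [("N", "CA", "CB", "CG1")],
    "LEU": [("N", "CA", "CB", "CG"), ("CA", "CB", "CG", "CD1")],
    "MET": [("N", "CA", "CB", "CG"), ("CA", "CB", "CG", "SD"),
            ("CB", "CG", "SD", "CE")],
    "LYS": [("N", "CA", "CB", "CG"), ("CA", "CB", "CG", "CD"),
            ("CB", "CG", "CD", "CE"), ("CG", "CD", "CE", "NZ")],
}

def chi_atoms(resn, n):
    """Return the atom names defining chi-n for a residue, or None."""
    angles = CHI_ATOMS.get(resn.upper(), [])
    if 1 <= n <= len(angles):
        return angles[n - 1]
    return None

print(chi_atoms("LYS", 1))  # atoms around the CA-CB bond
print(chi_atoms("SER", 2))  # serine has no chi2, so this is None
```

Each successive chi angle slides the four-atom window one atom further out along the side chain, which is why short side chains such as serine only have χ1.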

<center><img src="images/torsion_angle.png" width=800 /></center><br><br>

```python
# Side chain torsion angles (video)
from IPython.display import YouTubeVideo

# Embed the YouTube video in the notebook
YouTubeVideo(id='zoIBJxGOt4g', height=200, width=400)
```

## **Conformation and Configuration**
**Conformation** is any of the different spatial arrangements of atoms in a molecule that result from rotation of a chemical group around a single bond; the positions of the groups change without breaking any bond.<br><br>
**Configuration** is a particular arrangement of atoms in a molecule that cannot be interconverted by rotation around single bonds, for example the cis and trans arrangements of atoms.

<center><img src="images/configuration.png" width=600 /></center><br><br>

<center><img src="images/proline.png" width=600 /></center><br><br>

## **Homology Analysis**
Homology analysis is an important tool in structural biology that helps us understand how proteins are related and how they evolved. By comparing the 3D shapes of similar proteins, this analysis can pinpoint key functional regions, predict where proteins might bind to other molecules, and reveal shared molecular functions across different species. By recognizing these similarities and differences, scientists can make informed guesses about the role of newly discovered proteins, study how proteins have adapted over time, or gain insight into disease mechanisms.

In the activity below, we will:

1. Load and view two related protein structures using PyMOL’s command line tools, coloring them differently for better clarity.
2. Align the protein structures to compare them and calculate their similarity using the Root Mean Square Deviation (RMSD) value, which measures how alike they are in structure.
3. Highlight interaction areas by showing residues that are close together in the aligned structures, helping us identify conserved regions or potential contact points between the proteins.
4. Investigate active sites by identifying important residues and representing them using spheres or sticks to clearly mark their positions.

This hands-on tutorial will show you how to use these tools to explore how structural similarities between proteins can provide insights into shared functions and interaction sites. Homology analysis in PyMOL is not only a valuable approach for studying evolution but also plays a crucial role in fields like drug discovery and protein engineering.

Let’s get started with the activity where you’ll learn how to run each command and analyze the results to understand the similarities and functional importance of related protein structures.

### **Homology Analysis in Drug Discovery**  

Homology analysis plays a key role in drug discovery by using evolutionary relationships between proteins to predict their structure, function, and potential drug binding sites. By comparing protein sequences and structures across different species and protein families, scientists can identify conserved regions that are often crucial for protein function and stability.  

**How homology analysis helps in drug design:**  
- Identifies conserved drug binding sites, making it easier to target functionally important regions.  
- Builds structural models for proteins that lack experimental data, aiding in virtual screening and structure-based drug design.  
- Predicts potential off-target effects by analyzing sequence conservation across species, improving drug selectivity.  

**When is it useful?**  
- No crystal structure is available for the target protein.  
- Developing selective inhibitors within protein families.  
- Investigating drug resistance mechanisms.  
- Identifying new drug targets.  
- Assessing drug safety across species.  

**Steps in homology analysis:**  
- Aligning protein sequences to find conserved regions.  
- Comparing protein structures to detect similarities.  
- Identifying conserved binding pockets and key residues.  
- Evaluating species differences for better drug selectivity.  
- Predicting potential off-target interactions.  
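
The first step above, finding conserved positions in a sequence alignment, can be sketched in a few lines. The two aligned sequences are made-up toy strings, and `percent_identity` and `conserved_columns` are hypothetical helper names. Note that percent identity has several definitions; this sketch divides by the number of columns where neither sequence has a gap.

```python
def percent_identity(aln_a, aln_b):
    """Percent identity over columns where neither sequence has a gap."""
    if len(aln_a) != len(aln_b):
        raise ValueError("aligned sequences must have the same length")
    compared = matches = 0
    for a, b in zip(aln_a, aln_b):
        if a == "-" or b == "-":
            continue  # skip gap columns
        compared += 1
        if a == b:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0

def conserved_columns(aln_a, aln_b):
    """Zero-based alignment columns where both sequences share a residue."""
    return [i for i, (a, b) in enumerate(zip(aln_a, aln_b))
            if a == b and a != "-"]

# Toy pairwise alignment (invented for illustration, "-" marks a gap).
seq_a = "MKT-AYIAK"
seq_b = "MKSQAYLAK"
print(percent_identity(seq_a, seq_b))
print(conserved_columns(seq_a, seq_b))
```

For the toy alignment this reports 75.0 percent identity, with conserved columns 0, 1, 4, 5, 7, and 8. In practice the aligned strings would come from an alignment tool rather than being typed by hand.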

In **Activity 5**, you will use various tools to perform homology analysis and explore its applications in drug discovery.


-------------
# 📊 Tutorials
In these tutorials, we will use PyMOL, ClusPro, and other cheminformatics tools to work through <u>**five**</u> applied activities:
- Measuring dihedral angles
- Plotting a Ramachandran map for hemoglobin
- Measuring torsion angles and analyzing side chain-ligand interactions
- Analyzing the importance of proline
- Working through a homology analysis example

## Before you begin:
- Run the PyMOL GUI by following the directions provided in the Submodule 0 notebook, provided here: [pymol_notebook](../submodule0_pymol_setup/pymol_notebook.ipynb)


## 🌟 **Activity 1: Visualizing and Measuring Phi (ϕ) and Psi (ψ) Dihedral Angles**

### **Objective:**  
To visualize and measure *phi (ϕ)* and *psi (ψ)* dihedral angles of a peptide fragment using PyMOL.

### **Steps to Perform the Activity**

#### **Step 1: Load the Peptide Structure**
1. Open the PDB file using the *GUI menu* to load the peptide structure:<br>
     `File > Open > peptide.pdb`<br>
   - <u>Note</u>: The instructor will provide the `peptide.pdb` file. Place it in the designated folder for easy access.

2. Change the display mode to *sticks* to hide the default cartoon representation:<br>
     `Hide > Cartoon`
     
   - The peptide will now be displayed in stick representation with the following *color codes*:  
       - Green: Carbon  
       - Blue: Nitrogen  
       - Red: Oxygen  
       - Yellow: Sulfur  

#### **Step 2: Orient the Peptide**
1. Display the sequence (at the top of the PyMOL interface) by navigating to:
     `Display > Sequence`

2. Select *Serine (S)* by clicking *S* in the sequence display.

3. Rotate the peptide such that it follows the convention for displaying peptides:
     - *N-terminal (Serine)* is on the *left*.
     - *C-terminal (Aspartic acid)* is on the *right*.  

#### **Step 3: Measure Dihedral Angles for Ala296 and Pro297**
1. Set the selection mode by navigating to:<br>
   `Mouse > Selection Mode > Atoms`

2. Activate the measurement wizard by navigating to:<br>
    `Wizard > Measurement > Distances > Dihedral`
   - The wizard will allow you to click on atoms to measure dihedral angles.

3. Measure Phi (ϕ) of Ala296 by clicking on the following atoms in sequence:
     1. Carbonyl carbon (*C*, the carbon of the *C=O* group) of Ser295  
     2. *N* of Ala296  
     3. *C-alpha* of Ala296  
     4. Carbonyl carbon (*C*) of Ala296  
   - The *ϕ angle* for Ala296 will be displayed as *-108°*.

4. Measure Psi (ψ) of Ala296 by clicking on the following atoms in sequence:
     1. *N* of Ala296  
     2. *C-alpha* of Ala296  
     3. Carbonyl carbon (*C*) of Ala296  
     4. *N* of Pro297  
   - The *ψ angle* for Ala296 will be displayed as *89°*.

5. Measure *ϕ* and *ψ* for Pro297 using the same method outlined above.
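
The same measurements can be scripted instead of clicked. The sketch below builds the four atom selections for Φ and Ψ of one residue and passes them to PyMOL's `cmd.get_dihedral`. It assumes the loaded object is named `peptide`, that residue numbering is contiguous, and that each selection matches exactly one atom (a single chain with no alternate locations); `backbone_dihedral_atoms` and `measure_phi_psi` are hypothetical helper names.

```python
def backbone_dihedral_atoms(obj, resi):
    """Return PyMOL selection strings for the phi and psi atoms of a residue."""
    def atom(res, name):
        # /object/segment/chain/residue/atom; empty fields match anything.
        return f"/{obj}///{res}/{name}"

    phi = (atom(resi - 1, "C"), atom(resi, "N"), atom(resi, "CA"), atom(resi, "C"))
    psi = (atom(resi, "N"), atom(resi, "CA"), atom(resi, "C"), atom(resi + 1, "N"))
    return phi, psi

def measure_phi_psi(obj, resi):
    """Measure phi and psi in degrees; must run inside a PyMOL session."""
    from pymol import cmd  # imported here so the helper above works without PyMOL
    phi_atoms, psi_atoms = backbone_dihedral_atoms(obj, resi)
    return cmd.get_dihedral(*phi_atoms), cmd.get_dihedral(*psi_atoms)

# Show which atoms would be used for Ala296 (object name is an assumption).
phi_sel, psi_sel = backbone_dihedral_atoms("peptide", 296)
print(phi_sel)
print(psi_sel)
```

Inside PyMOL, `measure_phi_psi("peptide", 296)` should return values close to those read from the measurement wizard, and calling it with 297 repeats the measurement for Pro297.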

### **Understanding the Dihedral Angles**
- <u>Phi (ϕ) Angle:</u> The rotation around the bond between the nitrogen (N) and the alpha carbon (Cα).
- <u>Psi (ψ) Angle:</u> The rotation around the bond between the alpha carbon (Cα) and the carbonyl carbon (C=O).
- These angles define the secondary structure (alpha helices, beta sheets, turns) of proteins.

-------------------
## 🌟 **Activity 2: To Plot and Analyze a Ramachandran Map for Hemoglobin**

### **Objective:**  
To generate a **Ramachandran plot** for hemoglobin and analyze the phi (ϕ) and psi (ψ) angles of residues in the protein, categorized by their structural properties.

### **Steps to Perform the Activity**

#### **Step 1: Install the DynoPlot Plugin**
1. Install the *DynoPlot Plugin* within PyMOL by following these steps:<br>
   - Go to `Plugin > Plugin Manager > Install New Plugin` <br>
   - Upload or install the plugin from the GitHub repository: <br>
       + https://github.com/Pymol-Scripts/Pymol-script-repo/blob/master/plugins/dynoplot.py<br>
   - Follow the prompts to complete installation.<br>

#### **Step 2: Fetch the Hemoglobin Structure**
1. Fetch the PDB file for Hemoglobin:<br>
   `PyMOL> fetch 3BCQ`
   - This command downloads and displays the hemoglobin structure in PyMOL.

#### **Step 3: Generate a Ramachandran Plot**
1. Plot the Ramachandran Map for All Residues:<br>
   `PyMOL> rama all`
   - This command generates a *Ramachandran plot* for all residues in the hemoglobin structure.
   - Shapes and meanings within the plot include the following:
     - <u>Triangles (Glycine)</u>: Glycine has only a hydrogen atom as its side chain, so it can occupy a much wider range of the plot than other residues.
     - <u>Squares (Proline)</u>: Proline typically has **phi ≈ -60°** because its ring restricts rotation about the N-Cα bond.
     - <u>Circles (All other residues)</u>: Represent standard amino acids, positioned according to their phi and psi values.

#### **Step 4: Color the Secondary Structure**
1. Using the PyMOL GUI, display the protein by secondary structure by following these steps:<br>
   - Go to the menu next to `3BCQ` (on the right side).<br>
   - Click `C (Color) > Color by SS (Secondary Structure)`<br>
       - This will color-code the structure based on secondary elements (e.g., helices, sheets, loops).

2. Replot the Ramachandran Map with Secondary Structure Colors:<br>
   `PyMOL> rama all`<br>
   - The updated Ramachandran plot now uses the *secondary structure coloring scheme*.

#### **Step 5: Analyze the Plot**
1. Compare with Standard Ramachandran Plot:
  - Analyze the distribution of residues in *allowed regions* (e.g., alpha-helical, beta-sheet regions).
  - Compare the plot to standard Ramachandran distributions to identify structural patterns.
2. Take note of the following <u>Key Observations</u>:
  - Helical residues cluster in the *alpha region* (phi ≈ -60°, psi ≈ -40°).
  - Beta-sheet residues cluster in the *beta region* (phi ≈ -120°, psi ≈ 120°).
  - Glycine residues are widely distributed due to their flexibility.

#### **Step 6: Save the Ramachandran Plot**
1. Save the plot as a *PostScript file* for documentation:
   `PyMOL> rama all, none, rama_plot.ps`
   - The file `rama_plot.ps` will be saved in the working directory.

### ❓ **Discussion Questions**
Click through the flashcards below to explore additional questions about Ramachandran plots.

```python
# Render flashcards
from IPython.display import IFrame
IFrame('quiz/question_flashcards_1.2.2.html', width=600, height=500)
```

---------------------
## 🌟 **Activity 3: Measuring the Side Chain Torsion Angle and Understanding the Importance of Side Chain Interaction in Ligand Binding**

### **Objective:**  
To measure the *side chain torsion angle (chi)* and study the role of side chain interactions in ligand (ibuprofen) binding using PyMOL.

### **Steps to Perform the Activity**

#### **Step 1: Open the Pre-Prepared PyMOL Session File**
1. Load the pre-saved PyMOL session file from the GUI menu:<br>
     `File > Open > sidechain.pse`
    - This session file is pre-configured with the PDB structure *2BXG*, showing ibuprofen bound to human serum albumin.

2. In the initial display, ibuprofen is shown as **thick sticks**, and the parts of human serum albumin involved in the interaction are shown as *thin sticks*.

#### **Step 2: Visualize Side Chain Interactions**
1. Rotate and zoom into the structure to focus on the *binding pocket* where ibuprofen interacts with the protein.
2. Observe how side chains of the protein residues interact with the ligand.
3. Color the interacting protein side chains so they are easy to distinguish from the rest of the structure:<br>
     `PyMol> select sidechains, resi 105+112+187+199  # Example interacting residues`<br>
     `PyMol> color blue, sidechains`

#### **Step 3: Measure Side Chain Torsion Angles (Chi)**
1. Activate the measurement wizard by navigating to:<br>
     `Wizard > Measurement > Distances > Dihedral`

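2. Select the atoms that define the *chi angle* (side chain torsion) by clicking on four consecutive atoms along the side chain, in this order:
    - Alpha carbon (Cα)
    - Beta carbon (Cβ)
    - Gamma carbon (Cγ) or corresponding atom<br>
    - Delta carbon (Cδ) or corresponding atom<br>
       - The dihedral angle will be displayed on the screen.<br>
       - This Cα-Cβ-Cγ-Cδ sequence measures rotation about the Cβ-Cγ bond (χ2). To measure χ1 (rotation about the Cα-Cβ bond), start one atom earlier and click backbone N, Cα, Cβ, and Cγ.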

3. Repeat for all key residues, measuring chi angles for all residues in contact with ibuprofen.

#### **Step 4: Analyze the Side Chain Interaction**
1. <u> Observe the Chi Angles </u>: Note the side chain orientations (chi angles) of residues in the binding pocket.
   Analyze whether the angles optimize interaction with ibuprofen.

2. Observe the following <u>Key Points</u>:
   - Hydrogen bonding.
   - Hydrophobic interactions.
   - Steric complementarity between ibuprofen and the protein side chains.

#### ❓ **Additional Discussion Questions**
Click through the flashcards below to explore additional questions about sidechain angles and ligand binding.

```python
# Render flashcards
from IPython.display import IFrame
IFrame('quiz/question_flashcards_1.2.4.html', width=600, height=500)
```

## 🌟 **Activity 4: Studying the Proline Conformation**

### **Objective:**  
To study the unique conformation of <u>proline</u> residues in a protein using PyMOL.

### **Steps to Perform the Activity**

#### **Step 1: Load the Protein Structure**
1. Fetch and display the PDB file using the following PyMOL command:<br>
     `PyMol> fetch 5GV0`

2. Modify the display to hide the default cartoon representation:<br>
     `PyMol> hide cartoon`
     
3. Show the structure in *stick representation*:<br>
     `PyMol> show sticks`
     
#### **Step 2: Select Proline Residues**
1. Use the following command to select all prolines in the structure:<br>
     `PyMol> select ////PRO/ca`
   - This selects only the *alpha carbon (Cα)* atoms of proline residues.

2. Color the selected prolines for better visualization:<br>
     `PyMol> color red, sele`

#### **Step 3: Analyze Proline's Conformation**
1. <u>Observe the Conformation</u>: Rotate and zoom in to examine the cyclic structure of proline. Proline has a rigid structure due to its cyclic side chain, which restricts its *phi (ϕ)* dihedral angle to around *-60°*.
2. Note the following <u>Key Characteristics</u>:
   - Proline's rigidity introduces *kinks* in secondary structures.
   - It often occurs in *turns* or *loops* rather than helices or sheets.

--------------------
## 🌟 **Activity 5: Homology Analysis PyMOL Example**
This tutorial applies the homology analysis concepts introduced earlier in PyMOL, from structural alignment to measuring specific distances and angles between important regions.

### **Objectives:**
- Load and view two related protein structures using PyMOL’s command line tools, coloring them differently for better clarity.
- Align the protein structures to compare them and calculate their similarity using the Root Mean Square Deviation (RMSD) value, which measures how alike they are in structure.
- Highlight interaction areas by showing residues that are close together in the aligned structures, helping us identify conserved regions or potential contact points between the proteins.
- Investigate active sites by identifying important residues and representing them using spheres or sticks to clearly mark their positions.

### **Steps to Perform the Activity**

#### **Step 1: Load the Structures**
1. Run the following PyMOL commands to load the protein structures: <br>
`PyMol> fetch 1A3N, async=0  # Load the first structure (e.g., protein A)` <br>
`PyMol> fetch 1BVC, async=0  # Load the second structure (e.g., protein B)`<br>

<details>
  <summary>Click to see Step 1 Results</summary>
  <center><img src="images/step1.png" width=400 /></center><br><br>

</details>

#### **Step 2: Set Initial Representation for Better Clarity**
1. Run the following PyMOL commands to set appropriate display parameters: <br>
`PyMol> hide everything, all  # Hide all elements initially for clarity` <br>
`PyMol> show cartoon, all  # Show cartoon representation for all proteins` <br>
`PyMol> color cyan, 1A3N  # Color protein A cyan` <br>
`PyMol> color magenta, 1BVC  # Color protein B magenta` <br>

<details>
  <summary>Click to see Step 2 Results</summary>
  <center><img src="images/step2.png" width=400 /></center><br><br>

</details>

#### **Step 3: Align the Structures**
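1. Align protein B (1BVC) to protein A (1A3N). The `align` command first performs a sequence alignment and then superimposes the matched atoms, rejecting outliers over several refinement cycles, so that the structural similarity of the two proteins can be assessed.<br>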
`PyMol> align 1BVC, 1A3N  # Align protein B (1BVC) to protein A (1A3N)`

<details>
  <summary>Click to see Step 3 Results</summary>
  <center><img src="images/step3.png" width=400 /></center><br><br>
    
  ##### RESULT:
  - Match: assigning 464 x 1029 pairwise scores.
  - MatchAlign: aligning residues (464 vs 1029)...
  - MatchAlign: score 114.000
  - ExecutiveAlign: 862 atoms aligned.
  - ExecutiveRMS: 33 atoms rejected during cycle 1 (RMSD=2.56).
  - ExecutiveRMS: 26 atoms rejected during cycle 2 (RMSD=1.71).
  - ExecutiveRMS: 19 atoms rejected during cycle 3 (RMSD=1.57).
  - ExecutiveRMS: 10 atoms rejected during cycle 4 (RMSD=1.50).
  - ExecutiveRMS: 5 atoms rejected during cycle 5 (RMSD=1.47).
  - Executive: RMSD =    1.456 (769 to 769 atoms)

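    The two structures are now superimposed. The final line of the output reports the RMSD (Root Mean Square Deviation), here 1.456 Å over the 769 atom pairs that remain after five cycles of outlier rejection; a lower RMSD means the matched atoms lie closer together.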
</details>
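
The refinement cycles in the output above can be illustrated with a simplified sketch of outlier rejection. It takes a list of per-atom-pair deviations (in Å) and repeatedly drops pairs whose deviation exceeds `cutoff` times the current RMSD, using `cutoff=2.0` and `cycles=5` to mirror the defaults of PyMOL's `align`. This is an assumption-laden simplification: PyMOL also re-fits the superposition after each cycle, which this sketch omits, and `reject_outliers` is a hypothetical name.

```python
import math

def rms(values):
    """Root mean square of a non-empty list of numbers."""
    return math.sqrt(sum(v * v for v in values) / len(values))

def reject_outliers(deviations, cutoff=2.0, cycles=5):
    """Drop deviations above cutoff * RMSD, for at most `cycles` rounds.

    Returns the surviving deviations and their RMSD. No re-fitting is done,
    so this only illustrates the rejection logic.
    """
    kept = list(deviations)
    if not kept:
        raise ValueError("need at least one deviation")
    for _ in range(cycles):
        threshold = cutoff * rms(kept)
        survivors = [d for d in kept if d <= threshold]
        if len(survivors) == len(kept):
            break  # nothing rejected, refinement has converged
        kept = survivors
    return kept, rms(kept)

# Toy deviations in angstroms: four well-matched pairs and one outlier.
kept, value = reject_outliers([1.0, 1.0, 1.0, 1.0, 10.0])
print(len(kept), value)
```

With the toy input, the single 10 Å pair is rejected in the first round and the RMSD of the remaining four pairs is 1.0 Å, which mirrors how the reported RMSD falls as atoms are rejected in each cycle.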

#### **Step 4: Refined Alignment**
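1. Perform a second alignment using the `super` command. Unlike `align`, `super` matches residues by structure rather than by sequence, so it is often the better choice when two proteins have low sequence similarity:<br>
    `PyMol> super 1BVC, 1A3N`
    - The superposition and RMSD are recalculated from the structure-based matching, often resulting in a lower RMSD.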

<details>
  <summary>Click to see Step 4 Results</summary>
    
  ##### RESULT:
  - MatchAlign: aligning residues (153 vs 574)...
  - MatchAlign: score 540.114
  - ExecutiveAlign: 862 atoms aligned.
  - ExecutiveRMS: 32 atoms rejected during cycle 1 (RMSD=1.93).
  - ExecutiveRMS: 24 atoms rejected during cycle 2 (RMSD=1.58).
  - ExecutiveRMS: 14 atoms rejected during cycle 3 (RMSD=1.49).
  - ExecutiveRMS: 12 atoms rejected during cycle 4 (RMSD=1.45).
  - ExecutiveRMS: 11 atoms rejected during cycle 5 (RMSD=1.41).
  - Executive: RMSD =    1.379 (769 to 769 atoms)

</details>

#### **Step 5: Calculate RMSD for Backbone Atoms**
1. We calculate RMSD specifically for CA (alpha carbon) atoms of the structures to evaluate backbone alignment. <br>
`PyMol> rms_cur 1BVC and name CA, 1A3N and name CA`

<details>
  <summary>Click to see Step 5 Results</summary>
    
  ##### RESULT:
  - Executive: RMSD =  5.743 (25 to 25 atoms)

    The RMSD between the two structures is 5.743 Å for 25 matched CA atoms. Unlike `align` and `super`, `rms_cur` does not move either structure; it reports the RMSD of the coordinates as they currently stand. A higher RMSD means less similarity between the compared atoms.

</details>
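
For reference, the RMSD of two already-paired coordinate sets, computed without any fitting, reduces to the short function below. The coordinates are toy values, and `rmsd_no_fit` is a hypothetical name used only for this sketch.

```python
import math

def rmsd_no_fit(coords_a, coords_b):
    """RMSD between two equally long lists of (x, y, z) points, no fitting."""
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("need two non-empty coordinate lists of equal length")
    total = 0.0
    for (ax, ay, az), (bx, by, bz) in zip(coords_a, coords_b):
        total += (ax - bx) ** 2 + (ay - by) ** 2 + (az - bz) ** 2
    return math.sqrt(total / len(coords_a))

# Toy example: the second set is the first one shifted by 2 angstroms along z.
set_a = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
set_b = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0)]
print(rmsd_no_fit(set_a, set_b))
```

Because every atom in the toy example is displaced by exactly 2 Å, the RMSD is 2.0 Å. The value depends on which atoms are paired, so RMSD figures are only comparable when the atom selection is the same.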

#### **Step 6: Perform Alignment Without Refinement Cycles**
1. Use the `align` command with zero refinement cycles to see the initial alignment before refinement. <br>
`PyMol> align 1BVC, 1A3N, cycles=0`

<details>
  <summary>Click to see Step 6 Results</summary>
    
  <center> <img src="images/step6.png" width=400 /></center><br><br>
    
  ##### RESULT:
  - Match: read scoring matrix.
  - Match: assigning 464 x 1029 pairwise scores.
  - MatchAlign: aligning residues (464 vs 1029)...
  - MatchAlign: score 114.000
  - ExecutiveAlign: 862 atoms aligned.
  - Executive: RMSD =    2.563 (862 to 862 atoms)

    The alignment score is 114. The alignment without refinement cycles provides an RMSD of 2.563 Å for 862 atoms.

</details>
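
When scripting, the numbers from Steps 3 and 6 do not have to be read from the log. PyMOL's Python API `cmd.align` returns a list of seven values: RMSD after refinement, atoms aligned after refinement, refinement cycles run, RMSD before refinement, atoms aligned before refinement, raw alignment score, and number of residues aligned. The sketch below wraps them in a named tuple; `AlignResult`, `run_align`, and `describe` are hypothetical names, and `run_align` only works inside a PyMOL session with both objects loaded.

```python
from collections import namedtuple

# Field order follows the list returned by PyMOL's cmd.align.
AlignResult = namedtuple("AlignResult", [
    "rmsd_after", "atoms_after", "cycles_run",
    "rmsd_before", "atoms_before", "score", "residues_aligned",
])

def run_align(mobile, target, cycles=5):
    """Run cmd.align and label its return values (requires PyMOL)."""
    from pymol import cmd  # imported lazily so the rest works without PyMOL
    return AlignResult(*cmd.align(mobile, target, cycles=cycles))

def describe(result):
    """Summarize how refinement changed the alignment."""
    rejected = result.atoms_before - result.atoms_after
    return (f"RMSD {result.rmsd_before:.3f} -> {result.rmsd_after:.3f} A, "
            f"{rejected} atoms rejected in {result.cycles_run} cycles")
```

Inside PyMOL, `print(describe(run_align("1BVC", "1A3N")))` would summarize the effect of refinement in one line, which makes it easier to compare the refined alignment with the `cycles=0` run.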

#### **Step 7: Partial Region Alignment**
1. Align residues 50-100 of both structures to focus on a specific region.<br>
`PyMol> align 1BVC and resi 50-100, 1A3N and resi 50-100`

<details>
  <summary>Click to see Step 7 Results</summary>
    
  <img src="images/step7.png" width=600 /><br><br>
    
  ##### RESULT:
  - Match: read scoring matrix.
  - Match: assigning 51 x 204 pairwise scores.
  - MatchAlign: aligning residues (51 vs 204)...
  - MatchAlign: score 38.000
  - ExecutiveAlign: 296 atoms aligned.
  - ExecutiveRMS: 26 atoms rejected during cycle 1 (RMSD=5.96).
  - ExecutiveRMS: 11 atoms rejected during cycle 2 (RMSD=3.19).
  - ExecutiveRMS: 12 atoms rejected during cycle 3 (RMSD=1.53).
  - ExecutiveRMS: 16 atoms rejected during cycle 4 (RMSD=1.23).
  - ExecutiveRMS: 11 atoms rejected during cycle 5 (RMSD=1.04).
  - Executive: RMSD =    0.931 (220 to 220 atoms)

    The alignment between residues 50-100 yields an RMSD of 0.931 Å, indicating a good alignment for that specific region.

</details>

#### **Step 8: Highlight Close Interacting Regions**
1. Highlight the atoms in each structure that lie within 2 Å of the other structure to observe close contacts.<br>
`PyMol> color red, (1BVC within 2 of 1A3N)  # Color atoms in 1BVC that are within 2 Å of 1A3N in red`<br>
`PyMol> color blue, (1A3N within 2 of 1BVC)  # Color atoms in 1A3N that are within 2 Å of 1BVC in blue`<br>

<details>
  <summary>Click to see Step 8 Results</summary>
    
  <img src="images/step8.png" width=600 /><br><br>
    
  ##### RESULT:
  - Executive: Colored 1171 atoms
  - Executive: Colored 1151 atoms

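    This highlights atoms in one structure that lie close to atoms of the other. Because the two structures are superimposed rather than docked, these atoms mainly mark regions where the folds overlap closely, which can point to structurally conserved regions or candidate interaction sites.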

</details>
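
The distance test behind a `within` selection can be sketched with a brute-force search over two coordinate lists. This is an illustration of the idea only, not PyMOL's implementation; the coordinates are toy values and `atoms_within` is a hypothetical name. It requires Python 3.8 or later for `math.dist`.

```python
import math

def atoms_within(query, reference, cutoff):
    """Indices of query points within `cutoff` of any reference point."""
    return [i for i, q in enumerate(query)
            if any(math.dist(q, r) <= cutoff for r in reference)]

# Toy coordinates in angstroms.
structure_a = [(0.0, 0.0, 0.0), (5.0, 0.0, 0.0), (10.0, 0.0, 0.0)]
structure_b = [(0.0, 1.5, 0.0), (10.0, 0.0, 3.0)]
print(atoms_within(structure_a, structure_b, 2.0))
```

Only the first point of `structure_a` has a neighbour within 2 Å here, so the result is `[0]`. The search compares every pair of points, which is fine for small examples but slow for whole proteins, where a spatial grid or tree is normally used.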

#### **Step 9: Measure Specific Distance and Angles**
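1. Measure a specific distance and an angle in the aligned structures to understand spatial relationships. The selections below use PyMOL's `/object/segment/chain/residue/atom` macro form; chain A of 1A3N is chosen here so that each selection matches a single atom.<br>
`PyMol> distance my_distance, /1BVC///100/CA, /1A3N//A/100/CA`<br>
`PyMol> angle my_angle, /1BVC///50/N, /1BVC///51/CA, /1BVC///52/C`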

<details>
  <summary>Click to see Step 9 Results</summary>
    
  ##### RESULT:
    PyMOL will show a visual distance measurement and a calculated angle between the specified atoms.

</details>
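
The two measurements in this step correspond to simple vector formulas, sketched below with the standard library. The coordinates are toy values rather than atoms from 1BVC or 1A3N, and `distance` and `angle_deg` are hypothetical helper names.

```python
import math

def distance(a, b):
    """Euclidean distance between two (x, y, z) points."""
    return math.dist(a, b)

def angle_deg(a, b, c):
    """Angle in degrees at vertex b, formed by the points a-b-c."""
    ba = tuple(x - y for x, y in zip(a, b))
    bc = tuple(x - y for x, y in zip(c, b))
    dot = sum(x * y for x, y in zip(ba, bc))
    norm = math.sqrt(sum(x * x for x in ba)) * math.sqrt(sum(x * x for x in bc))
    cos_theta = max(-1.0, min(1.0, dot / norm))  # guard against rounding error
    return math.degrees(math.acos(cos_theta))

# Toy points forming a 3-4-5 triangle and a right angle at the origin.
print(distance((0.0, 0.0, 0.0), (3.0, 4.0, 0.0)))
print(angle_deg((1.0, 0.0, 0.0), (0.0, 0.0, 0.0), (0.0, 1.0, 0.0)))
```

The middle point passed to `angle_deg` is the vertex, matching the order of the three selections given to PyMOL's `angle` command.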

#### **Step 10: Select and Visualize an Active Site**
1. Select a hypothetical active site in the structure and display it in a different style.<br>
`PyMol> select active_site, resi 50+58+62 and 1A3N`<br>
`PyMol> show spheres, active_site`<br>
`PyMol> color yellow, active_site`<br>

<details>
  <summary>Click to see Step 10 Results</summary>
    
  <img src="images/step10.png" width=600 /><br><br>
    
  ##### RESULT:
    - Selector: selection "active_site" defined with 92 atoms
    - Executive: Colored 92 atoms
    
    The active site is highlighted with spheres in yellow, making it stand out for better visualization.

</details>

#### **Step 11: Show Interacting Residues Between the Structures**
1. Show residues from one structure that lie within a certain distance of the other.<br>
`PyMol> show sticks, byres (1BVC within 3 of 1A3N)  # Show whole 1BVC residues that have any atom within 3 Å of 1A3N`

<details>
  <summary>Click to see Step 11 Results</summary>
    
  ##### RESULT:
    Interacting residues are displayed in stick representation, which can be useful for studying potential binding interactions.

</details>

#### **Step 12: Surface Representation of Ligand Binding Pocket**
1. Use the surface representation with adjusted transparency to highlight the ligand binding pocket.<br>
`PyMol> show surface, 1A3N`<br>
`PyMol> set transparency, 0.5, 1A3N  # Use surface representation and adjust transparency to show the ligand binding pocket clearly`

<details>
  <summary>Click to see Step 12 Results</summary>
  
  <img src="images/step12.png" width=600 /><br><br>
    
  ##### RESULT:
    - Setting: transparency set to 0.50000 in object "1A3N"
    
    The surface of 1A3N is shown with a transparency of 50%, making it easy to visualize the underlying binding pocket while also seeing through to the cartoon representation.

</details>

#### **Additional Analyses:**
You can perform additional analyses on the obtained results, such as:

1. <u>Active Site Characterization</u>: Using the active site selections (active_site), you can dive deeper into the geometry of residues involved in binding or catalysis. Visualizing these residues in different representations (e.g., spheres or sticks) can reveal key structural features important for ligand binding, which is particularly useful in drug design.

2. <u>Protein-Ligand Interaction Analysis</u>: With the distances (my_distance) and angles (my_angle) calculated between atoms, you can understand the stability and orientation of a ligand within a binding site. Such information is critical when optimizing a ligand to enhance binding affinity or designing inhibitors that fit precisely.

3. <u>Comparative Structure Analysis</u>: The alignment steps and the RMSD values we derived allow us to understand how similar two protein structures are. This kind of analysis is essential to determine conserved structural features among homologous proteins, potentially hinting at similar biological functions or evolutionary relationships.

#### **Key Takeaways:**
- Highlighting close contacts and coloring interacting residues provides a way to focus on the critical regions that mediate protein-protein or protein-ligand interactions. This helps in understanding how changes in these regions (e.g., mutations) might affect overall protein stability or function.
- By using the PyMOL commands that we discussed, you can extend this analysis further, whether for drug discovery, protein engineering, or understanding the effects of mutations on protein activity. These approaches offer detailed insights into the structural relationships between proteins, their functional domains, and how they can interact with ligands or other biomolecules.

---------------
## Conclusions
After completing this submodule, users should be able to describe the anatomy of protein structures in detail, measure backbone and side chain torsion angles (phi, psi, and chi) from the PDB file of any protein 3D structure, and analyze a Ramachandran plot for allowed phi and psi values. They should also be able to analyze protein sequence information and compare the sequences of homologous proteins using Python code.
## Clean Up
<div class="alert alert-block alert-warning"> <b>Attention:</b> Remember to shut down the VM and delete any relevant resources. </div>