
### **KinDyn Project Overview**

**Description:**  
The **KinDyn** project filtered 3DED (three-dimensional electron diffraction) data with a purpose-built program, **BKinD**, which refines the dataset using a per-reflection statistic called the **Deviation from Model (DFM)**. Filtering was applied iteratively until the dataset reached a target completeness, with the goal of analyzing how the filtering process affected key crystallographic metrics and structure determination outcomes.

**DFM Equation:**  
The DFM for each reflection was calculated as follows:

$$
DFM = \frac{F_o^2 - F_c^2}{\sqrt{(\sigma_{F_o^2})^2 + (2 \cdot u \cdot F_c^2)^2}}
$$

where:
- $F_o^2$ is the squared observed structure factor,
- $F_c^2$ is the squared calculated structure factor,
- $\sigma_{F_o^2}$ is the uncertainty in $F_o^2$,
- $u$ is the instability factor, which scales the contribution of $F_c^2$ to the denominator.
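
As a minimal sketch of the formula above, the DFM can be evaluated for a whole reflection list at once with NumPy. The function name `compute_dfm` and the three example reflections are made up for illustration and are not taken from BKinD.

```python
import numpy as np

def compute_dfm(fo_sq, fc_sq, sigma_fo_sq, u):
    """Return the DFM of each reflection, term by term as in the formula above."""
    fo_sq = np.asarray(fo_sq, dtype=float)
    fc_sq = np.asarray(fc_sq, dtype=float)
    sigma_fo_sq = np.asarray(sigma_fo_sq, dtype=float)
    # Denominator: measurement uncertainty combined with the model-dependent term.
    denom = np.sqrt(sigma_fo_sq**2 + (2.0 * u * fc_sq) ** 2)
    return (fo_sq - fc_sq) / denom

# Hypothetical values for three reflections (illustration only).
fo_sq = [100.0, 50.0, 10.0]
fc_sq = [90.0, 55.0, 10.0]
sigma_fo_sq = [3.0, 4.0, 1.0]

dfm = compute_dfm(fo_sq, fc_sq, sigma_fo_sq, u=0.02)
print(dfm)
```

Because $u$ only enters the denominator, a larger $u$ shrinks the magnitude of the DFM most strongly for reflections with large $F_c^2$.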

The DFM equation is inspired by a weighting scheme discussed on page 221 of the **JANA98 Software Manual**. The denominator combines the measured uncertainty $\sigma_{F_o^2}$ with a term proportional to $F_c^2$, so a given difference $F_o^2 - F_c^2$ yields a smaller DFM when the reflection has a larger uncertainty or a larger calculated intensity.

The instability factor $u$ was optimized so that the **mean** of the DFM distribution equaled its **median**. Matching the two serves as a simple symmetry criterion: $u$ is adjusted until the deviations are not skewed toward positive or negative values, which was intended to minimize bias in the data.
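
One way to picture this optimization is a plain grid search over $u$ that minimizes the gap between the mean and the median of the DFM values. This is only a sketch: the optimizer actually used in BKinD is not described here, and the grid range, the function names, and the synthetic data below are all assumptions.

```python
import numpy as np

def dfm_values(fo_sq, fc_sq, sigma_fo_sq, u):
    """DFM of each reflection for a given instability factor u."""
    return (fo_sq - fc_sq) / np.sqrt(sigma_fo_sq**2 + (2.0 * u * fc_sq) ** 2)

def optimize_u(fo_sq, fc_sq, sigma_fo_sq, u_grid=None):
    """Return (u, gap) for the grid point with the smallest |mean - median|."""
    if u_grid is None:
        u_grid = np.linspace(0.0, 0.2, 2001)  # assumed search range
    gaps = np.empty(len(u_grid))
    for i, u in enumerate(u_grid):
        d = dfm_values(fo_sq, fc_sq, sigma_fo_sq, u)
        gaps[i] = abs(np.mean(d) - np.median(d))
    best = int(np.argmin(gaps))
    return float(u_grid[best]), float(gaps[best])

# Synthetic reflections (illustration only, not project data).
rng = np.random.default_rng(0)
fc_sq = rng.exponential(scale=100.0, size=500)
sigma_fo_sq = 1.0 + 0.05 * fc_sq
fo_sq = fc_sq + rng.normal(0.0, sigma_fo_sq)

u_best, gap = optimize_u(fo_sq, fc_sq, sigma_fo_sq)
print(u_best, gap)
```

A grid search is used here because the gap is not guaranteed to change sign or to be smooth in $u$, so a root finder could fail on some datasets.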

The filtering process progressively removed reflections with high DFM values, under the assumption that these reflections diverged significantly from the expected model.
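
The loop below is a hedged sketch of such a filter, not the BKinD implementation. It assumes that reflections are ranked by the absolute value of the DFM, that completeness is the number of distinct indices still present divided by an expected count (`n_expected_unique`, a hypothetical input), and that the `hkl` indices have already been reduced to the asymmetric unit.

```python
import numpy as np

def filter_to_completeness(hkl, dfm, n_expected_unique, target_completeness):
    """Remove reflections in order of decreasing |DFM| until the next removal
    would push completeness below the target.

    Returns a boolean mask: True = kept, False = removed.
    """
    hkl = [tuple(int(i) for i in idx) for idx in hkl]
    counts = {}
    for idx in hkl:
        counts[idx] = counts.get(idx, 0) + 1
    n_unique = len(counts)

    keep = np.ones(len(hkl), dtype=bool)
    order = np.argsort(-np.abs(np.asarray(dfm, dtype=float)), kind="stable")
    for i in order:
        idx = hkl[i]
        loses_unique = counts[idx] == 1  # last remaining copy of this index
        if loses_unique and (n_unique - 1) / n_expected_unique < target_completeness:
            break  # stop: this removal would fall below the target
        keep[i] = False
        counts[idx] -= 1
        if loses_unique:
            n_unique -= 1
    return keep

# Hypothetical five-reflection example.
hkl = [(1, 0, 0), (1, 0, 0), (1, 1, 0), (1, 1, 1), (2, 0, 0)]
dfm = [5.0, 0.1, -4.0, 0.2, 3.0]
keep = filter_to_completeness(hkl, dfm, n_expected_unique=4, target_completeness=0.75)
print(keep)  # the mask splits the data into "remaining" (True) and "removed" (False)
```

The returned mask gives both subsets at once, which matches the way the removed and remaining data are analyzed side by side in the rest of this summary.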

---

### **Filtering Process and Objectives**

The primary objective of the project was to understand how DFM filtering affected key metrics used in crystallographic analysis, such as:
- **R1 and Rint values**: R1 measures the agreement between observed and calculated structure factors, while Rint measures the agreement among symmetry-equivalent reflections.
- **Structure Solvability**: The ability to solve the crystal structure based on the filtered dataset.
- **Atomic Displacement Parameters (ADPs)**: How filtering affects the refined parameters that describe atomic displacement within the crystal.
- **Resolution**: The level of detail in the resulting crystal structure.

Both the **removed data** (high DFM reflections) and the **remaining data** (after filtering) were analyzed at various stages of the process.

---

### **Impact of Filtering on Crystallographic Metrics**

#### 1. **R1 and Rint Values:**
   - R1 and Rint values were calculated after each filtering step to observe how the removal of high-DFM reflections affected the fit between observed and calculated structure factors (R1) and the agreement among symmetry-equivalent reflections (Rint).
   - The goal was to assess whether the removal of these reflections improved the quality of the remaining data, as indicated by a reduction in R1 and Rint values.
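
For reference, the two metrics can be sketched with their conventional definitions, $R_1 = \sum ||F_o| - |F_c|| / \sum |F_o|$ and $R_{int} = \sum |F_o^2 - \langle F_o^2 \rangle| / \sum F_o^2$, where the mean runs over symmetry equivalents. In the project these values would come from the refinement software, so the helper functions below are illustrative assumptions. They also assume that each reflection already carries the index of its symmetry-unique representative.

```python
from collections import defaultdict

import numpy as np

def r1(fo_sq, fc_sq):
    """R1 on |F|; negative intensities are clipped to zero before the square root."""
    fo = np.sqrt(np.clip(fo_sq, 0.0, None))
    fc = np.sqrt(np.clip(fc_sq, 0.0, None))
    return float(np.sum(np.abs(fo - fc)) / np.sum(fo))

def r_int(unique_hkl, fo_sq):
    """Rint over groups of equivalents that were measured more than once."""
    groups = defaultdict(list)
    for idx, value in zip(unique_hkl, fo_sq):
        groups[tuple(idx)].append(float(value))
    num = 0.0
    den = 0.0
    for values in groups.values():
        if len(values) < 2:
            continue  # a single measurement has no internal agreement to assess
        mean = sum(values) / len(values)
        num += sum(abs(v - mean) for v in values)
        den += sum(values)
    return num / den

# Hypothetical values (illustration only).
print(r1([100.0, 81.0], [81.0, 100.0]))
print(r_int([(1, 0, 0), (1, 0, 0), (1, 1, 0)], [10.0, 12.0, 7.0]))
```

Evaluating both functions on the kept subset after each filtering step gives the kind of R1 and Rint versus completeness curve discussed here.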

#### 2. **Structure Solvability:**
   - Structure solvability was evaluated for both the removed and remaining data after each filtering step, with the main focus on the point at which the structure could be solved from the **removed data** alone.
   - A solvable removed set indicated that filtering had gone too far: reflections that still carry enough information to solve the structure were being discarded. This served as a signal that further filtering was no longer justifiable, since removing such reflections undermines the validity of the filtering process.

#### 3. **Atomic Displacement Parameters (ADPs) – Removed Data Focus:**
   - ADP positions, obtained through refinement using the correct structure, stabilized progressively with **successive filtering** of the **removed data**. This stabilization served as a **proxy** for structure solvability without the need to explicitly solve the structure using SHELXT and SHELXLE.
   - The stabilization of ADP positions in the removed data indicated that critical reflections were being discarded, signaling that further filtering might lead to a loss of important structural information and should be avoided.

#### 4. **Resolution:**
   - The effect of filtering on resolution was also evaluated, with the goal of determining if certain filtering steps led to a loss of high-resolution data or compromised the overall clarity of the structure.
   - Any shifts in the resolution limit were documented to understand the trade-off between filtering and data completeness, ensuring that high-quality, high-resolution data was retained as much as possible.
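
To track the resolution limit of the remaining data, each reflection needs a d-spacing. The sketch below uses the standard relation $1/d^2 = \mathbf{h}^T G^* \mathbf{h}$, where $G^*$ is the inverse of the direct metric tensor. The cell parameters and the `keep` mask in the example are hypothetical and do not correspond to any sample in this project.

```python
import numpy as np

def d_spacing(hkl, a, b, c, alpha, beta, gamma):
    """d-spacing in Å for each hkl, given cell lengths (Å) and angles (degrees).

    The (0, 0, 0) index is not a reflection and would cause a division by zero.
    """
    al, be, ga = np.radians([alpha, beta, gamma])
    # Direct-space metric tensor.
    g = np.array([
        [a * a, a * b * np.cos(ga), a * c * np.cos(be)],
        [a * b * np.cos(ga), b * b, b * c * np.cos(al)],
        [a * c * np.cos(be), b * c * np.cos(al), c * c],
    ])
    g_star = np.linalg.inv(g)  # reciprocal-space metric tensor
    h = np.asarray(hkl, dtype=float)
    inv_d_sq = np.einsum("ni,ij,nj->n", h, g_star, h)
    return 1.0 / np.sqrt(inv_d_sq)

# Hypothetical cubic cell with a = 10 Å.
hkl = [(1, 0, 0), (2, 0, 0), (1, 1, 0)]
d = d_spacing(hkl, 10.0, 10.0, 10.0, 90.0, 90.0, 90.0)

keep = np.array([True, False, True])  # hypothetical mask from a filtering step
print(d, d[keep].min())  # smallest kept d-spacing = resolution limit of the kept data
```

Comparing the smallest d-spacing of the kept data before and after each step shows directly whether a filtering step moved the resolution limit.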

---

### **Analysis of DFM Across Reflections and Frames**

#### 1. **DFM vs. Frame:**
   - The DFM was calculated for each reflection across multiple frames of data collection. By tracking the DFM across frames, it was possible to determine if certain frames consistently produced higher deviations, potentially indicating areas of poorer data quality.
   - This frame-by-frame analysis provided insights into how the quality of data varied during the experiment and guided decisions on which data to retain.
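
A frame-by-frame summary of this kind can be sketched by grouping reflections on their frame number and averaging the absolute DFM in each group. The choice of the mean absolute DFM as the per-frame statistic, along with the function name and the example values, is an assumption made for illustration.

```python
import numpy as np

def mean_abs_dfm_per_frame(frames, dfm):
    """Return (sorted frame numbers, mean |DFM| of the reflections on each frame)."""
    frames = np.asarray(frames)
    abs_dfm = np.abs(np.asarray(dfm, dtype=float))
    unique_frames, inverse = np.unique(frames, return_inverse=True)
    sums = np.bincount(inverse, weights=abs_dfm)  # sum of |DFM| per frame
    counts = np.bincount(inverse)                 # reflections per frame
    return unique_frames, sums / counts

# Hypothetical frame assignments and DFM values.
frames = [1, 1, 2, 3, 3, 3]
dfm = [1.0, -3.0, 2.0, 0.0, 6.0, -3.0]
frame_ids, mean_abs = mean_abs_dfm_per_frame(frames, dfm)
print(frame_ids, mean_abs)
```

Frames whose mean stands well above the others are candidates for closer inspection.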

#### 2. **DFM Distribution:**
   - The distribution of DFM values was examined to understand the overall spread of deviations within the dataset. This helped in identifying outliers and assessing the impact of filtering on the remaining reflections.
   - A comparison of DFM distributions before and after filtering highlighted how the data quality improved, as the dataset became progressively "cleaner" with fewer high-DFM reflections.
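
A before-and-after comparison is easiest to read when both histograms share the same bin edges. The sketch below takes the edges from the unfiltered data and reuses them for the filtered subset. The bin count and the example values are arbitrary assumptions.

```python
import numpy as np

def compare_dfm_histograms(dfm_before, dfm_after, n_bins=20):
    """Histogram both DFM sets on bin edges derived from the unfiltered data."""
    before = np.asarray(dfm_before, dtype=float)
    after = np.asarray(dfm_after, dtype=float)
    edges = np.histogram_bin_edges(before, bins=n_bins)
    counts_before, _ = np.histogram(before, bins=edges)
    counts_after, _ = np.histogram(after, bins=edges)
    return edges, counts_before, counts_after

# Hypothetical DFM values; the filtered set drops the two extreme reflections.
dfm_before = [-4.0, -1.0, 0.5, 0.7, 1.0, 6.0]
dfm_after = [-1.0, 0.5, 0.7, 1.0]
edges, counts_before, counts_after = compare_dfm_histograms(dfm_before, dfm_after, n_bins=5)
print(edges, counts_before, counts_after)
```

The resulting counts can be passed to any plotting library, for example as two bar traces in Plotly.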

---

### **Conclusion and Insights**

The filtering of 3DED data using the DFM equation had several key effects on crystallographic analysis:

- **R1 and Rint values:** As expected, **R1 and Rint values decreased** with filtering, particularly when the dataset had sufficient initial completeness and multiplicity. This decrease occurred because reflections that deviated significantly from the model were progressively removed, improving the overall fit. However, this improvement depended on maintaining enough data to avoid over-filtering and losing too many reflections.

- **Structure solvability (removed data):** While filtering did not noticeably improve the **structure solvability** of the remaining data, it revealed an important insight when examining the **removed data**. When the structure could be solved using the removed data, it indicated that critical reflections had been discarded. This provided a clear signal that filtering had gone too far, and further filtering was no longer justifiable. The ability to solve the structure from the removed data became a key criterion for assessing the appropriateness of the filtering process.

- **ADPs as a proxy for structure solvability:** The stabilization of ADP positions in the **removed data** served as a proxy for tracking structure solvability without having to solve the structure at each filtering step. As filtering progressed, ADP stabilization marked the point at which important reflections were being removed, signaling that further filtering could compromise the dataset’s integrity. This allowed for efficient tracking of filtering effects without repeated structure solution attempts.

- **Resolution impact:** The filtering process predominantly removed **low-resolution reflections** (larger d-spacings in Å), so higher-resolution data was generally retained as filtering advanced. However, aggressive filtering could still remove essential reflections, reducing the dataset's completeness and the overall quality of the final structure.

The project demonstrated that the DFM filtering method was effective in refining the dataset by removing reflections that deviated from the model. However, careful control was required to avoid discarding critical reflections, particularly those necessary for structure solution. The stabilization of ADP positions in the removed data provided a valuable tool for determining when further filtering was no longer appropriate. Future work could focus on refining the filtering process to optimize data retention while improving overall data quality.

---

### SCXRD Filtering and Comparison with 3DED Data

As part of the **KinDyn** project, a version of the filtering program **BKinD** was developed for **SCXRD data**. This version is also applicable to merged **3DED data**, allowing for a comparison between SCXRD and 3DED results. The comparison provides insights into how filtering affects different types of diffraction data and the impact on key crystallographic metrics.

---

### Center Beam Intensity and DFM Correlation

An additional outcome of the **KinDyn** project was an observed correlation between **center beam intensity** and **high DFM values** in 3DED data for LTA and STW samples collected on the **Themis Z**. This discovery led to a follow-up project exploring the relationship between center beam intensity and dynamical effects, with the goal of improving data merging in diffraction experiments.


---

### Example Plots and Filtering Statistics

Here are several example plots showing the outcome of the filtering process for the **KinDyn** project. These plots, created using **Plotly**, explore various metrics such as **DFM values**, **completeness**, **R1 and Rint values**, and more.

- [ADP vs Target Completeness](assets/ADP_vs_TC.html)
- [DFM Distribution](assets/DFM_Distribution.html)
- [DFM vs Frame](assets/DFM_vs_Frame.html)
- [R1 and Rint vs Completeness](assets/R1_Rint_vs_Completeness.html)
- [Resolution vs DFM](assets/Resolution_vs_DFM.html)
- [FVAR NPD vs Completeness](assets/FVAR_NPD_vs_Completeness.html)

If the links above don't work, you can find these files in the `assets` folder that comes with the notebook.

Additionally, filtering statistics for the process are provided in the following `.txt` file:

- [Download Filtering Statistics](assets/filtering_stats.txt)

If you are unable to open the link, the file is also located in the `assets` folder.

---


### Project Report: Studying Structure Solvability of Removed Data After BKinD Filtering for LTA and FEACAC Samples

This report is one outcome of the **KinDyn** project, focusing on the structure solvability of the **removed data** after filtering for LTA and FEACAC samples. The analysis explores the threshold where filtering becomes counterproductive by discarding reflections still valuable for structure determination.

The document contains detailed findings, including the completeness of removed data, structure solvability insights, and key metrics for both LTA zeolite and FEACAC samples.

[Download the full project report](assets/Solving_Structure_Filtered_Diffraction_Data.pdf)

If the link does not work, you can find the report in the `assets` folder that comes with the notebook.

---

### Project Report: Analysis of Atom Removal and DFM Filtering Effects on Highest Difference Peak for LTA and STW Zeolites

This report is another outcome of the **KinDyn** project, focusing on the effects of **Atom Removal** and **BKinD** filtering on the **Highest Difference Peak (HDP)** for LTA and STW zeolite samples. The analysis examines how filtering impacts HDP and target completeness when one atom is removed from the structure.

The report includes detailed findings, SHELXL maps comparing filtered and unfiltered data, and insights on how filtering and atom removal affect the HDP and data quality.

[Download the full project report](assets/Highest_Diff_Peak_Analysis.pdf)

If the link does not work, you can find the report in the `assets` folder that comes with the notebook.

---
