# LiteFarm Project Proposal 

Authors: He Ma, Hanchang Qin, Yi Han

Project Mentor: Simon Goring


## Executive Summary

This project aims to refine and enhance the existing greenhouse gas (GHG) modeling for the LiteFarm platform, focusing specifically on nitrous oxide (N2O) emissions. It has two main objectives. First, it will improve the existing modeling framework by incorporating dynamic global data for greater accuracy and broader applicability. Second, it will conduct a sensitivity analysis to identify the factors that most influence emissions estimates and so guide targeted data collection. Deliverables include an updated dashboard for visualizing emissions and a Python script for automated data integration and GHG calculation. The broader goal is to provide actionable insights that help stakeholders make decisions about sustainable agricultural practices.


## 1. Introduction

Climate change poses a critical challenge to the sustainability of our planet, largely due to rising greenhouse gas (GHG) emissions. Agriculture is a major contributor: livestock rearing, fertilizer use, and land management practices all add substantially to GHG emissions. Quantifying agricultural GHG emissions can help optimize farming practices, from crop selection to tillage methods, and thereby reduce emissions. Our broader objective is to empower stakeholders, including farmers, researchers, and policymakers, to make data-driven decisions, transforming sustainability into a measurable and actionable goal.

This project focuses on a potent greenhouse gas, nitrous oxide (N2O). We narrow the broad problem to two objectives: (1) enhance the existing GHG emissions modeling framework by integrating comprehensive global data; and (2) conduct sensitivity analyses to determine how changes in input parameters affect GHG emission estimates.


### 1.1 Enhanced Modeling

The agricultural GHG model Holos uses static model parameters derived from a Canadian context. Our enhanced model will incorporate dynamic datasets that reflect global soil, climate, and crop characteristics, with the aim of making it more robust and adaptable to agricultural practices worldwide.


### 1.2 Sensitivity Analysis

Sensitivity analysis will pinpoint the factors that most strongly influence emissions estimates. These insights can help refine our data-gathering methods, improving both the accuracy and the dependability of our models. We plan to extend the analysis to carbon dioxide (CO2) and methane (CH4) in the future, as more data become available from our partner.


### 1.3 Deliverables  


Our project will deliver a comprehensive data product suite tailored for the LiteFarm platform. 

* An updated module in the LiteFarm dashboard that allows users to calculate and visualize GHG emissions under various farming scenarios. This feature will support decision-making processes by providing actionable insights to the users.
* A Python script that calculates GHG emissions with our updated GHG modeling framework. The script will include a data pipeline that automates the collection, processing, and integration of various data sources into the LiteFarm system, so that the data used in the calculations stay up to date and relevant.
* Comprehensive documentation and user guides articulating the functionalities of the new features, including methodologies and data sources used in the GHG models and instructions for utilizing the dashboard.

By delivering these components, our project will provide LiteFarm and its users with a tool to address the pressing issue of GHG emissions in agriculture, thereby contributing to the global effort to combat climate change.

## 2. Data Science Techniques


### 2.1 Data description

The GHG modeling framework employed by LiteFarm is derived from the Holos project [1], an open-source software package developed for estimating GHG emissions in Canadian farming systems. This model requires 22 distinct parameters for estimating N2O emissions, including farm-specific, crop-specific, climatic, and soil parameters, as detailed in Appendix Table A1.

To enhance the existing modeling framework, we have identified three key data sources:

First, farm data collected by the LiteFarm team cover 124 farms throughout Canada, providing details such as geographic location, farm size, types of crops planted, and estimated yield (Fig. 1 and Appendix Table A1).




Fig. 1. The 124 farms in Canada from the LiteFarm database.

Second, based on farms’ geographic locations, we will integrate high-quality soil and climate parameters from external databases (identified in Fig. 2).



Fig. 2. The external data sources.

Third, the Holos framework provides crop-specific parameters for only a small set of crop types, excluding many crop varieties found in the LiteFarm data. These parameters are also static and lack specified ranges, limiting our ability to assess their variability. To address these limitations, we plan to expand and refine the crop-specific parameters using data from peer-reviewed articles and government reports.


#### Data Management Plan


* Climate data: Scripts will be provided to automate NASA POWER Project API calls and data download.
* Soil data: Because the soil datasets are large, we will provide scripts for downloading and setting them up.
* Crop data: Data will be stored directly in the repository in CSV format.
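
As a rough illustration of the climate data step, the sketch below builds a request URL for daily precipitation over May to October and sums the values in a parsed response. The endpoint path, the `PRECTOTCORR` parameter code, the `AG` community value, the response layout, and the treatment of negative values as missing data are all assumptions that should be checked against the NASA POWER API documentation; the function names are hypothetical.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Assumed endpoint for daily point data; verify against the NASA POWER docs.
POWER_DAILY_URL = "https://power.larc.nasa.gov/api/temporal/daily/point"


def build_power_url(lat, lon, year, parameter="PRECTOTCORR"):
    """Build a request URL for one farm location and one growing season."""
    query = {
        "parameters": parameter,   # assumed code for corrected precipitation
        "community": "AG",         # assumed value for agricultural users
        "latitude": lat,
        "longitude": lon,
        "start": f"{year}0501",    # 1 May
        "end": f"{year}1031",      # 31 October
        "format": "JSON",
    }
    return f"{POWER_DAILY_URL}?{urlencode(query)}"


def growing_season_total(payload, parameter="PRECTOTCORR"):
    """Sum daily values for May to October from a parsed JSON response.

    Assumes the response nests values under properties -> parameter ->
    <code>, keyed by YYYYMMDD strings, and that negative values mark
    missing data.
    """
    daily = payload["properties"]["parameter"][parameter]
    total = 0.0
    for date_key, value in daily.items():
        month = int(date_key[4:6])
        if 5 <= month <= 10 and value >= 0:
            total += value
    return total


def fetch_growing_season_precip(lat, lon, year):
    """Download the data and return the growing season total."""
    with urlopen(build_power_url(lat, lon, year), timeout=30) as response:
        return growing_season_total(json.load(response))
```

Keeping the URL construction and the aggregation separate from the download makes both easy to check without network access.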

### 2.2 Method and techniques

The project aims to enhance the LiteFarm dashboard by integrating a crop residue nitrogen estimation feature. Our initial task is to develop a Python script replicating the Holos model to serve as a baseline. This model will retrieve farm data from the LiteFarm database via SQLAlchemy. Next, we will enhance the model to support multiple input variables. Data from external sources will be retrieved through API calls. In parallel, we will conduct sensitivity analyses to identify key drivers of GHG emission changes across different conditions. 
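
One simple way to approach the sensitivity analysis is a one-at-a-time perturbation, sketched below under stated assumptions. Each input is increased by a fixed relative amount while the others stay at baseline, and the inputs are ranked by the resulting relative change in output. The model function `toy_residue_n` is a hypothetical placeholder, not the Holos equation, and the function names and the 10% step are illustrative choices only.

```python
def one_at_a_time_sensitivity(model, baseline, relative_change=0.10):
    """Rank inputs by the relative change in output when each is perturbed.

    `model` takes keyword arguments named after the keys of `baseline`.
    The baseline output is assumed to be nonzero.
    """
    base_output = model(**baseline)
    effects = {}
    for name, value in baseline.items():
        perturbed = dict(baseline)
        perturbed[name] = value * (1 + relative_change)
        effects[name] = (model(**perturbed) - base_output) / base_output
    # Largest absolute effect first.
    ranked = sorted(effects.items(), key=lambda item: abs(item[1]), reverse=True)
    return dict(ranked)


def toy_residue_n(yield_kg_ha, moisture_pct, n_conc):
    """Hypothetical placeholder model, NOT the Holos equation."""
    dry_matter = yield_kg_ha * (1 - moisture_pct / 100)
    return dry_matter * n_conc


if __name__ == "__main__":
    # Illustrative baseline values only, not taken from LiteFarm data.
    baseline = {"yield_kg_ha": 3000.0, "moisture_pct": 12.0, "n_conc": 0.02}
    for name, effect in one_at_a_time_sensitivity(toy_residue_n, baseline).items():
        print(f"{name}: {effect:+.3f}")
```

A one-at-a-time design ignores interactions between inputs, so a variance-based method may be preferable once the full model is in place.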

The dashboard will be updated with a Farmer tab and a Scientist tab. The Farmer tab displays GHG emission data for selected farms, and the Scientist tab provides sensitivity analysis results, highlighting influential factors for emissions estimates.


### 2.3 Partner’s expectations

LiteFarm expects an emissions tab on the dashboard that shows direct emissions from crop residue nitrogen for selected farms. A successful outcome would include validating LiteFarm’s GHG model against the Holos desktop version to ensure that the estimates are reasonable and reliable.
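
The validation against Holos could be summarized with a few agreement metrics, as in the minimal sketch below. It assumes paired estimates for the same farms from our script and from the Holos desktop software; the function name and the choice of metrics (mean absolute error, mean bias, and maximum relative error) are our own illustrative assumptions, not a procedure defined by LiteFarm or Holos.

```python
def compare_estimates(ours, reference):
    """Summarize agreement between two paired lists of emission estimates."""
    if not ours or len(ours) != len(reference):
        raise ValueError("inputs must be non-empty and of equal length")
    errors = [o - r for o, r in zip(ours, reference)]
    n = len(errors)
    relative = [abs(e) / abs(r) for e, r in zip(errors, reference) if r != 0]
    return {
        "mae": sum(abs(e) for e in errors) / n,   # mean absolute error
        "bias": sum(errors) / n,                  # mean signed error
        "max_relative_error": max(relative, default=0.0),
    }
```

A tolerance on `max_relative_error` could then serve as a pass or fail check, with the threshold agreed with the partner.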


### 2.4 Success criteria

To meet the partner’s expectations, we have set the following success criteria:



1. Precision in GHG Estimation: The precision of our GHG estimates will be assessed by comparing them against those produced by the Holos software. 
2. Flexibility in Input Handling: Success will be evaluated by the system’s ability to accept a range of input variables, thereby providing more flexible and practical calculations for users. 
3. Uncertainty Measurement: Providing outputs with uncertainty measurements from the sensitivity analysis will help users understand the reliability of the GHG estimates and make informed decisions. 
4. Dashboard Enhancement: The effective establishment of Farmer and Scientist tabs within the LiteFarm dashboard will facilitate communication by showcasing GHG estimation results and insights from sensitivity analysis.
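
To illustrate the uncertainty measurement in criterion 3, the sketch below propagates parameter ranges through a model by Monte Carlo sampling and reports the mean with an approximate 95% interval. Sampling each parameter uniformly and independently within its range is an assumption made for simplicity, and the function name and defaults are hypothetical; the real ranges would come from the expanded crop parameter data.

```python
import random
import statistics


def monte_carlo_interval(model, ranges, n_draws=5000, seed=0):
    """Return (mean, lower, upper) of model output over sampled inputs.

    `ranges` maps each keyword argument of `model` to a (low, high) pair.
    Inputs are drawn uniformly and independently (an assumption).
    """
    rng = random.Random(seed)  # fixed seed for reproducible results
    outputs = []
    for _ in range(n_draws):
        sample = {name: rng.uniform(low, high) for name, (low, high) in ranges.items()}
        outputs.append(model(**sample))
    outputs.sort()
    # Empirical 2.5th and 97.5th percentiles of the sorted outputs.
    lower = outputs[int(0.025 * (n_draws - 1))]
    upper = outputs[int(0.975 * (n_draws - 1))]
    return statistics.mean(outputs), lower, upper
```

The interval width gives users a direct sense of how much the estimate depends on poorly constrained inputs.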

## 3. Timeline


<table>
  <tr>
   <td>Date
   </td>
   <td>Milestone
   </td>
  </tr>
  <tr>
   <td>Week 1: 29 April - 5 May
   </td>
   <td>
<ul>

<li>Define project questions / objectives

<li>Initial EDA

<li>Explore external data
</li>
</ul>
   </td>
  </tr>
  <tr>
   <td>Week 2: 6 - 12 May
   </td>
   <td>
<ul>

<li>Establish GHG model

<li>Project Proposal
</li>
</ul>
   </td>
  </tr>
  <tr>
   <td>Week 3: 13 - 19 May
   </td>
   <td>
<ul>

<li>Refine GHG model

<li>Conduct sensitivity analysis
</li>
</ul>
   </td>
  </tr>
  <tr>
   <td>Week 4: 20 - 26 May
   </td>
   <td>
<ul>

<li>Sensitivity analysis 
<ul>
 
<li>Integrate external data
 
<li>Pinpoint key variables
</li> 
</ul>
</li> 
</ul>
   </td>
  </tr>
  <tr>
   <td>Week 5: 27 May - 2 June
   </td>
   <td>
<ul>

<li>Set up initial dashboard version
</li>
</ul>
   </td>
  </tr>
  <tr>
   <td>Week 6: 3 - 9 June
   </td>
   <td>
<ul>

<li>Refine dashboard
</li>
</ul>
   </td>
  </tr>
  <tr>
   <td>Week 7: 10 - 16 June
   </td>
   <td>
<ul>

<li>Capstone presentation 

<li>Draft final report 
</li>
</ul>
   </td>
  </tr>
  <tr>
   <td>Week 8: 17 - 23 June
   </td>
   <td>
<ul>

<li>Finalize dashboard

<li>Complete final report
</li>
</ul>
   </td>
  </tr>
</table>

## 4. References

[1] Minister of Agriculture & Agri-Food Canada. (2022). Holos [Computer software]. GitHub. https://github.com/holos-aafc/Holos

## 5. Appendix

Table A1: Input requirements for Holos GHG model


<table>
  <tr>
   <td><strong>Parameters</strong>
   </td>
   <td><strong>Explanation</strong>
   </td>
   <td><strong>Type</strong>
   </td>
   <td><strong>Data source</strong>
   </td>
  </tr>
  <tr>
   <td>Farm 
   </td>
   <td>A unique identifier, each represents a specific farm
   </td>
   <td>Farm-specific
   </td>
   <td>LiteFarm
   </td>
  </tr>
  <tr>
   <td>Crop common name
   </td>
   <td>Text, common name of the crop grown by the farm, e.g., soybeans, wheat, etc.
   </td>
   <td>Farm-specific
   </td>
   <td>LiteFarm
   </td>
  </tr>
  <tr>
   <td>Total area (ha)
   </td>
   <td>Numerical, total area of the farm
   </td>
   <td>Farm-specific
   </td>
   <td>LiteFarm
   </td>
  </tr>
  <tr>
   <td>Estimated yield (kg / ha)
   </td>
   <td>Numerical, the estimated yield
   </td>
   <td>Farm-specific
   </td>
   <td>LiteFarm
   </td>
  </tr>
  <tr>
   <td>Lifecycle
   </td>
   <td>Binary, perennial or annual
   </td>
   <td>Farm-specific
   </td>
   <td>LiteFarm
   </td>
  </tr>
  <tr>
   <td>Province
   </td>
   <td>Required by existing Holos framework
   </td>
   <td>Farm-specific
   </td>
   <td>Team extracted
   </td>
  </tr>
  <tr>
   <td>Moisture  (%)
   </td>
   <td>Moisture content of product 
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>N_p
   </td>
   <td>N concentration in the product (kg kg<sup>-1</sup>)
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>N_s
   </td>
   <td>N concentration in the straw (kg kg<sup>-1</sup>)
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>R_s
   </td>
   <td>Relative biomass allocation coefficient for straw
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>R_p
   </td>
   <td>Relative biomass allocation coefficient for product
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>N_r
   </td>
   <td>N concentration in the roots (kg kg<sup>-1</sup>)
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>N_e
   </td>
   <td>N concentration in the extra-root material (kg kg<sup>-1</sup>) (until known from literature, the same N concentration used for roots will be utilized)
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>R_r
   </td>
   <td>Relative biomass allocation coefficient for roots
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>R_e
   </td>
   <td>Relative biomass allocation coefficient for extra-root material
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>RF_CS
   </td>
   <td>Reduction factor for Cropping System
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>RF_NS
   </td>
   <td>N source modifier RF_NSk (SN = Synthetic Nitrogen; ON = Organic Nitrogen; CRN = Crop Residue Nitrogen)
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>RF_AM
   </td>
   <td>Reduction factor based on application method, only applicable to calculations of EF specific for SN
   </td>
   <td>Crop-related
   </td>
   <td>Holos default
<p>
External information
   </td>
  </tr>
  <tr>
   <td>P_i
   </td>
   <td>Annual growing season precipitation (May – October), in ecodistrict “i” (mm)
   </td>
   <td>Climate-related
   </td>
   <td>Holos default
<p>
NASA POWER Project database
   </td>
  </tr>
  <tr>
   <td>PE
   </td>
   <td>Growing season potential evapotranspiration, by ecodistrict (May – October)
   </td>
   <td>Climate-related
   </td>
   <td>Holos default
<p>
NASA POWER Project database
   </td>
  </tr>
  <tr>
   <td>FR_Topo
   </td>
   <td>FR_topo_i: Fraction of land occupied by lower portions of landscape
   </td>
   <td>Soil-related
   </td>
   <td>Holos default
<p>
SLC or FAO Harmonized World Soil Database
   </td>
  </tr>
  <tr>
   <td>RF_TX
   </td>
   <td>RF_TX i/j/i,j weighted modifier which provides a correction of the EF_Topo in ecodistrict “i” based on the soil texture
   </td>
   <td>Soil-related
   </td>
   <td>Holos default
<p>
SLC or FAO Harmonized World Soil Database
   </td>
  </tr>
  <tr>
   <td>RF_till
   </td>
   <td>Tillage modifier RF_Till (Conservation or Conventional Tillage)
   </td>
   <td>Soil-crop-related
   </td>
   <td>Holos default
<p>
SLC or FAO Harmonized World Soil Database
   </td>
  </tr>
</table>