
# Cohort 4 (SR-C0421Mx, C04A) Manual Biomass Samplings Report

This report analyzes the 2022 manual biomass sampling measurements for Cohort 4.



## Sampling Frequency

Since Fivetran (the tool used to ingest data points from Aquamanager into the Datalake) was connected, no new measurements have been inserted since February 2023. Below is a transaction log of all the new data points that Fivetran has detected.

The bar graph below gives a general overview of all recorded manual biomass measurements: the total number of samples per date, grouped by cohort.

In [0]:
%sql WITH t AS (
    DESCRIBE HISTORY hive_metastore.aquamanager_growout_dbo.cltranssampledetails
) SELECT to_date(timestamp, 'yyyy-MM-dd'), operation, operationMetrics.numOutputRows as num_output_rows FROM t

"to_date(timestamp, yyyy-MM-dd)",operation,num_output_rows
2023-02-04,MERGE,4449.0
2023-01-27,MERGE,421.0
2023-01-19,COPY INTO,4028.0
2023-01-19,CREATE TABLE,
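
The claim that nothing has been written since February 2023 can be checked against the log rows themselves. The following is a minimal sketch that works on the four rows shown above, copied by hand into a list; the helper names `last_write` and `days_since_last_write` are illustrative and not part of the notebook.

```python
from datetime import date

# Rows copied from the transaction log above: (commit date, operation, rows written).
# CREATE TABLE reports no output rows, so it is recorded as None.
history = [
    (date(2023, 2, 4), "MERGE", 4449),
    (date(2023, 1, 27), "MERGE", 421),
    (date(2023, 1, 19), "COPY INTO", 4028),
    (date(2023, 1, 19), "CREATE TABLE", None),
]


def last_write(rows):
    """Return the most recent commit date that wrote at least one row, or None."""
    dates = [commit_date for commit_date, _operation, n_rows in rows if n_rows]
    return max(dates) if dates else None


def days_since_last_write(rows, today):
    """Return the number of days between the last data-writing commit and `today`."""
    latest = last_write(rows)
    return None if latest is None else (today - latest).days


print(last_write(history))  # 2023-02-04
```

Passing the date of the report run as `today` gives the age of the newest ingested data in days.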


In [0]:
import seaborn as sn, matplotlib.pyplot as plt, numpy as np, pandas as pd

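def select_samplings(cohort_id, start_date, end_date):
    """Return one cohort's sampled weights between two 'yyyy-MM-dd' dates (inclusive), ordered by date."""
    # NOTE: the arguments are interpolated directly into the SQL text, so pass trusted values only.
    return spark.sql(f"""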
SELECT
    sampleweight as sampled_weight, batch, to_date(date, 'MM-dd-yy') as date
FROM
    hive_metastore.aquamanager_growout_dbo.view_sampling_details
WHERE 
    batch = '{cohort_id}' AND to_date(date, 'MM-dd-yy') BETWEEN to_date('{start_date}', 'yyyy-MM-dd') AND to_date('{end_date}', 'yyyy-MM-dd')
ORDER BY
    date
ASC              
    """)

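Because `select_samplings` builds its query with an f-string, a malformed cohort id or date would only surface as a SQL error, or silently change the query. One possible guard, sketched here under assumptions, is to validate the three arguments before they reach `spark.sql`. The function `validate_sampling_args` and the cohort-id pattern are hypothetical; the pattern is only inferred from ids such as `SR-C0421Mx` and may need widening.

```python
import re
from datetime import datetime

# Hypothetical pattern: letters, digits and hyphens only, as in 'SR-C0421Mx'.
_COHORT_ID = re.compile(r"[A-Za-z0-9-]+")


def validate_sampling_args(cohort_id, start_date, end_date):
    """Return the arguments in normalized form, or raise ValueError if they look unsafe."""
    if not _COHORT_ID.fullmatch(cohort_id):
        raise ValueError(f"unexpected cohort id: {cohort_id!r}")
    # strptime raises ValueError for anything that is not a real year-month-day date.
    start = datetime.strptime(start_date, "%Y-%m-%d").date()
    end = datetime.strptime(end_date, "%Y-%m-%d").date()
    if start > end:
        raise ValueError("start_date is after end_date")
    return cohort_id, start.isoformat(), end.isoformat()


print(validate_sampling_args("SR-C0421Mx", "2021-12-01", "2022-01-01"))
```

The returned tuple could be unpacked straight into `select_samplings(*validate_sampling_args(...))`.
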
In [0]:
%sql
SELECT 
  count(sampleweight) as total_sampled,
  batch,
  to_date(date, 'MM-dd-yy') as date
FROM
  hive_metastore.aquamanager_growout_dbo.view_sampling_details
GROUP BY
  date, batch
ORDER BY 
  date
ASC    

total_sampled,batch,date
480,SR-C0421Mx,2021-12-30
496,SR-C0421Mx,2022-01-27
496,SR-C0421Mx,2022-02-28
507,SR-C0421Mx,2022-03-15
178,SR-C0421Mx,2022-04-20
18,SR-C0421Mx,2022-04-26
91,SR-C0421Mx,2022-05-20
106,SR-C0421Mx,2022-06-07
112,SR-C0522MxEc,2022-06-07
118,SR-C0522MxEc,2022-06-14


Databricks visualization. Run in Databricks to view.
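
Sampling frequency can also be read as the gap between consecutive sampling dates. The sketch below computes those gaps from the rows shown in the output above, copied by hand; it only sees the displayed rows, so later sampling dates are not covered. The helper name `sampling_gaps` is illustrative.

```python
from datetime import date

# (total_sampled, batch, date) rows copied from the query output above.
counts = [
    (480, "SR-C0421Mx", date(2021, 12, 30)),
    (496, "SR-C0421Mx", date(2022, 1, 27)),
    (496, "SR-C0421Mx", date(2022, 2, 28)),
    (507, "SR-C0421Mx", date(2022, 3, 15)),
    (178, "SR-C0421Mx", date(2022, 4, 20)),
    (18, "SR-C0421Mx", date(2022, 4, 26)),
    (91, "SR-C0421Mx", date(2022, 5, 20)),
    (106, "SR-C0421Mx", date(2022, 6, 7)),
    (112, "SR-C0522MxEc", date(2022, 6, 7)),
    (118, "SR-C0522MxEc", date(2022, 6, 14)),
]


def sampling_gaps(rows, batch):
    """Return (previous_date, next_date, days_between) for one batch's consecutive sampling dates."""
    dates = sorted({day for _total, name, day in rows if name == batch})
    return [(prev, nxt, (nxt - prev).days) for prev, nxt in zip(dates, dates[1:])]


for prev, nxt, gap in sampling_gaps(counts, "SR-C0421Mx"):
    print(f"{prev} -> {nxt}: {gap} days")
```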


# Histograms

Below is a 30-, 60-, and 90-day analysis of the cohort's sampling histograms, covering the period from 2021-11-22 to 2023-03-11.
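
To relate each sampling date to the day windows, it can help to compute days post stocking explicitly. The sketch below assumes the cohort was stocked on 2021-11-22, the start date quoted above; the report does not state the stocking date separately, so treat `STOCKING_DATE` as an assumption to adjust.

```python
from datetime import date

# Assumption: stocking happened on the analysis start date quoted in the text.
STOCKING_DATE = date(2021, 11, 22)


def days_post_stocking(sample_date, stocking_date=STOCKING_DATE):
    """Return the number of whole days between stocking and a sampling date."""
    return (sample_date - stocking_date).days


# The first three sampling dates that appear in this report.
for day in [date(2021, 12, 30), date(2022, 1, 27), date(2022, 2, 28)]:
    print(day, days_post_stocking(day))
```

Under that assumption, the first three samplings fall on days 38, 66, and 98, so each one lands shortly after its 30-, 60-, or 90-day mark rather than exactly on it.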


## First 30 Days Post Stocking

Samples were recorded on December 30th, 2021, once the first 30 days had passed.

In [0]:
display(select_samplings('SR-C0421Mx', '2021-12-01', '2022-01-01'))

sampled_weight,batch,date
140.0,SR-C0421Mx,2021-12-30
108.0,SR-C0421Mx,2021-12-30
22.0,SR-C0421Mx,2021-12-30
36.0,SR-C0421Mx,2021-12-30
102.0,SR-C0421Mx,2021-12-30
160.0,SR-C0421Mx,2021-12-30
172.0,SR-C0421Mx,2021-12-30
128.0,SR-C0421Mx,2021-12-30
38.0,SR-C0421Mx,2021-12-30
240.0,SR-C0421Mx,2021-12-30


Databricks visualization. Run in Databricks to view.
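
When the Databricks visualization is not available, the histogram can be approximated by binning the weights directly. The sketch below bins only the ten rows displayed above, not the full sample for that date, and the bin width of 50 (in the same unit as `sampled_weight`) is an arbitrary choice. The helper `bin_counts` is illustrative.

```python
from collections import Counter

# The ten sampled_weight values displayed above for 2021-12-30.
weights = [140.0, 108.0, 22.0, 36.0, 102.0, 160.0, 172.0, 128.0, 38.0, 240.0]


def bin_counts(values, width):
    """Count values in left-closed bins [k*width, (k+1)*width), keyed by the bin's left edge."""
    counts = Counter(int(value // width) * width for value in values)
    return dict(sorted(counts.items()))


hist = bin_counts(weights, 50)
for left, n in hist.items():
    # One '#' per sample gives a quick text histogram.
    print(f"{left:>4}-{left + 50:<4} {'#' * n}")
```

The same function can be applied to the full result of `select_samplings(...)` after collecting the `sampled_weight` column into a list.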

## 60 Days Post Stocking

Samples were recorded on January 27th, 2022.

In [0]:
display(select_samplings('SR-C0421Mx', '2022-01-01', '2022-01-31'))

sampled_weight,batch,date
240.0,SR-C0421Mx,2022-01-27
100.0,SR-C0421Mx,2022-01-27
150.0,SR-C0421Mx,2022-01-27
340.0,SR-C0421Mx,2022-01-27
190.0,SR-C0421Mx,2022-01-27
460.0,SR-C0421Mx,2022-01-27
470.0,SR-C0421Mx,2022-01-27
300.0,SR-C0421Mx,2022-01-27
300.0,SR-C0421Mx,2022-01-27
280.0,SR-C0421Mx,2022-01-27


Databricks visualization. Run in Databricks to view.


## 90 Days Post Stocking

Samples were recorded on February 28th, 2022.

In [0]:
display(select_samplings('SR-C0421Mx', '2022-02-01', '2022-02-28'))

sampled_weight,batch,date
700.0,SR-C0421Mx,2022-02-28
550.0,SR-C0421Mx,2022-02-28
750.0,SR-C0421Mx,2022-02-28
860.0,SR-C0421Mx,2022-02-28
640.0,SR-C0421Mx,2022-02-28
650.0,SR-C0421Mx,2022-02-28
560.0,SR-C0421Mx,2022-02-28
880.0,SR-C0421Mx,2022-02-28
400.0,SR-C0421Mx,2022-02-28
800.0,SR-C0421Mx,2022-02-28


Databricks visualization. Run in Databricks to view.


## 120-210 Days Post Stocking: 2022-03-22 to 2022-06-22

In [0]:
%sql

SELECT 
  sampleweight as sampled_weight, batch, to_date(date, 'MM-dd-yy') as date
FROM
  hive_metastore.aquamanager_growout_dbo.view_sampling_details
WHERE 
  batch = 'SR-C0421Mx' AND to_date(date, 'MM-dd-yy') BETWEEN to_date('2022-03-22', 'yyyy-MM-dd') AND to_date('2022-06-22', 'yyyy-MM-dd')
ORDER BY
  date
ASC

sampled_weight,batch,date
1060.0,SR-C0421Mx,2022-04-20
1140.0,SR-C0421Mx,2022-04-20
1360.0,SR-C0421Mx,2022-04-20
1040.0,SR-C0421Mx,2022-04-20
900.0,SR-C0421Mx,2022-04-20
960.0,SR-C0421Mx,2022-04-20
1100.0,SR-C0421Mx,2022-04-20
1100.0,SR-C0421Mx,2022-04-20
1060.0,SR-C0421Mx,2022-04-20
1200.0,SR-C0421Mx,2022-04-20


Databricks visualization. Run in Databricks to view.


## 210-300 Days Post Stocking: 2022-06-22 to 2022-09-22

In [0]:
%sql

SELECT 
  sampleweight as sampled_weight, batch, to_date(date, 'MM-dd-yy') as date
FROM
  hive_metastore.aquamanager_growout_dbo.view_sampling_details
WHERE 
  batch = 'SR-C0421Mx' AND to_date(date, 'MM-dd-yy') BETWEEN to_date('2022-06-22', 'yyyy-MM-dd') AND to_date('2022-09-22', 'yyyy-MM-dd')
ORDER BY
  date
ASC

sampled_weight,batch,date
2060.0,SR-C0421Mx,2022-07-05
1800.0,SR-C0421Mx,2022-07-05
2750.0,SR-C0421Mx,2022-07-05
1500.0,SR-C0421Mx,2022-07-05
1350.0,SR-C0421Mx,2022-07-05
850.0,SR-C0421Mx,2022-07-05
1910.0,SR-C0421Mx,2022-07-05
1950.0,SR-C0421Mx,2022-07-05
2150.0,SR-C0421Mx,2022-07-05
1950.0,SR-C0421Mx,2022-07-05


Databricks visualization. Run in Databricks to view.
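
To compare the windows side by side, each sampling date can be reduced to a mean and a coefficient of variation. The sketch below uses only the ten rows displayed for each date in this report, so its numbers describe those excerpts and should not be read as cohort-level results. The helper `summarize` is illustrative.

```python
from statistics import mean, stdev

# The ten sampled_weight values displayed in this report for each sampling date.
displayed = {
    "2021-12-30": [140.0, 108.0, 22.0, 36.0, 102.0, 160.0, 172.0, 128.0, 38.0, 240.0],
    "2022-01-27": [240.0, 100.0, 150.0, 340.0, 190.0, 460.0, 470.0, 300.0, 300.0, 280.0],
    "2022-02-28": [700.0, 550.0, 750.0, 860.0, 640.0, 650.0, 560.0, 880.0, 400.0, 800.0],
    "2022-04-20": [1060.0, 1140.0, 1360.0, 1040.0, 900.0, 960.0, 1100.0, 1100.0, 1060.0, 1200.0],
    "2022-07-05": [2060.0, 1800.0, 2750.0, 1500.0, 1350.0, 850.0, 1910.0, 1950.0, 2150.0, 1950.0],
}


def summarize(values):
    """Return (mean, coefficient of variation in percent) using the sample standard deviation."""
    average = mean(values)
    return average, 100 * stdev(values) / average


for day, values in displayed.items():
    average, cv = summarize(values)
    print(f"{day}: mean={average:.1f}, CV={cv:.1f}%")
```

Running the same summary on the full query results, rather than on the displayed rows, would be the natural next step.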