Skip to content

Resolve R1-R8 data concerns for desert_farm_leverage_points.csv#25

Closed
madsCodeBuddy wants to merge 3 commits into
mainfrom
desert-farm-csv-fixes
Closed

Resolve R1-R8 data concerns for desert_farm_leverage_points.csv#25
madsCodeBuddy wants to merge 3 commits into
mainfrom
desert-farm-csv-fixes

Conversation

@madsCodeBuddy
Copy link
Copy Markdown
Collaborator

Resolves R1–R8 data concerns from desert_farm_dataset_issues working list.

Changes

ID Row Change
R1 CO2 fixation Time_min 7.69e-3 → 6.67e-2 (Bar-On natural RuBisCO kcat ceiling, 15/s)
R2 Nutrient transport, Biochemical synthesis, Cell growth Replace Moore et al. (2013) mis-citation with Milo & Phillips (2015) Cell Biology by the Numbers
R3 Growth → Cell growth Tighten bounds to single algal cell scope: Time 1e3–1e5 s, Space 1e-18 to 1e-14 m³. Population/community scales remain covered by Community Ecology row
R4 Molecular Dynamics Models Space_max 1e-27 → 1e-22 m³ (modern atomistic MD reaches 100-nm boxes)
R5 Community Metabolic Models Time_min 991.00E+02 (formatting consistency)
R6 MD / Community Metabolic / BGC Circulation Models Fill empty References (see below)
R7 Extraction → Fossil-fuel formation Time bounds reflect formation duration (kerogen maturation + coalification, ~1–100 My) rather than deposit age
R8 CO2 fixation Extend Reference text to justify both time (Bar-On kcat) and space (RuBisCO L8S8 volume) bounds

New references

  • Karplus & McCammon (2002) Nat Struct Biol 9:646 — foundational MD review
  • Zakem et al. (2020) ISME J 14:288 — community-scale microbial metabolic model
  • Levine et al. (2025) Annu Rev Earth Planet Sci 53:595 — bridges genome-scale → BGC scales
  • Milo & Phillips (2015) Cell Biology by the Numbers — single-cell physical/chemical rates

Rename impact check

Searched repo for Growth and Extraction string usage. No external code/docs reference these specific row labels in the desert farm context (other CSVs use them in unrelated rows like Grass blade growth, Coral reef growth, Oil reservoir extraction — all distinct).

Risks / open items

  • Population growth split (Option B from working doc) was dropped — overlap with Community Ecology row was too large to justify
  • BioNumbers spreadsheet itself is not yet in the repo; @MDunitz to add to data/references/ and link from Reference cells when ready
  • Igamberdiev (2015) carbonic-anhydrase reference dropped from CO2 fixation cell since the new Time_min reflects intrinsic RuBisCO kcat rather than CA-enhanced apparent rate; revert if you want to keep the mechanistic note

R1: CO2 fixation Time_min 7.69E-03 → 6.67E-02 (Bar-On natural kcat ceiling)
R2: Replace Moore et al. 2013 mis-citations on Nutrient transport, Biochemical
    synthesis, and Cell growth (formerly Growth) with Milo & Phillips (2015)
    Cell Biology by the Numbers
R3: Rename Growth → Cell growth; tighten bounds to single algal cell
    (Time 1e3-1e5 s, Space 1e-18 to 1e-14 m³). Population scale stays in
    Community Ecology row.
R4: MD Space_max 1e-27 → 1e-22 m³ (genuine modern atomistic-MD reach)
R5: Community Metabolic Models Time_min 99 → 1.00E+02 (formatting consistency)
R6: Fill empty Reference cells:
    - Molecular Dynamics Models: Karplus & McCammon (2002) Nat Struct Biol 9:646
    - Community Metabolic Models: Zakem et al. (2020) ISME J 14:288
    - Biogeochemical Circulation Models: Levine et al. (2025) Annu Rev
      Earth Planet Sci 53:595
R7: Rename Extraction → Fossil-fuel formation; Time bounds 3.16E+13 to
    3.16E+15 s reflect formation duration (kerogen maturation + coalification)
    rather than deposit age
R8: Extend CO2 fixation Reference text to justify both time (Bar-On kcat) and
    space (RuBisCO L8S8 enzyme volume) bounds. Drop Igamberdiev CA mechanism
    reference; no longer needed once Time_min reflects intrinsic RuBisCO kcat.
25 entries justifying time/space bounds for Cell growth (8), Biochemical
synthesis (8), and Nutrient transport (9) rows. Each entry includes
bion_id, value, units, organism, and direct URL to the BioNumbers page.

Phototroph-specific where available; generic-organism proxies used where
phototroph data was unavailable in BioNumbers.
Schema, curation criteria, known gaps (including the Cell growth Time_min
mismatch flagged by the BioNumbers data), update procedure, and BioNumbers
attribution per Milo et al. (2010) Nucleic Acids Res 38:D750.
@madsCodeBuddy
Copy link
Copy Markdown
Collaborator Author

Superseded by a fresh-from-main port — branch fell behind main during the BioNumbers subset addition. New PR uses branch desert-farm-csv-fixes-v2 with the same content plus the curated BioNumbers subset and README.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant