Interpretation framing lit review: rag-build + 4 papers + cite map (3/3)#64
Merged
Merged
Conversation
…art) Phase 2 — POSTed 4 candidates to NewGraphEnvironment/climate (8MH9LCC9) via Web API with PDFs attached via S3. CrossRef-driven metadata; tags interpretation-framing-methodology + cd-issue-63. 2 fresh PDF uploads (Arguez & Vose, Hawkins & Sutton), 2 md5- deduped (Hansen 2012, Livezey 2007). Auto-restart fired and all 4 BBT keys captured cleanly: arguez_vose2011DefinitionStandard livezey_etal2007EstimationExtrapolation hawkins_sutton2012Timeemergence hansen_etal2012Perceptionclimate Phase 3 — adds scripts/rag_interpretation_framing_build.R cloning the precip+drying build script with a 4-paper pdf_specs map. Runs in ~10 s on Ollama nomic-embed-text: Found 4 / 4 PDFs Chunks: 291 Sources: 4 Phase 4 — adds scripts/rag_interpretation_framing_query.R running 16 queries across 6 topics (narrower than #58/#61's 24 queries since framing topic surface is smaller). Captures top-5 chunks each to planning/active/interpretation_framing_quotes.md (373 lines). Topics: baseline window methodology, normals when trends exist, time of emergence, cumulative-impact / loaded dice, shifting baseline climate, departure from recent variability. Phase 5 — synthesis in findings.md: per-topic methodology quotes selected from the rag retrieval, cross-cutting methodology section (Hansen 2012's choice of 1951-1980 base period validates cd's choice for cumulative-impact reporting — strongest direct precedent across all three lit reviews), 3 documented deviations (1951-1980 vs WMO 1961-1990 baseline, no AC correction, ToE not quantified per-AOI), and an 11-row cite-this-for-that menu mapping vignette framing claim types to BBT-auto-derived citation keys. 3-split scoreboard added to findings.md — pointer to the four findings.md files (#53/#54 snow, #58/#60 temp, #61/#62 precip+ drying, this) for the downstream vignette wire-up branch. Refs #63.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Summary
[@key]markers from all four findings.md files.NewGraphEnvironment/climateZotero collection with PDFs attached: Arguez & Vose 2011 (WMO climate-normal definition + alternatives), Livezey 2007 (estimating normals when trends exist), Hawkins & Sutton 2012 (time of emergence), Hansen 2012 (perception of climate change / loaded dice). Plus 6 reuse-relevant existing climate-collection items (Mora 2013, Pauly 1995, Rodrigues 2019, Alleway 2023, IPCC AR6 WGI + SYR) + 4 cross-rag references from snow + temp + precip+drying stores.data/rag/interpretation_framing.duckdb(291 chunks, 4 sources, ~10 s ingest), built by the newscripts/rag_interpretation_framing_build.R. 16-query mining viascripts/rag_interpretation_framing_query.Rproducedplanning/active/interpretation_framing_quotes.md(373 lines), synthesized intofindings.mdwith methodology-quotes-by-topic, cross-cutting methodology, deviations, and an 11-row "cite this for that" menu (BBT-auto-derived keys ready for downstream[@key]insertion).Citation Key:overrides inextra(BBT auto-derives persoul#43); all 4 papers had ≥2 individual creators per CrossRef (no Pepin-style PATCH); macOS auto-restart fired without user prompt and all 4 BBT keys captured first try.Headline finding: Hansen 2012 uses the 1951–1980 base period — same as cd — providing the strongest direct precedent for cd's baseline window choice across all three lit reviews. Per Arguez & Vose 2011, cd's choice is an "alternative climate normal" (departure on the period-start attribute) — defensible if documented, and Hansen 2012 is the documentation.
Wraps the 3-split. BEC zone-shifts (#59 tracker) is the natural next pickup if reporting needs warrant it.
Relates to NewGraphEnvironment/sred-2025-2026#23.
Fixes #63.
Test plan
devtools::test()clean on63-interpretation-framing-lit-review(verified: 166 PASS, 0 FAIL)lintr::lint()on new scripts → 0 lints (verified)Rscript scripts/rag_interpretation_framing_build.Rreproduces the store (4 sources, ~291 chunks) given the local PDF cache populated from ZoteroRscript scripts/rag_interpretation_framing_query.Rregeneratesplanning/active/interpretation_framing_quotes.mdfindings.mdPhase 2 table resolve to items inNewGraphEnvironment/climate(verified via local sqlite)planning/active/task_plan.mdmatch landed work🤖 Generated with Claude Code