Hologram Puller/Parser


Prototype R functions for processing on-farm sensor data: Extract from Raw, Transform in Shadow, and Load into Production DBs.

The first version (sketched after this list):

  • Called the Hologram API
  • Stored the device list
  • Stored the latest timestamp for each device
  • Archived raw strings
  • Parsed raw strings into tabular format
  • Archived the data as CSVs
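
As a rough illustration of that first-version flow, here is a minimal R sketch. The Hologram endpoint paths, response field names, and the semicolon-delimited payload format are assumptions for illustration, not the actual package code; check the Hologram API docs for the real shapes.

```r
# Minimal sketch only; endpoints, field names, and payload format are assumptions.
library(httr)
library(jsonlite)

api_key  <- Sys.getenv("HOLOGRAM_API_KEY")
base_url <- "https://dashboard.hologram.io/api/1"

# Store the device list
devices <- content(GET(paste0(base_url, "/devices"),
                       query = list(apikey = api_key)))$data

# Pull raw messages for one device, optionally since a stored timestamp
pull_messages <- function(device_id, since = NULL) {
  resp <- GET(paste0(base_url, "/csr/rdm"),
              query = list(apikey = api_key, deviceid = device_id,
                           timestart = since))
  stop_for_status(resp)
  content(resp)$data   # one record per message; payload is base64 inside $data
}

# Parse one record's raw string into a one-row data frame
# (assumes a payload like "soil_temp;soil_moisture;battery")
parse_record <- function(rec) {
  body <- fromJSON(rec$data)
  raw  <- rawToChar(base64_dec(body$data))
  vals <- strsplit(raw, ";", fixed = TRUE)[[1]]
  data.frame(device_id  = rec$deviceid,
             logged_at  = rec$logged,
             soil_temp  = as.numeric(vals[1]),
             soil_moist = as.numeric(vals[2]),
             battery    = as.numeric(vals[3]))
}

# Archive the parsed table as a dated CSV
msgs   <- pull_messages(devices[[1]]$id)
parsed <- do.call(rbind, lapply(msgs, parse_record))
write.csv(parsed, sprintf("parsed_%s.csv", Sys.Date()), row.names = FALSE)
```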

The next version will:

  • Pull from the Raw DB (a MySQL server that makes its own API calls) rather than calling the Hologram API directly
    • Keep track of which Raw rows have been parsed into the Shadow DB
    • We will have a table in Shadow for each table in Raw that holds only the uids of parsed rows, so we can subset Raw using uid > max(uid) (import_from_*); see the sketch after this list
    • We will have a table for try/catch-style errors (try_again_*) that stores anything with unexpected values
    • That way rows will be pulled only when new or when explicitly batched from the retry table
  • Extract raw strings from the response object in each row
  • Parse raw strings
  • Push into Shadow
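
To make the import_from_* bookkeeping concrete, here is a minimal sketch using DBI/RMariaDB. The table names (raw_hologram, import_from_hologram, try_again_hologram), hosts, and the numeric uid key are hypothetical; the real schema may differ.

```r
# Sketch of the incremental pull; table and column names are assumptions.
library(DBI)
library(RMariaDB)

raw_con    <- dbConnect(MariaDB(), dbname = "raw",    host = "raw.example.org")
shadow_con <- dbConnect(MariaDB(), dbname = "shadow", host = "shadow.example.org")

# Highest uid already parsed into Shadow (0 if nothing imported yet)
last_uid <- dbGetQuery(
  shadow_con,
  "SELECT COALESCE(MAX(uid), 0) AS uid FROM import_from_hologram"
)$uid

# New rows only: everything in Raw above the high-water mark
new_rows <- dbGetQuery(raw_con,
                       "SELECT * FROM raw_hologram WHERE uid > ?",
                       params = list(last_uid))

# Plus anything explicitly queued for another attempt in the retry table
retry_uids <- dbGetQuery(shadow_con, "SELECT uid FROM try_again_hologram")$uid
if (length(retry_uids) > 0) {
  retry_rows <- dbGetQuery(
    raw_con,
    sprintf("SELECT * FROM raw_hologram WHERE uid IN (%s)",
            paste(retry_uids, collapse = ","))
  )
  new_rows <- rbind(new_rows, retry_rows)
}
```

Each pulled row would then be parsed and either written to Shadow (recording its uid in import_from_*) or logged in try_again_*, as sketched under the new architecture below.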

Then the Tech Dashboard will use the Publish function to push from Shadow to the Published DB.

  • Except for the testing codes TST, DEV, DV1, TS1, etc., which will otherwise propagate through all functions (see the filter sketch below)
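
A tiny sketch of that exclusion, assuming the rows carry a code column and a hypothetical Published table name:

```r
# Testing codes are filtered out before anything is pushed to Published;
# the `code` column and table name here are assumptions.
test_codes <- c("TST", "DEV", "DV1", "TS1")

publish_from_shadow <- function(shadow_rows, pub_con) {
  publishable <- subset(shadow_rows, !(code %in% test_codes))
  DBI::dbWriteTable(pub_con, "water_sensor_data", publishable, append = TRUE)
  invisible(nrow(publishable))
}
```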

Major rewrite:

New architecture:

  1. Webhooks push to the Raw DB (MariaDB or Mongo) from each asset (Kobo forms) and device
  2. Parsing script pulls from Raw
    • We will have a table in the Shadow DB (see below) for each table in Raw (assets and Hologram) that holds only a column of parsed-row uids and a column of assets, so we can subset Raw using group_by(asset) + uid > max(uid) (imported_from_*)
    • We will have a table for try/catch-style errors (try_again_*) that stores anything with unexpected values
    • That way rows will be pulled only when new or when explicitly batched from the retry table
  3. Extract raw strings from the response object in each row
  4. Parse raw strings
  5. Push into SQLite/flatfiles (Shadow DB)
  6. Collate Shadow into the appropriate forms to mirror tables in the Production DB
  7. Push row updates into Published, with a Validated = FALSE column (see the sketch after this list)
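
A hedged end-to-end sketch of steps 2–7, assuming hypothetical table names (raw_kobo, imported_from_kobo, try_again_kobo), a local SQLite file as the Shadow DB, and a parse_kobo() helper that does not exist in this repo under that name:

```r
# Sketch of steps 2-7; every table, column, and helper name is an assumption.
library(DBI)
library(RSQLite)
library(RMariaDB)
library(dplyr)

raw_con    <- dbConnect(MariaDB(), dbname = "raw",       host = "raw.example.org")
shadow_con <- dbConnect(SQLite(),  "shadow.sqlite")
pub_con    <- dbConnect(MariaDB(), dbname = "published", host = "pub.example.org")

# Step 2: per-asset high-water marks from imported_from_*, then subset Raw
marks <- dbGetQuery(shadow_con,
  "SELECT asset, MAX(uid) AS max_uid FROM imported_from_kobo GROUP BY asset")
raw_rows <- dbGetQuery(raw_con, "SELECT * FROM raw_kobo") %>%
  left_join(marks, by = "asset") %>%
  filter(is.na(max_uid) | uid > max_uid)

# Steps 3-5: extract and parse each raw string; park failures in try_again_*
for (i in seq_len(nrow(raw_rows))) {
  row    <- raw_rows[i, ]
  parsed <- tryCatch(parse_kobo(row$response),        # parse_kobo() is hypothetical
                     error = function(e) NULL)
  if (is.null(parsed)) {
    dbExecute(shadow_con,
              "INSERT INTO try_again_kobo (asset, uid) VALUES (?, ?)",
              params = list(row$asset, row$uid))
  } else {
    dbWriteTable(shadow_con, "kobo_parsed", parsed, append = TRUE)
    dbExecute(shadow_con,
              "INSERT INTO imported_from_kobo (asset, uid) VALUES (?, ?)",
              params = list(row$asset, row$uid))
  }
}

# Steps 6-7: collate Shadow to mirror a Production table, push with Validated = FALSE
collated <- dbReadTable(shadow_con, "kobo_parsed") %>%
  select(code, subplot, soil_temp, soil_moist) %>%    # hypothetical Production columns
  mutate(Validated = FALSE)
dbWriteTable(pub_con, "site_information", collated, append = TRUE)
```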

Then the Tech Dashboard will give Shepherd users permission to "Publish" rows in the Production DB (setting Validated = TRUE). Additionally, State Lead users will be able to Suggest Changes to data values, which the Shepherds can resolve in an issue tracker that edits the Production DB.
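
Under the hood, a "Publish" action presumably amounts to something like the following update; the table name and key column are assumptions about the Production schema.

```r
# Mark a batch of rows as validated; table and column names are assumptions.
publish_rows <- function(pub_con, uids) {
  DBI::dbExecute(
    pub_con,
    sprintf("UPDATE water_sensor_data SET Validated = TRUE WHERE uid IN (%s)",
            paste(uids, collapse = ","))
  )
}
```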

Additional important note: the Export CSV button in the Tech Dashboard needs to filter out rows matching the testing codes TST, DEV, DV1, TS1, etc. Power users who query the Production DB directly MUST KNOW (see the example query after this list):

  1. Ignore those testing rows
  2. Don't rely on "unpublished" rows (Validated = FALSE)
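
For example, a direct query against the Production DB should look something like this; the table name and `code` column are assumptions.

```r
# Exclude testing rows and unvalidated rows when querying Production directly.
test_codes <- c("TST", "DEV", "DV1", "TS1")

fetch_clean_rows <- function(pub_con) {
  DBI::dbGetQuery(
    pub_con,
    sprintf(
      "SELECT * FROM water_sensor_data
        WHERE Validated = TRUE
          AND code NOT IN (%s)",
      paste(sprintf("'%s'", test_codes), collapse = ", ")
    )
  )
}
```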
