Skip to content

soilwise-he/harvesters

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 

Repository files navigation

harvesters

  • Component: Harvested
  • Lead: Paul & Cenk

A component or approach to fetch metadata (and data?) from remote sources as documented at https://soilwise-he.github.io/SoilWise-documentation/technical_components/ingestion/

In this approach each harvester runs in a dedicated container. The result of the harvester is ingested into a (temporary) storage, where follow up processes pick up the results. Typically this process runs as a GITHub actions.

flowchart LR
    c[CI-CD] -->|task| q[/Queue\]
    r[Runner] --> q
    r -->|deploys| hc[Harvest container]
    hc -->|harvests| db[(temporary storage)]
    hc -->|data cleaning| db[(storage)]

This repository will contain the definition of a container used as a github action to harvest resources; as well as git action definitions to harvest actual sources.

This component is tightly related to the triple store component. Harvested records are stored on the triple store. Some interaction with the triple store to understand if existing records need to be kept, overwritten or even removed.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published