DataHarvester

Takes downloaded source files and loads their data into the mdr's source databases.

The program uses the XML files already downloaded for a data source, located in a source folder (one is designated for each source). The XML files, or a subset as controlled by the parameters - see below - are converted into data in the 'sd' schema (= session data) tables within each source database. Note that on each run the sd tables are dropped and created anew, and thus only ever contain the data from the most recent harvest. The tables present will vary in different databases, though if a table is present it will have a consistent structure in every database. The conversion to sd data therefore represents the second and final stage of the conversion of the source data into the consistent ECRIN schema. For that reason the detailed code for different sources can vary widely.

The program represents the second stage in the 4 stage MDR extraction process:
Download => Harvest => Import => Aggregation

For a much more detailed explanation of the extraction process,and the MDR system as a whole, please see the project wiki (landing page at https://ecrin-mdr.online/index.php/Project_Overview).
In particular, for the harvesting process, please see
http://ecrin-mdr.online/index.php/Harvesting_Data, and
https://ecrin-mdr.online/index.php/Contextual_Data
and linked pages

Provenance

Author: Steve Canham
Organisation: ECRIN (https://ecrin.org)
System: Clinical Research Metadata Repository (MDR)
Project: EOSC Life
Funding: EU H2020 programme, grant 824087

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
DataHelpers		DataHelpers
GeneralHelpers		GeneralHelpers
MonitoringHelpers		MonitoringHelpers
Properties		Properties
SourceSpecific		SourceSpecific
TableBuilders		TableBuilders
TestHelpers		TestHelpers
TopLevelClasses		TopLevelClasses
.gitattributes		.gitattributes
.gitignore		.gitignore
DataHarvester.csproj		DataHarvester.csproj
DataHarvester.sln		DataHarvester.sln
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

DataHarvester

Provenance

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

ecrin-github/DataHarvester

Folders and files

Latest commit

History

Repository files navigation

DataHarvester

Provenance

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages