Skip to content
This repository has been archived by the owner on Oct 21, 2020. It is now read-only.

Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRefine and Solr. Part of the FID Romanistik software stack

License

Notifications You must be signed in to change notification settings

subhh/FID-Romanistik-metadataTransformations

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

FID-Romanistik-MetadataTransformations

Codacy Badge

Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRefine and Solr. Part of the FID Romanistik software stack.

See upstream git repo HOS-MetadataTransformations for use case, features and reuse.

Installation

tested with Ubuntu 16.04 LTS and Ubuntu 18.04 LTS

install git:

sudo apt install git

clone this git repository:

git clone https://github.com/subhh/FID-Romanistik-MetadataTransformations.git
cd FID-Romanistik-MetadataTransformations

install openjdk-8-jre-headless, curl, jq, metha 1.29, OpenRefine 2.8, openrefine-client 0.3.4 and Solr 7.3.1:

sudo ./install.sh

Configure Solr schema:

./init-solr-schema.sh

Usage

Data will be available after first run at:

Run workflow with data source "dialnet-tesis" and load data into local Solr (-s) and local OpenRefine service (-d)

bin/dialnet-tesis.sh -s http://localhost:8983/solr/fid -d http://localhost:3333

Run workflow with all data sources in parallel and load data into local Solr (-s) and local OpenRefine service (-d):

./run.sh -s http://localhost:8983/solr/fid -d http://localhost:3333

Run workflow with all data sources and load data into external Solr core

./run.sh -s "http://..."

About

Automated workflow for harvesting, transforming and indexing of metadata using metha, OpenRefine and Solr. Part of the FID Romanistik software stack

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages