Nodejs tools for extracting data from DSpace repositories into DataCrate format.
This repository is for tools similar to the DataCrate Omeka tools for the DSpace repository, but written in Node. Development will start with a DSpace collection to DataCrate script.
NOTE: This is very early in development, it has been written around a single collection, but if and when we use it more we'll add more features.
See an example of the output from the this script run on this collection.
This script is suitable for use by Nodejs developers only, it is not production ready and will need to be adapted for use on new data sources. Contributions welcome!
-
Install node.js. This will also install the nodejs part of CalcyteJS a command line tool for manipulating DataCrates.
-
Get the code:
git clone https://code.research.uts.edu.au/eresearch/datacrate-dspace-tools.git
-
Link the binary for development. Type: npm link
./download_dspace --help
Usage: download_dspace [options] <endpoint>
Generates DataCrate HTML from a DSpace instance.
Options:
-V, --version output the version number
-c, --collection [collection-id] Dowlnoad collection by ID
-d, --depth Maximum depth to recurse into directories
-m, --metadata [metadata-file] Metadata for the DataCrate
-o, --output_dir [] Output directory
-a, --all Download all bitstreams - defaults to only downloading things in the ORIGINAL bundle
-h, --help output usage information
To produce the output in the example, type:
mkdir ~/working/prs_mani # Create a dir
./download_dspace -c 316 -m examples/prs_mani/prs_mani_CATALOG.json -o ~/working/prs_mani https://opus.lib.uts.edu.au/rest
calcyte -o ~/working/prs_mani