jic-dtool/dtoolcore

Manage scientific data sets

Features

  • Core API for adding different types of metadata to files on disk
  • Automatic generation of structural metadata
  • Programmatic discovery and access of items in a dataset
  • Structural metadata includes hash, size and modification time for subsequent integrity checks
  • Ability to annotate individual files with arbitrary metadata
  • Metadata stored on disk as plain text files, i.e. disk datasets generated using this API can be accessed without special tools
  • Ability to create plugins for custom storage solutions
  • Plugins for iRODS and Microsoft Azure storage backends available
  • Cross-platform: Linux, Mac and Windows are all supported
  • Works with Python 2.7, 3.5 and 3.6
  • No external dependencies
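
To make the idea of structural metadata concrete, the following is a minimal standard-library sketch of recording a hash, size and modification time for every file in a directory and saving the result as plain text. It is not dtoolcore's API: the function names (`file_md5`, `build_manifest`, `write_manifest`), the dictionary keys, the JSON layout and the choice of MD5 are all assumptions made for illustration.

```python
import hashlib
import json
import os


def file_md5(path, chunk_size=65536):
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(data_dir):
    """Map each relative file path to its hash, size and modification time.

    Hypothetical layout for illustration, not dtoolcore's on-disk format.
    """
    items = {}
    for root, _dirs, files in os.walk(data_dir):
        for name in sorted(files):
            abspath = os.path.join(root, name)
            # Use forward slashes so the manifest reads the same on all platforms.
            relpath = os.path.relpath(abspath, data_dir).replace(os.sep, "/")
            stat = os.stat(abspath)
            items[relpath] = {
                "hash": file_md5(abspath),
                "size_in_bytes": stat.st_size,
                "mtime": stat.st_mtime,
            }
    return {"hash_function": "md5", "items": items}


def write_manifest(manifest, manifest_path):
    """Store the manifest as human-readable JSON text."""
    with open(manifest_path, "w") as fh:
        json.dump(manifest, fh, indent=2, sort_keys=True)
```

Because the manifest is ordinary JSON, it can be read with any text editor, which is the property the plain-text feature above describes.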

Overview

The dtoolcore project provides a Python API for managing (scientific) data. It allows researchers to:

  • Package data and metadata into a dataset
  • Organise and back up datasets easily
  • Find datasets of interest
  • Verify the contents of datasets
  • Discover and work with data programmatically
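
As a rough illustration of what verifying the contents of a dataset can involve, the sketch below compares files on disk against previously recorded sizes and hashes. It uses only the standard library and does not call dtoolcore; `sha256_of`, `verify`, the manifest layout and the use of SHA-256 are hypothetical choices for this example.

```python
import hashlib
import os


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(data_dir, manifest):
    """Return a list of problems; an empty list means the check passed.

    `manifest` maps relative paths to {"size_in_bytes": int, "hash": str}.
    This layout is an assumption for illustration only.
    """
    problems = []
    for relpath, expected in sorted(manifest.items()):
        abspath = os.path.join(data_dir, relpath)
        if not os.path.isfile(abspath):
            problems.append("missing: " + relpath)
            continue
        # Compare sizes first: it is cheap and avoids hashing obvious mismatches.
        if os.path.getsize(abspath) != expected["size_in_bytes"]:
            problems.append("size changed: " + relpath)
        elif sha256_of(abspath) != expected["hash"]:
            problems.append("hash changed: " + relpath)
    return problems
```

Checking the size before the hash is a design choice: a size mismatch already proves the file changed, so the more expensive hash is only computed when the sizes agree.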
