Workflow for the Integration of Heritage Digital Resources
The aim of the workshop is to introduce the participants to the concept of linked data and to a selection of data curation tools that can be used for re-using and integrating data from across data silos, including a demonstration of how to semantically map them. The tutorial will make use of:
- OpenRefine together with Wikidata, Geonames, AAT and VIAF for analysing, cleaning, normalising and enriching the data
- 3M for mapping the data to CIDOC-CRM.
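To give an idea of what the cleaning and normalising step involves, here is a minimal Python sketch that approximates the "fingerprint" key-collision clustering offered by OpenRefine. It is an illustration only, not OpenRefine's actual code: the function names `fingerprint` and `cluster` and the sample name list are assumptions made for this example.

```python
import string
import unicodedata
from collections import defaultdict


def fingerprint(value: str) -> str:
    """Build a normalised key, roughly in the spirit of OpenRefine's fingerprint method."""
    # Trim surrounding whitespace and lowercase the string.
    text = value.strip().lower()
    # Decompose accented characters and drop the combining marks.
    text = unicodedata.normalize("NFKD", text)
    text = "".join(ch for ch in text if not unicodedata.combining(ch))
    # Remove ASCII punctuation.
    text = text.translate(str.maketrans("", "", string.punctuation))
    # Split on whitespace, de-duplicate and sort the tokens, then rejoin them.
    tokens = sorted(set(text.split()))
    return " ".join(tokens)


def cluster(values):
    """Group values that share a fingerprint; keep only groups with more than one member."""
    groups = defaultdict(list)
    for value in values:
        groups[fingerprint(value)].append(value)
    return [group for group in groups.values() if len(group) > 1]


# Hypothetical spellings of artist names, as they might appear across datasets.
names = ["Pablo Picasso", "Picasso, Pablo", "pablo  PICASSO", "Georges Braque"]

for group in cluster(names):
    print(group)
```

Values that end up in the same group are candidates for merging into one normalised form; whether they really denote the same entity still has to be checked by a person.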
We are going to demonstrate the possible applications of these tools, as well as the methodologies behind them, using diverse datasets from four museums:
- MoMA - original data available here
- MET - original data available here
- TATE - original data available here
- CMOA - original data available here
We specifically chose collection datasets containing information about works of art by Picasso.
To give everyone the same working environment, we set up a Vagrant virtual machine with OpenRefine, some utilities and the files that will be used during the workshop. The Vagrant VM is available by clicking here. For instructions on how to use the virtual machine, please refer to the Wiki page here.
For the second part of the workshop, we prepared some exercises that help explain how CRM works. You can find them here: 1 + 2 + 3. Please print them in advance. In addition, everyone needs to register with the service in order to use 3M. You can do so at this address: http://18.104.22.168/3M/SignUp?lang=en.
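As a warm-up for the CRM exercises, the following Python sketch shows how a single artwork record could be expressed with a few CIDOC-CRM classes and properties, serialised as N-Triples. It is independent of 3M and does not reflect how 3M stores its mappings. The `http://example.org/` base, the record fields and the identifiers are hypothetical, and the class name `E22_Man-Made_Object` follows CRM releases earlier than 7.x, so it may need adjusting to the version used in the workshop.

```python
CRM = "http://www.cidoc-crm.org/cidoc-crm/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"
BASE = "http://example.org/"  # hypothetical namespace for this example


def iri(value: str) -> str:
    """Wrap an IRI in angle brackets, as N-Triples requires."""
    return f"<{value}>"


def literal(value: str) -> str:
    """Serialise a plain string literal, escaping backslashes and double quotes."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def artwork_to_ntriples(record: dict) -> list:
    """Describe an object, its production event and the artist who carried it out."""
    obj = BASE + "object/" + record["id"]
    production = obj + "/production"
    artist = BASE + "person/" + record["artist_id"]
    triples = [
        (iri(obj), iri(RDF_TYPE), iri(CRM + "E22_Man-Made_Object")),
        (iri(obj), iri(RDFS_LABEL), literal(record["title"])),
        (iri(obj), iri(CRM + "P108i_was_produced_by"), iri(production)),
        (iri(production), iri(RDF_TYPE), iri(CRM + "E12_Production")),
        (iri(production), iri(CRM + "P14_carried_out_by"), iri(artist)),
        (iri(artist), iri(RDF_TYPE), iri(CRM + "E21_Person")),
        (iri(artist), iri(RDFS_LABEL), literal(record["artist"])),
    ]
    return [f"{s} {p} {o} ." for s, p, o in triples]


# Illustrative record; the identifiers are made up for this sketch.
record = {
    "id": "obj-1",
    "title": "Les Demoiselles d'Avignon",
    "artist_id": "picasso",
    "artist": "Pablo Picasso",
}

for line in artwork_to_ntriples(record):
    print(line)
```

The point of the sketch is the event-centric pattern: the artist is not attached to the object directly but to the production event that brought the object into existence.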
We will keep updating the repository and the Wiki, adding the slides and information on the commands and methodologies used.
[More to be added]