Skip to content
Pietro Liuzzo edited this page Apr 27, 2016 · 14 revisions

Export Mycore data to TEI

Description

Catalogue records from Ethio-Sphere project like http://mycms-vs03.rrz.uni-hamburg.de/domlib/receive/domlib_document_00000015 can be transformed in TEI records to constitute the basic needs for a catalogue records encoding.

Notes

all data in DomLib repository.

Implementation

una tantum XSLT transformation. The XSLT can not only map but in many cases also split and structure information in the coherently entered data.

Needs to add publicationStmt date taking it from the myCore original last modified date (not creation date?)

Institutions should be exported to a separate institution file, containing place references. Coordinates in MyCore refer to the institution and not to the settlement.

Prerequisites

homogeneous data in mycore text fields

Progress

  • an XSLT has been prepared to take myCore and transform it in valid TEI-all XML catalogue records. this requires two steps because of html/richtext escaped in the XML view of the mycore data. The second step simply cleans from spurious elements the result documents.

  • text in fields is hard coded in html. this means that the parsing is harsh and cannot get to let's say the second part of a field in Amharic rather than in English.

  • in listBibl type mss the information is split but identifiers are given from the text, substituting spaces with underscore. this will need to be fixed by hand, while creating mini records for the mss to which these records point.

  • listBibl type edition and type text carry over editions and bibliographic information about contents, which should go to Work entities rather than in the manuscript. This are thus left behind.


  • for institutions @when to settlment/ab[@type='history'] in institutions from EthioSpare

ATTT!!! SETTLEMENT HAS WRONG CONTENT, should go into an element which describes the localization into a different system from the region and nation one

@type values should be mapped as described here


results for manuscripts

results are not valid in most cases for minor problems, which need hand fixing. a first stage of revision shall decide if there is more that can be done in the automated conversion or handfixing can start from these files.

back to list