tingletech edited this page Dec 28, 2011 · 32 revisions

Project Objective

This project is building a collection of EAD 2002 XML specimens, each valid against the DTD or the W3C schema, made available under an open source license for use in testing EAD systems.

The collection seeks to capture specimens that sample and represent a wide diversity of valid encoding practice. Its purpose is to test that systems can handle a variety of markup features; this testing may include use of the files in publicly accessible systems.

A Catalog of Practice

Method

Specimens of XML "in the wild" are being collected from encoded archival descriptions produced in the course of archival practice. Collected specimens will be systematically obscured by replacing nouns in text nodes with nonsensical words of similar length and inflection. The original specimens will not be part of the open source collection made available by the project.
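The obscuring step can be illustrated with a minimal sketch. This is not the project's greeking script and does not reproduce its behavior; it is a simplified stand-in under stated assumptions. It replaces every alphabetic word rather than only nouns (picking out nouns would need a part-of-speech tagger), and it approximates "similar inflection" by keeping a few common English suffixes. The names nonsense, greek_text, greek_xml, and SUFFIXES are hypothetical, chosen for illustration only.

```python
import random
import re
import xml.etree.ElementTree as ET

WORD = re.compile(r"[A-Za-z]+")
# Assumption: a crude stand-in for "inflection"; these endings are kept as-is.
SUFFIXES = ("ing", "ed", "es", "s")
CONSONANTS = "bcdfghjklmnprstvwz"
VOWELS = "aeiou"


def nonsense(word, rng):
    """Return a made-up word with the same length, capitalization, and suffix."""
    lower = word.lower()
    # Keep a suffix only when at least three letters of stem remain.
    suffix = next(
        (s for s in SUFFIXES if lower.endswith(s) and len(word) > len(s) + 2), ""
    )
    stem_len = len(word) - len(suffix)
    # Alternate consonants and vowels so the result is roughly pronounceable.
    stem = "".join(
        rng.choice(CONSONANTS if i % 2 == 0 else VOWELS) for i in range(stem_len)
    )
    fake = stem + suffix
    if word.isupper():
        return fake.upper()
    if word[0].isupper():
        return fake.capitalize()
    return fake


def greek_text(text, rng):
    """Replace each alphabetic word; digits, punctuation, and spacing are kept."""
    if text is None:
        return None
    return WORD.sub(lambda m: nonsense(m.group(0), rng), text)


def greek_xml(xml_string, seed=0):
    """Obscure text nodes only; element names and attributes are left untouched."""
    rng = random.Random(seed)
    root = ET.fromstring(xml_string)
    for elem in root.iter():
        elem.text = greek_text(elem.text, rng)
        elem.tail = greek_text(elem.tail, rng)
    return ET.tostring(root, encoding="unicode")


sample = "<ead><archdesc><did><unittitle>Letters from Boston, 1901</unittitle></did></archdesc></ead>"
print(greek_xml(sample))
```

Because only text nodes change, the markup structure of a specimen is preserved, which is the property that matters when testing how a system handles a given encoding practice. Note that the standard library parser used here discards comments and the document type declaration, so a real tool would need a different serialization path to keep specimens DTD valid.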

Collection License

CC0 Public Domain Dedication

Misc.

The greeking script: https://github.com/tingletech/greeker.py

Original post to the EAD listserv about the project: http://bit.ly/rPV1hJ http://listserv.loc.gov/cgi-bin/wa?A2=ind1112&L=ead&T=0&P=1437
