GitHub - LibraryofVA/plugin-Scripto: Adds the ability to transcribe items using the Scripto library.

plugin-Scripto

Scripto is an open-source tool developed by Center for History and New Media that allows for the viewing and collaborative transcription of digital files. Built on MediaWiki, Scripto is intended to be used as an extension of a content management system. Scripto is currently available as a plugin for Omeka, Wordpress, and Drupal. Making History|Transcribe uses Scripto with Omeka based on a technology stack originally implemented by DIYHistory|transcribe Visit http://diyhistory.lib.uiowa.edu/code.html for an overview of their original implementation information.

The UI Libraries fork of plugin-Scripto adds the following features:

Track completion status of document pages (i.e., ‘Not Started’, ‘Needs Review’, ‘Completed’)
Track completion progress of documents based on page statuses.
Sort documents within their collection by most completed, floating least completed to the top.
Initialize document page text entry box with pre-existing text, if available (helpful if using Scripto to correct OCR for typescript pages).
On every page action, automatically import transcriptions from MediaWiki as file metadata.

The UI Libraries also created an Omeka theme, Scribe, to make use of these new features. The Library altered that theme for use in our project and is available at Scribe.

To accommodate the above addded features, UI Libraries added three elements to the Scripto metadata element set: “Status”, “Percent Needs Review”, and “Percent Completed”. These values are calculated and updated with every “Save”, “Approve”, or “Unapprove” action made on a document page. Additionally, with every action, a 6-digit sorting number is calculated and stored in dc:Audience.

“Status” stores the completion status of a document page (i.e., “Not Started”, “Needs Review”, “Completed”) in the page-level metadata. Scribe displays this value on each page thumbnail in the items/show page. Values for “Percent Needs Review” and “Percent Completed” are stored at the document (item) level and represent the percentage of pages within the document carrying the “Needs Review” and “Completed” statuses, respectively. These percentages are used by Scribe to display the stacked progress bars for each item in the collections/show page. The sorting number stored in dc:Audience is used by Scribe to sort items on collections/show page by completion progress, sinking the most completed items to the bottom and floating the least completed to the top.

The new fields for the Scripto element set will automatically be installed with a fresh install of the UI Libraries fork of plugin-Scripto. If you are currently running the Scripto plugin and do not wish to lose your existing Scripto data, you can instead manually add the new metadata elements (found in plugin-Scripto/ScriptoPlugin.php) to the elements table in Omeka, and replace your plugin code with the UI Libraries version.

Metadata

Metadata and content files are uploaded to Omeka using the UI libraries fork of plugin-CsvImport. CSV Import is an Omeka plugin for batch uploading metadata and content file to Omeka. The UI Libraries fork of this plugin allows for the batch uploading of page-level (file) descriptive metadata. To upload using the UI Libraries’ version, first upload the csv file of items as instructed in the CsvImport documentation (choosing ‘Record Type: Item’ in the Csv Import interface). After uploading and creating the items, upload the csv file of page-level metadata (choosing ‘Record Type: File’ in the CsvImport interface). Map the Original filename to ‘Filename?’ in Step 2 of the CSV Import interface. See _sample-data Item.csv and _sample-data File.csv for an example. If you are pulling content from your version 6 CONTENTdm installation, you may find this sample gist helpful in generating csv upload files.

Document-level (item) metadata

Element	Omeka element	Comments
Title	dc:Title	The title of the document
Source URL	dc:Source	(optional) The URL for the original location or master record for the document (If you are replicating the document from another digital environment).
Source identifier	dc:Identifier	(optional) An identifier for the document that ties it to the original location or master record (If you are replicating the document from another digital environment). Note that Omeka will generate a system identifier for each document (item).
Digital collection URL	dc:Is Part Of	(optional) The URL for the digital collection that the document belongs to in its original digital environment (If you are replicating the document from another digital environment).
Finding aid URL	dc:Relation	(optional) The URL for the finding aid of the document’s source collection.
Sorting number	dc:Audience	6-digit number for sorting the item within its collection in the collections/show display. Set to ‘000000’ as default. This will get updated every time a document page from the document is saved, approved, or unapproved.
Percent needs review	scripto:Percent Needs Review	Percentage of pages with status ‘Needs Review’. No default needed. This will get updated every time a document page from the document is saved, approved, or unapproved.
Percent completed	scripto:Percent Completed	Percentage of pages with status ‘Completed’. This will get updated every time a document page from the document is saved, approved, or unapproved.

Page-level (file) metadata

Element	Omeka element	Comments
Original filename		The file location specified in the csv item upload file. The UI Libraries fork of plugin-CsvImport uses this filename to find the Omeka file record for applying the page-level metadata.
Page label	dc:Title	The label for the page
Page-level source URL	dc:Source	(optional) The URL for the original location or master record for the document page (If you are replicating the document from another digital environment).
Source identifier	dc:identifier	(optional) An identifier for the document page that ties it to the original location or master record (If you are replicating the document from another digital environment). Note that Omeka will generate a system identifier for each document page (file).
Transcription	scripto:Transcription	The transcription for the document page. This will get updated every time a document page is saved or approved. You may pre-populate this field on ingest to Omeka with OCR or existing transcription, if desired.
Status	scripto:Status	Completion status of the document page (‘Not Started’, ‘Needs Review’, ‘Completed’). Set to ‘Not Started’ as default. This will get updated every time a document page is saved, approved, or unapproved.
Omeka file order	no map	The order of the page within the document. When used with the ui-libraries/plugin-Csv-Import fork, this value will assure files are in the correct sequence.

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
controllers		controllers
libraries		libraries
models		models
views/shared		views/shared
README.md		README.md
ScriptoPlugin.php		ScriptoPlugin.php
config_form.php		config_form.php
desktop.ini		desktop.ini
plugin.ini		plugin.ini
plugin.php		plugin.php
routes.ini		routes.ini
sample-data_File.csv		sample-data_File.csv
sample-data_Item.csv		sample-data_Item.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

plugin-Scripto

Metadata

About

Releases

Packages

Languages

LibraryofVA/plugin-Scripto

Folders and files

Latest commit

History

Repository files navigation

plugin-Scripto

Metadata

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages