Book Solution Pack Roadmap May 2, 2012

kstapelfeldt edited this page May 16, 2012 · 34 revisions

Updates

Paul's work

  • Paul has been working to keep the various versions of the code close together. The book solution pack is Alan's, from his GitHub; the book viewer is a combination of everybody's work. Paul has been adding features: search, configuration for the DSID, the relationship to the book object, and other items. The Solr module is a dependency. Jason's changes have been merged. The only real difference between Jason's version and ours is that we still do book metadata, not page metadata, but Paul thinks he can make the two work together.

  • Global namespace suggestion of Jonathan's has been used. Page relationship has replaced .

  • Upgrade path for everyone using the book solution pack: Jason has a script, and Alan has a module. Alan's module can be modified to provide an upgrade path. The upgrade should run as a script (not GUI-based) so that we don't hit timeout issues.

Resolution

  • Need to get this done for testing and documentation by the end of May
  • Paul is lead on basic solution pack
  • Everyone who wants to use the book solution pack from this point forward must upgrade to provide page numbers in RELS-EXT. The book module will be packaged with an ingest script.
  • Make iiv page aware.
  • One book solution pack with two book CModels and a shared PageCModel. The CModel is chosen on ingest and defines the viewer that the book will be shown in (either IAViewer or iiv). Names will be IABookCModel and BookCModel.
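
The two decisions above (page numbers recorded in RELS-EXT, and the viewer determined by the book's CModel) can be sketched roughly as follows. This is only an illustration, not the solution pack's code: the `PAGE_NS` namespace, the `isPageOf` and `pageNumber` predicate names, and the example PIDs are hypothetical placeholders, since the notes do not name them.

```python
import xml.etree.ElementTree as ET

RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
# Assumption: placeholder namespace; the notes do not say which one is used.
PAGE_NS = "http://example.org/book-relations#"

# CModel names and viewers as listed in the resolution above.
VIEWER_BY_CMODEL = {
    "IABookCModel": "IAViewer",
    "BookCModel": "iiv",
}


def viewer_for(cmodel: str) -> str:
    """Return the viewer implied by the CModel chosen at ingest."""
    try:
        return VIEWER_BY_CMODEL[cmodel]
    except KeyError:
        raise ValueError(f"unknown book content model: {cmodel}") from None


def page_rels_ext(page_pid: str, book_pid: str, page_number: int) -> str:
    """Build a minimal RELS-EXT document for one page object.

    The predicates 'isPageOf' and 'pageNumber' are hypothetical names.
    """
    ET.register_namespace("rdf", RDF_NS)
    ET.register_namespace("page", PAGE_NS)
    rdf = ET.Element(f"{{{RDF_NS}}}RDF")
    desc = ET.SubElement(
        rdf,
        f"{{{RDF_NS}}}Description",
        {f"{{{RDF_NS}}}about": f"info:fedora/{page_pid}"},
    )
    # Link the page to its parent book object.
    ET.SubElement(
        desc,
        f"{{{PAGE_NS}}}isPageOf",
        {f"{{{RDF_NS}}}resource": f"info:fedora/{book_pid}"},
    )
    # Record the page number as a literal so viewers can order pages.
    number = ET.SubElement(desc, f"{{{PAGE_NS}}}pageNumber")
    number.text = str(page_number)
    return ET.tostring(rdf, encoding="unicode")
```

For example, `page_rels_ext("book:1-007", "book:1", 7)` would describe the seventh page of a hypothetical book `book:1`, and `viewer_for("BookCModel")` returns `"iiv"`.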

Action Items

  • Links instead of tabs? - put out to google dev list (Alan)
  • Solr dependency for book solution pack (Committers Meeting - David and Alan)
  • Create the ability to change weights and labels for tabs in the interface in a configuration panel in book solution pack (Paul)
  • Make iiv page aware
  • Bring together script work for upgrade path
  • Crosswalk from ABBYY XML to hOCR for storage in the ENCODED_OCR datastream
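
As a starting point for the crosswalk action item, here is a minimal sketch of what a conversion could look like. It assumes the ABBYY FineReader XML layout of `page` > `line` > `charParams` elements carrying `l`, `t`, `r`, `b` coordinates, matches elements by local name so the schema version does not matter, and emits only page, line, and word boxes. Fonts, confidence values, and block structure are ignored; the function names are illustrative.

```python
import xml.etree.ElementTree as ET
from html import escape


def _local(tag: str) -> str:
    """Strip the '{namespace}' prefix from an element tag."""
    return tag.rsplit("}", 1)[-1]


def _iter_named(elem, name):
    """Yield descendants (in document order) whose local name matches."""
    return (e for e in elem.iter() if _local(e.tag) == name)


def _words(line):
    """Group a line's charParams into words, splitting on whitespace."""
    words, current = [], []
    for ch in _iter_named(line, "charParams"):
        if (ch.text or "").strip() == "":
            if current:
                words.append(current)
                current = []
        else:
            current.append(ch)
    if current:
        words.append(current)
    return words


def _word_span(chars) -> str:
    """Emit one ocrx_word span whose box is the union of its characters."""
    left = min(int(c.get("l")) for c in chars)
    top = min(int(c.get("t")) for c in chars)
    right = max(int(c.get("r")) for c in chars)
    bottom = max(int(c.get("b")) for c in chars)
    text = escape("".join(c.text for c in chars))
    return (f"<span class='ocrx_word' "
            f"title='bbox {left} {top} {right} {bottom}'>{text}</span>")


def abbyy_to_hocr(abbyy_xml: str) -> str:
    """Convert an ABBYY XML string into a minimal hOCR HTML string."""
    root = ET.fromstring(abbyy_xml)
    out = ["<html><body>"]
    for n, page in enumerate(_iter_named(root, "page"), start=1):
        out.append(
            f"<div class='ocr_page' id='page_{n}' "
            f"title='bbox 0 0 {page.get('width')} {page.get('height')}'>"
        )
        for line in _iter_named(page, "line"):
            bbox = " ".join(line.get(k) for k in ("l", "t", "r", "b"))
            spans = " ".join(_word_span(w) for w in _words(line))
            out.append(
                f"<span class='ocr_line' title='bbox {bbox}'>{spans}</span>"
            )
        out.append("</div>")
    out.append("</body></html>")
    return "\n".join(out)
```

The returned string is what would be stored in ENCODED_OCR. Note that `ET.fromstring` rejects a `str` that carries an XML encoding declaration, so real files are best read as bytes and passed through unchanged.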

Digital Humanities Module (Advanced Book Solution Pack)

  • Keep the regular tab labels and just define a new tab called critical editing.

  • Relevant datastreams and functions

  • backward compatibility

  • Alan's work relies on the current release of the solution pack. The Digital Humanities Module is currently two modules (cwrc and shared canvas), which will be amalgamated.

Datastreams

PageCModel

Default datastreams

RELS-EXT
POLICY
DC

Descriptive metadata

MODS

Archival binary

ARCHIVAL_BINARY (Type is TIFF)

Technical metadata derived from the archival binary

TECHMD

Image-based derivatives

TN
PAGE (type is PDF)
LOSSY_DERIVATIVE (type is JP2)

Text-based derivatives

OCR (TEXT)
ENCODED_OCR (HTML)
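
The PageCModel datastream list above could be captured as a small table for checking ingested pages. This is a sketch only: the DSIDs and groupings come from the list, while the MIME types are assumptions inferred from the stated types (TIFF, PDF, JP2, TEXT, HTML) and are left as `None` where the notes give no type.

```python
# DSID -> (group, MIME type or None when the notes do not state a type).
# MIME types are assumptions derived from the types named in the list.
PAGE_CMODEL_DATASTREAMS = {
    "RELS-EXT": ("default", None),
    "POLICY": ("default", None),
    "DC": ("default", None),
    "MODS": ("descriptive metadata", None),
    "ARCHIVAL_BINARY": ("archival binary", "image/tiff"),
    "TECHMD": ("technical metadata", None),
    "TN": ("image-based derivative", None),
    "PAGE": ("image-based derivative", "application/pdf"),
    "LOSSY_DERIVATIVE": ("image-based derivative", "image/jp2"),
    "OCR": ("text-based derivative", "text/plain"),
    "ENCODED_OCR": ("text-based derivative", "text/html"),
}


def missing_datastreams(present_dsids):
    """Return the expected DSIDs a page lacks, in the order listed above."""
    present = set(present_dsids)
    return [dsid for dsid in PAGE_CMODEL_DATASTREAMS if dsid not in present]
```

A page that has only `RELS-EXT`, `DC`, and `ARCHIVAL_BINARY` would, for instance, be reported as missing every derivative and metadata datastream, which could drive a derivative-generation step.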

BookCModel

Generate PDF on the fly. Store MODS, DC, RELS-EXT.

IABookCModel

Different RELS-EXT.

Other
