vpeil/SWIB2014-Catmandu

Catmandu - Importing, transforming, storing and indexing data should be easy

SWIB2014, 1-3 December 2014, Bonn, Germany

Johann Rolschewski / Jakob Voß

Staatsbibliothek zu Berlin, Germany / Verbundzentrale des GBV (VZG), Germany

Abstract

Catmandu provides a suite of software modules to ease the import, storage, retrieval, export and transformation of metadata records. Combine Catmandu modules with web application frameworks such as PSGI/Plack, document stores such as MongoDB and full-text indexes such as Elasticsearch to create a rapid development environment for digital library services. After a short introduction to Catmandu and its features, we will present the domain-specific language (DSL) and the command-line interface (CLI). Participants will be guided to transform their own data records to a common metadata model, to store and index them in Elasticsearch or MongoDB, and to export them as Linked Data.
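
To make the import, transform and export flow described above more concrete, here is a minimal conceptual sketch. It is written in plain Python as a stand-in and is not Catmandu code: Catmandu itself is implemented in Perl and uses its own Fix language. The sample data and the names `import_csv`, `move_field`, `apply_fixes` and `export_json` are hypothetical and exist only for this illustration.

```python
import csv
import io
import json

# Hypothetical sample input; the field names are made up for this sketch.
CSV_DATA = "id,name,year\n1,Moby Dick,1851\n2,Walden,1854\n"


def import_csv(text):
    """Importer step: turn CSV text into a stream of record dicts."""
    yield from csv.DictReader(io.StringIO(text))


def move_field(old, new):
    """Build a 'fix' that renames one field of a record."""
    def fix(record):
        if old in record:
            record[new] = record.pop(old)
        return record
    return fix


def apply_fixes(records, fixes):
    """Transformation step: run every fix, in order, on every record."""
    for record in records:
        for fix in fixes:
            record = fix(record)
        yield record


def export_json(records):
    """Exporter step: serialise the records as one JSON object per line."""
    return "\n".join(json.dumps(r, sort_keys=True) for r in records)


# Map the source field "name" onto a common field "title".
fixes = [move_field("name", "title")]
output = export_json(apply_fixes(import_csv(CSV_DATA), fixes))
print(output)
```

The point of the sketch is the shape of the pipeline: records stream from an importer through a list of small, reusable fixes into an exporter. A store such as MongoDB or Elasticsearch would take the place of the exporter in the same chain.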

Prior Experience

We will be using a simplified domain-specific language (DSL). Participants should be familiar with command-line interfaces (CLI). Programming experience is welcome but not required.

Requirements

A laptop with VirtualBox installed. The organisers will provide a VirtualBox image (Linux guest system) beforehand. You can also install your own environment; see "Installation". Participants should bring their own data (CSV, JSON, MAB2, MARC, PICA+, RDF or YAML).

VM

See Catmandu_VM.md.

Agenda

  • Introduction to LibreCat and Catmandu
    • Project LibreCat
    • Software Catmandu
  • The command line interface (CLI)
    • Introduction
    • Formats: JSON, MAB2, MARC, PICA, YAML, XLS
    • Stores: MongoDB & Elasticsearch
    • APIs: getJSON, OAI, SRU
    • Configuration
  • The domain specific language
    • Introduction
    • Fixes
    • Dictionaries
  • RDF
    • Introduction
    • Import RDF data
    • Export RDF data
  • Overview of Catmandu architecture
    • Catmandu::Importer
    • Catmandu::Fix
    • Catmandu::Cmd

Links

librecat.org

LibreCat:github

Catmandu:wiki

Catmandu:cpan

MAB2

MARC

KIM:RDF

PICA

MongoDB

Elasticsearch
