forked from nationalarchives/OWD

Operation War Diary raw data processing


bogden1/OWD

 
 

OWD

Operation War Diary raw data processing

Summary

The code will eventually include perlpod documentation and comments; until then, I'll summarise the raw dataset here.

The data contributed by Operation War Diary volunteers is provided to us by Zooniverse as a MongoDB database consisting of four collections:

  • war_diary_groups (where each document is a National Archives Catalogue Item: a war diary, or one part of a diary that was too large to provide as a single download. Each group has a zooniverse_id field, which the pages/subjects collection uses as its foreign key)
  • war_diary_subjects (where each document is a page from a diary. Each subject has its own zooniverse_id field, which the classifications collection uses as its foreign key. The subject's groups.zooniverse_id field references the group (diary) that the page belongs to)
  • war_diary_classifications (where each document represents the collection of classifications made by a single user. Each completed page therefore has between 5 and 7 classifications)
  • war_diary_users (where each document is a volunteer contributor)
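
The relationships between the collections can be sketched with a small in-memory example. This is only an illustration, written in Python rather than the repository's Perl, and it does not touch MongoDB. The only field name taken from the summary above is zooniverse_id. The flat keys group_id and subject_id, the user key, and all of the ID values are hypothetical stand-ins for however the real documents store those references.

```python
from collections import defaultdict

# Toy stand-ins for three of the collections. Only `zooniverse_id` comes from
# the dataset summary; `group_id`, `subject_id`, `user` and every ID value
# are invented for this sketch.
groups = [{"zooniverse_id": "group-1"}]
subjects = [
    {"zooniverse_id": "page-1", "group_id": "group-1"},
    {"zooniverse_id": "page-2", "group_id": "group-1"},
]
classifications = [
    {"user": "user-a", "subject_id": "page-1"},
    {"user": "user-b", "subject_id": "page-1"},
    {"user": "user-a", "subject_id": "page-2"},
]


def index_by(documents, key):
    """Group documents into lists keyed by the value of `key`."""
    grouped = defaultdict(list)
    for doc in documents:
        grouped[doc[key]].append(doc)
    return grouped


def classification_counts(groups, subjects, classifications):
    """Map each group ID to {page ID: number of classification documents}."""
    pages_by_group = index_by(subjects, "group_id")
    classes_by_page = index_by(classifications, "subject_id")
    return {
        g["zooniverse_id"]: {
            p["zooniverse_id"]: len(classes_by_page.get(p["zooniverse_id"], []))
            for p in pages_by_group.get(g["zooniverse_id"], [])
        }
        for g in groups
    }


counts = classification_counts(groups, subjects, classifications)
print(counts)
```

Indexing the pages and classifications once, rather than scanning them again for every diary, keeps the join linear in the number of documents, which matters when each completed page carries several classification documents.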

Languages

  • Perl 100.0%