Crowdsourcing platform for full text transcription and tagging
Switch branches/tags
Clone or download
rstorey Merge pull request #645 from LibraryOfCongress/in-progress-bugfix
Look up status key by value in asset detail view
Latest commit 1480e49 Nov 15, 2018
Permalink
Failed to load latest commit information.
.github/ISSUE_TEMPLATE Add questions to the template for clarity Sep 24, 2018
cloudformation Merge pull request #604 from LibraryOfCongress/export-to-s3 Nov 15, 2018
concordia look up status key by value in asset detail view Nov 15, 2018
docs Update developer documentation Nov 13, 2018
exporter Merge pull request #604 from LibraryOfCongress/export-to-s3 Nov 15, 2018
importer Fix docker container build for pylibmc dependency Nov 7, 2018
jenkins_scripts Adding dependencies required for pylibmc to jenkins server Nov 8, 2018
postgresql add required semicolon to drop database statement Oct 31, 2018
static-pages Update latest.md Nov 15, 2018
static Make /favicon.ico work Oct 26, 2018
.coveragerc Enable coverage.py Oct 5, 2018
.editorconfig Add config for linters Jun 15, 2018
.eslintrc.yaml Add config for linters Jun 15, 2018
.gitignore Fix staticfiles configuration Oct 26, 2018
.pre-commit-config.yaml Add flake8 back Nov 13, 2018
.prettierrc Add config for linters Jun 15, 2018
.stylelintrc.yaml Fix stylelint indentation rules Nov 2, 2018
.travis.yml Add coverage run command Nov 6, 2018
Dockerfile Fix docker container build for pylibmc dependency Nov 7, 2018
LICENSE.md Closes #458, make license file detectable by GH community profile Nov 1, 2018
MANIFEST.in Add setuptools_scm to make application version available in Django, R… Oct 4, 2018
Makefile Remove deployment configuration for separate services Nov 7, 2018
Pipfile Add pylint to dev packages, modify exporter test case setup Nov 7, 2018
Pipfile.lock Add pylint to dev packages, modify exporter test case setup Nov 7, 2018
README.md Add badge to README Nov 2, 2018
build_containers.sh Add flexibility for docker container tag names in build script Nov 15, 2018
docker-compose.yml Remove obsolete AWS_ACCESS_KEY settings Oct 10, 2018
entrypoint.sh Call `raven test` on startup Nov 9, 2018
manage.py Did flake8, black, isort Jul 17, 2018
package.json Add toolchain for static image compression Nov 13, 2018
setup.cfg Replace FAQ app with static page Sep 25, 2018
setup.py Add setuptools_scm to make application version available in Django, R… Oct 4, 2018
tasks.py Did flake8, black, isort Jul 17, 2018

README.md

Build Status Coverage Status

Welcome to Concordia

Concordia is a platform developed by the Library of Congress (LOC) for crowdsourcing transcription and tagging of text in digitized images. The first iteration of Concordia was launched as crowd.loc.gov in the autumn of 2018.

The application asks volunteers to transcribe and tag digitized images of manuscripts and typed materials from the Library’s collections that cannot be translated well by optical character recognition (OCR). All transcriptions are made by volunteers and reviewed by volunteers. The completed transcriptions will be returned to back to loc.gov to improve search, readability, and access to handwritten and typed documents.

Concordia leverages the LOC’s API to pull materials from the Library's catalog. In future developments, completed transcriptions will be exported as a single document, in bulk by item, project or campaign, or as BagIt bags.

Concordia and crowd.loc.gov are supported by the National Digital Library Trust Fund.

Want to help?

We are so excited that you want to jump right in. To get started:

  1. Check out our CONTRIBUTING page and see the different ways you can help out.
  2. Next, take a look at How we work, there you'll learn more about how we use GitHub and what we are looking for if you are contributing code.
  3. To learn how to set up the Concordia on your computer, check out the For Developers page.