Skip to content

lyceum-quest/texts

Repository files navigation

Lyceum Texts

Data repository for all Lyceum reader texts. Part of the lyceum-quest GitHub organization.

What is this?

This repository contains processed Greek texts ready for use with the Lyceum reader toolchain. Each text includes structured data, interlinear translations, morphological analyses, and quality assurance reports.

Structure

Each text is stored in its own slug directory with the following contents:

text-slug/
├── manifest.json      # Metadata (title, author, language, edition)
├── state.json         # Pipeline state and progress tracking
├── raw/               # Original source files
├── clean/             # Cleaned and normalized Greek text
├── structured/        # JSON-structured text with citations
├── interlinear/       # Word-by-word analyses and translations
├── qa/                # Quality assurance reports
├── replay/            # LLM interaction logs
└── ...                # Additional subdirectories (sources/, versification/, etc.)

Database Files

The following database files are distributed as GitHub Release assets, not checked into git:

  • editions.db (320K) — Edition metadata and source tracking
  • texts.db (324K) — Core text content and structure
  • morph.db (380M) — Morphological analyses (Perseus/Morpheus)
  • lsj.db (29M) — Liddell-Scott-Jones dictionary

Download these from the latest release.

Current Texts

  • Aesop's Fables #150 (aesop-fable-150)
  • Homer, Iliad Proem (1.1–5) (homer-iliad-proem-1-1-5)
  • Homer, Odyssey Proem (1.1–5) (homer-odyssey-proem-1-1-5)
  • Marcus Aurelius, Meditations Book 1 (marcus-aurelius-meditations-book-1)

Generation

All content is generated by the orchestrator tool, which orchestrates:

  • Text ingestion and validation
  • Morphological analysis and lemmatization
  • Interlinear translation generation (LLM-assisted)
  • Quality assurance and consistency checks

Usage

Set the LYCEUM_TEXTS_DIR environment variable to point to this directory:

export LYCEUM_TEXTS_DIR=/path/to/texts

Lyceum tools will use this path to discover available texts and load their data.

Versioning

This repository uses Calendar Versioning (CalVer):

vYYYY.MM.DD

Each release includes:

  • The full text workspace directories
  • Database files as release assets
  • A changelog describing what texts were added or updated