Skip to content

C03: Add Rachel Bluwstein — 4 PDM-1.0 pages, highest-res modern Hebrew cursive #6

@shaypal5

Description

@shaypal5

Summary

Add Rachel Bluwstein (Rachel the Poetess, 1890–1931) as the second writer in the corpus. Four PDM-1.0 pages are available in the upstream repo, including one explicitly identified as the highest-resolution Bluwstein scan on Wikimedia Commons. Her handwriting is clear, flowing secular Hebrew cursive — a stylistically distinct contrast to Bialik's literary hand.

Writer record

Field Value
writer_id rachel_bluwstein
display_name Rachel Bluwstein
also_known_as Rachel the Poetess, רחל המשוררת, Rachel Bluwstein-Sela, Рахель Блувштейн
born / died 1890 / 1931 — PDM in all life+70 jurisdictions
scripts_written Hebr
languages_written he, ru
period 1910–1931
VIAF https://viaf.org/viaf/5722988/
Wikipedia https://en.wikipedia.org/wiki/Rachel_Bluwstein

Upstream sources

entry_id license Notes
commons__rachel_gan_naul__p0001 PDM-1.0 Highest-resolution Bluwstein scan on Commons — start here
commons__rachel_rak_al_atzmi__p0001 PDM-1.0 Poem Only About Myself
commons__rachel_aqara_1928__p0001 PDM-1.0 Poem Aqara (Barren Woman), dated 1928
commons__begani_netatikha__p0001 PDM-1.0 Poem In My Garden

All four: rights_basis: public_domain, attribution_required: false.

Acceptance criteria

  • New rachel_bluwstein writer row in writers.jsonl with status: verified
  • At least 15 distinct letter forms added (target: full 27-form coverage across the four pages)
  • Additional variants of the same letter (from different pages) ingested as v0002, v0003, etc.
  • All entries pass python3 scripts/validate_indexes.py --upstream-path <upstream>
  • python3 scripts/generate_release_artifacts.py --check passes
  • python3 -m pytest passes

Ingest notes

  • Open commons__rachel_gan_naul__p0001 first — highest resolution = best per-letter pixel density
  • entry_id format: rachel_bluwstein__<letter_name>__v0001
  • File path: data/letters/rachel_bluwstein/<letter_name>/rachel_bluwstein__<letter_name>__v0001.<ext>
  • If the same letter appears across multiple pages, add further variants rather than skipping — variant diversity is valuable for HTR training
  • letter.style should reflect her secular Ashkenazi cursive hand

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions