improve pylaia dataset parser by milanalimova · Pull Request #12 · achimrabus/polyscriptor

milanalimova · 2026-02-10T11:29:36Z

example usage:
python convert_to_pylaia_new.py --input_train_csv output_from_transkribus_parser\train.csv --input_val_csv output_from_transkribus_parser\val.csv --output_dir output_dir\ --train_img_root output_from_transkribus_parser\ --val_img_root output_from_transkribus_parser\ --height 96 --process_images_from train

…eklia.com/pylaia/usage/datasets/format/

Parser now accepts both "image.png text" (space) and "image.png,text" (CSV) formats, auto-detected per line. Fixes compatibility with convert_to_pylaia.py variants and milanalimova's PR #12 without breaking existing datasets. Also updates module docstring to reference Puigcerver (2017) architecture and clarify that the PyLaia package is not required.
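A rough sketch of how per-line auto-detection between the two formats could work. This is not the code from this PR: the function name `parse_line` and the heuristic (whichever of comma or space comes first ends the image name) are assumptions made for illustration only, and the heuristic would misread image names that themselves contain spaces or commas.

```python
from typing import Tuple


def parse_line(line: str) -> Tuple[str, str]:
    """Split one dataset line into (image_name, text).

    Hypothetical helper, not taken from the PR. Assumes the image name
    contains neither a space nor a comma, so the first of those two
    characters marks the end of the image name.
    """
    line = line.rstrip("\r\n")
    comma = line.find(",")
    space = line.find(" ")

    if comma != -1 and (space == -1 or comma < space):
        # "image.png,text" (CSV style): split on the first comma only,
        # so commas inside the transcription are preserved.
        image, text = line.split(",", 1)
    elif space != -1:
        # "image.png text" (space style): split on the first space only.
        image, text = line.split(" ", 1)
    else:
        # No separator at all: treat the whole line as an image name
        # with an empty transcription.
        image, text = line, ""
    return image, text


if __name__ == "__main__":
    for sample in ("image.png text", "image.png,text"):
        print(parse_line(sample))
```

Because detection runs on each line independently, a file that mixes both styles would still be read under these assumptions.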

milanalimova added 4 commits February 10, 2026 11:28

Possible fix for dataset convertation for pylaia

1a8b6fa

Possible fix for dataset convertation for pylaia

909fee0

Possible fix for dataset convertation for pylaia

2a48df5

Adjusted the old format for pylaia dataset according to https://doc.t…

24b6426

…eklia.com/pylaia/usage/datasets/format/

achimrabus mentioned this pull request Feb 20, 2026

convert_to_pylaia.py with ids instead images not expected by train_pylaia.py #11

Closed

milanalimova wants to merge 4 commits into achimrabus:main from milanalimova:fix-pylaia-dataset-parser
