Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Empty output files created when input path contains spaces #23

Open
andrewphilipsmith opened this issue Jan 26, 2022 · 0 comments
Open
Labels
bug Something isn't working
Milestone

Comments

@andrewphilipsmith
Copy link
Contributor

How to reproduce

Try these command line options:

./src/alto2txt/extract_publications_text.py -p single "~/example path with spaces/input data/0002647" "~/example path with spaces/output files"

Actual output:

  • The output directory structure is created correctly
  • The files <source>_<year><date>_art<xxxx>_metadata.xml are create and populated (presumably correctly)
  • The files <source>_<year><date>_art<xxxx>.txt are created, but contain only a single newline char and no other text

Expected output:

The output *.txt files should contain the relevant article text.

@andrewphilipsmith andrewphilipsmith added the bug Something isn't working label Jan 26, 2022
@andrewphilipsmith andrewphilipsmith added this to the v0.4 milestone Jun 30, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

1 participant