Skip to content

Conversation

@MicheleNuijten
Copy link
Owner

Replace pdf conversion via Xpdf with the functionality of the R package pdftools. Streamline the testing of file-to-text conversions with a manual reference spreadsheet.

* realign test with regex: accept the letter l as df 1.
* more stable programming; instead of hardcoding what to expect, retrieve manual result from .csv file with manually coded results
this includes some workarounds to deal with weird encoding and typesetting (e.g., columns)
… an error ("input string 1 is invalid UTF-8").
…html version (these are hidden behind a link)
@MicheleNuijten MicheleNuijten merged commit 0038858 into master Jul 26, 2024
@MicheleNuijten MicheleNuijten deleted the update-pdftext branch July 26, 2024 08:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants