Make IIIF Print split remote PDF URLs in Bulkrax #97

Closed
laritakr opened this issue Jan 27, 2023 · 3 comments

laritakr commented Jan 27, 2023

IIIF Print Bulkrax imports do not work with remote PDF files. They create only the parent work and its file set.

The following file should be imported into a parent GenericWork AND ALSO create a child Image TIFF work.

i97-sample-remote-fileset.csv

Note: a CSV with the remote URL in the GenericWork row will NOT work. This would require changes to Bulkrax.

laritakr changed the title from "Make IIIF Print split remote PDF URLs" to "Make IIIF Print split remote PDF URLs in Bulkrax" on Jan 27, 2023

ShanaLMoore commented Feb 6, 2023

This scenario is also not hitting the actor.

A remote URL may not always include or end with "pdf". But even if you have a URL that ends in "pdf", it doesn't hit the actor.

How do we know it's a PDF, so that we know when to split it?

We probably need to update the splitting mechanism in the job, with IO processing to identify that it's a PDF.

We will also need to consider amending Bulkrax.
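
One possible way to answer the "how do we know it's a PDF" question without relying on the URL is to look at the first bytes of the downloaded file, since PDF files begin with the `%PDF-` signature. The following is only a minimal sketch under that assumption. The `pdf_io?` helper and the `PDF_MAGIC` constant are hypothetical names, not part of IIIF Print, Bulkrax, or Hyrax, and where such a check would be called from the job is left open.

```ruby
require "stringio"

# Byte signature that opens a PDF file.
PDF_MAGIC = "%PDF-".b

# Hypothetical helper (not an IIIF Print or Bulkrax API).
# Returns true when the IO's content starts with the PDF signature,
# regardless of the file name or URL it came from.
def pdf_io?(io)
  original_pos = io.pos
  io.rewind
  header = io.read(PDF_MAGIC.bytesize)
  !header.nil? && header.b == PDF_MAGIC
ensure
  # Restore the read position so later processing sees the whole file.
  io.seek(original_pos) if original_pos
end

# Usage with in-memory data standing in for a downloaded remote file:
pdf_io?(StringIO.new("%PDF-1.7\n"))    # => true
pdf_io?(StringIO.new("not a pdf"))     # => false
```

Checking the content rather than the extension would cover remote URLs that do not end in "pdf". It does not by itself address the separate problem noted above that this import path is not hitting the actor.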

@laritakr

Test via hyku branch i98-iiif-print-fileset-row at samvera/hyku#1912

@laritakr

jeremyf closed this as completed on Feb 20, 2023