Add content-type: check to externally retrieved images #200

Open
joe-lipson opened this issue Apr 22, 2024 · 0 comments

joe-lipson commented Apr 22, 2024

Rose has reported that a significant number of images in a large (200k records) upcoming Australian Museum images ingest have an invalid image URL. We can let AM know about these records, and it's up to them to fix them, but what seems to be happening is that when the image service goes to fetch the image, the AM servers come back with a 200 OK but the page is text. The image server then seems to save this text as a file and present it as an image, which shows as broken in the UI.

An example of a good image coming from AM is
http://203.22.224.10/collection/imu/request.php?request=Multimedia&method=fetch&key=1112947&filter=width:bf:400

A bad one, which still returns a 200 OK:
http://203.22.224.10/collection/imu/request.php?request=Multimedia&method=fetch&key=1112347&filter=width:bf:400

And this is how a bad image looks after ingest
https://images-test.ala.org.au/image/384ed76f-6961-450b-97b6-e23250807c08

https://images-test.ala.org.au/?q=&fq=dataResourceUid%3Adr340&offset=150&max=50&sort=dateUploaded&order=desc

Would it be possible for the image-service to check the Content-Type header for external images and reject bad ones? This seems like it would be a good thing to have for all image ingests.
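
To make the request concrete, here is a minimal sketch of the kind of check I have in mind. It is written in Python with the standard library only, purely for illustration; it is not the image-service's actual code, and the names `media_type`, `is_image_content_type`, `NotAnImageError` and `fetch_image` are hypothetical. It also assumes the bad AM responses carry a non-image Content-Type (for example `text/html`), which I have not confirmed.

```python
import urllib.request
from typing import Optional


class NotAnImageError(Exception):
    """Raised when a fetched URL does not declare an image media type."""


def media_type(content_type: Optional[str]) -> str:
    """Return the lowercased media type, dropping parameters such as charset."""
    if not content_type:
        return ""
    return content_type.split(";", 1)[0].strip().lower()


def is_image_content_type(content_type: Optional[str]) -> bool:
    """True only when the header declares an image/* media type."""
    return media_type(content_type).startswith("image/")


def fetch_image(url: str, timeout: float = 30.0) -> bytes:
    """Fetch a URL and return its bytes, rejecting non-image responses.

    urlopen already raises HTTPError for 4xx/5xx responses, so reaching
    the header check means the server answered with a success status.
    """
    with urllib.request.urlopen(url, timeout=timeout) as response:
        content_type = response.headers.get("Content-Type")
        if not is_image_content_type(content_type):
            raise NotAnImageError(
                f"{url} returned Content-Type {content_type!r}, not an image"
            )
        return response.read()
```

A missing or empty header is treated as a rejection here, which is a design choice and could be relaxed. A header check alone would not catch a server that labels text as `image/jpeg`, so it may be worth pairing it with a check that the bytes actually decode as an image.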
