You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Images that already meet Bluesky's specifications are always recompressed, leading to images that degrade over time if re-uploaded. This means that each image is stored with different content hashes, provided by the CDN as a different file, etc. If matching images, a consumer needs to download and use a perceptual hash.
It would be useful to switch to an idempotent alternative, where images are processed in a way that preserves the original as much as possible.
This would enable:
Better caching
File hashes & cids could now be used to identify re-uploads; which is useful for labelling & identifying scraped content
Preservation of the highest quality images - including compression profiles for tools like photoshop that 'play nice' with the predictable settings to maximise perceived quality
Other cool things - search by image to identify a source, automatically proposed alt-text, etc.
Attachments
No response
Describe Alternatives
No response
Additional Context
I have used this in bash before with imagemagick's convert - and it does the job. I imagine similar settings can be find in whatever the conversion library is?
I'll need to set up a test harness to do this myself with a PR (so pretty please, someone else, etc. etc.), would have to work out the PNG stuff, and make sure the min/max resizing further up is 'non destructive' to repeat images.
Describe the Feature
Images that already meet Bluesky's specifications are always recompressed, leading to images that degrade over time if re-uploaded. This means that each image is stored with different content hashes, provided by the CDN as a different file, etc. If matching images, a consumer needs to download and use a perceptual hash.
It would be useful to switch to an idempotent alternative, where images are processed in a way that preserves the original as much as possible.
This would enable:
Attachments
No response
Describe Alternatives
No response
Additional Context
I have used this in bash before with imagemagick's convert - and it does the job. I imagine similar settings can be find in whatever the conversion library is?
The text was updated successfully, but these errors were encountered: