Improve ImageDecoder trait #1869

fintelia · 2023-03-03T05:44:14Z

This sketches out possible changes we may want to make to the ImageDecoder trait in the next major release:

Makes the trait object safe
~~Adds a constructor (all non-deprecated implementations had one, but it wasn't part of the trait).~~
~~Combines ImageDecoder and ImageDecoderRect~~
Removes the into_reader, *_with_progress, and scanline_bytes methods. This is the aspect I'm least sure about, but few backends had meaningful implementations of these and it cuts down on the number of methods on the trait. Some decoders will still be able to support incremental decoding by doing multiple read_rect calls

The object safety change in particular makes our internal usage of the trait much nicer. Instead of having to work with a DecoderVisitor trait, we can rely on dynamic dispatch.

~~For now I've held on updating the various format decoders. That'll be a bunch of work so it makes sense to settle on what the API should be first.~~

Edit: scaled back some of these changes and updated all the format decoders.

sophie-h · 2023-05-05T01:29:43Z

While working on image-rs I got the idea that this trait could go into something like an image-decoder crate.

This would allow the format-specific crates to already implement ImageDecoder, which could then be used by image-rs. That could bring more of the format-specific code into one place and would give more overall flexibility with coupling things into image-rs and having a common abstraction trait already usable when a format crate is not yet used in image-rs.

fintelia · 2023-05-06T01:38:58Z

Yeah, it would be nice to restructure that way. The challenge is that we have to be very confident in the trait definitions we pick, because making any backwards incompatible changes to them would cause tons of ecosystem churn

HeroicKatora · 2023-05-06T09:57:32Z

The impact from a breaking change in another trait could be mitigated by some choices in usage. It's technically possible to depend on multiple versions of a single crate. What's hard is using the trait only in such ways that bumping to a new version is not a breaking change downstream.

For instance, consider an opaque wrapper that happens to implement From<Box<dyn image_core::ImageDecoder>>. Then adding a new version of the trait would be akin to having to add From<Box<dyn image_core_v2::ImageDecoder>> where image_core_v2 is just an alias for the new major version of the crate. And then we can be cautious with slowly deprecating the old versions. The main costs are compiling with multiple old versions that must be managed (and this pattern interacts badly with cargo vendor which may be used in some distributions? not quite sure).

johannesvollmer · 2023-09-22T06:31:32Z

I think a progress callback is an essential feature for such a mature library. It's fine to not have it for small prototypes, but any real ui app or game would want to display some kind of progress. Instead, we should work on adding it to the decoders and encoders. For hdr, for example, it would be trivial to implement.

I think it's more valuable to have it, even if support is rare, than to remove it and add it later again.

Here's something we could try: Code up a wrapper around std::io::Write and std::io::Read that tracks the number of bytes being written or read. Many codecs could use this to implement the progress callback quickly, albeit imprecise. I could try to sketch out that idea

fintelia · 2023-09-22T17:30:34Z

Retaining read_image_with_progress with tweaks to make it object safe wouldn't be too hard. And I do agree that in principle it could be useful (that's why we added it in the first place!). The two questions I have:

Is the interface right? I don't have a good sense of how useful decoding progress is without the ability to get partial image contents as they are ready. Or at a more basic level, is having a Progress struct worth it over just an f32 saying indicating the percent completed? And it makes me nervous that the many of the third-party format crates we're using or planning to use don't seem provide a way to get progress callbacks (including qoi, dav1d, and zune-jpeg).
Is there enough implementation interest? Adding APIs is a lot easier than implementing them for all the image formats. It really isn't great to have an interface that appears to work for everything but essentially amounts to a no-op for most of them

johannesvollmer · 2023-09-22T18:02:08Z

About your second point: There is a way to tack-on progress on existing decoders: Observe the bytes that they read. But its only meaningful if we can make two assumptions:

the decoders implementation reads the file bytes incrementally
the compressed file is roughly as large as the in-memory representation

if the assumptions don't hold, the progress can still be useful, but will be imprecise

johannesvollmer · 2023-09-22T18:17:58Z

i sketched the idea at #2014

about your first point: i think a progress struct is nice. a percentage float would probably be enough though, but it would only be worth it, if our goal is to reduce the complexity in the repo. if we accept the code complexity of the progress struct, we can just tack on a ratio_0_to_1() ->f32 on the Progress to make it nicer to use. another advantage of f32 progress is that existing third party implementations might only expose that information, and not expose the total byte count.

maybe struct Progress { ratio: f32, order_of_magnitude: Option<u64> }?

An advantage of current & total would be that if you have multiple images, the sum of all progresses could be computed more precisely.

johannesvollmer · 2023-09-25T08:05:53Z

into_reader, *_with_progress, and scanline_bytes

I think this is the most interesting part. Why don't most implementations provide that functionality? How can we take away the pain of implementing those features?

Looking at the implementation of the hdr decoder, which has a loop that goes through all the scan lines.

The implementation interest is something we cannot change. But we can make it easier for the implementations to add support for the features.

The think core problem might be this: Those functions make assumptions about the internal file format layout. They can be easily implemented if the file stores scanlines from top to bottom. In any other case, they is no easy way to implement that functionality, or it would be extemely inefficient to the point of uselessness (example: to_reader reads the whole file and then just returns all the bytes at once, defeating the point of providing a reader). Many formats don't even read scanlines, but tiles.

Maybe it would make sense to relax the requirements, by allowing the reader to read contents in arbitrary order, instead of forcing scanlines top to bottom. This might be a nice decider api to work with (but still cumbersome for implementations):

let decoder = open("x.jpg")?;
let reader = decoder.reader(); // or optionally .sub_rect_reader(rect)

let mut pixel_buffer = vec![0_u8, reader.total_byte_size()];

loop {
    let progress = reader.read_next(&mut pixel_buffer)?;
   // note: the decoder can write any number of pixels anywhere into the buffer, in whatever order they appear in the file.
   // this might be scanlines or tiles or even individual pixels, or even the whole image.
   // maybe the progress object also returns the location of the pixels that were changed?
   // the intermediate buffer and pixel coordinates can be used for an incremental image display

    if progress.done() { return Ok(pixel_buffer) }
}

This API would allow us to build a lot of convenience and abstraction on top of it.

trait Decoder {
    // ...
    fn read_next(&self, pixel_buffer: &mut [u8]) -> Result<Progress>;
}

// we know the total number of pixels, so knowing the indices of the pixels that changed is enough to compute a reliable progress ratio: progress.bytes_writtyen_to_pixel_buffer.len() as f32 / total_byte_size as f32;
enum Progress {
    ProcessedImageTile (TileIndices),
    ProcessedPixelBufferBytes (Range<usize>),
    // .. possibly more
    Done,
}

This design requires the decoders to keep some state, which might be cumbersome. Maybe we can abuse async/await syntax to write code that looks like a generator function?

This design also will not work with external decoders that currently have a closure/callback mechanism for the progress.

Unfortunately, this question will remain: What about the decoders that have external implementations and don't support any of this?

fintelia · 2023-09-27T01:04:46Z

Another thing that makes it annoying to convert decoders is that you want to reasonably throttle how often you call the progress callback. Too often and you'll destroy performance. Not often enough and the user won't get progress indications. Even just checking "should I invoke the callback" too often might interfere with SIMD and slow down our fastest encoders (which can hit multiple hundred million pixels per second!)

I also realized that the current function signature won't work when trying to make the trait object safe. No methods with generic arguments can be called on a trait object. At which point, perhaps we might as well make a separate ImageDecoderProgress trait

fintelia mentioned this pull request Mar 17, 2023

Add GenericDecoder implementing ImageDecoder #1895

Closed

sophie-h mentioned this pull request May 15, 2023

WebP Extended animation format needs to be rendered into frames… #1925

Closed

fintelia mentioned this pull request Jun 10, 2023

Version 0.25.x #1943

Closed

19 tasks

fintelia mentioned this pull request Aug 27, 2023

Deprecate incremental decoding #1990

Merged

fintelia changed the base branch from master to next-version-0.25 February 11, 2024 22:36

fintelia force-pushed the decoder-v2 branch from e764281 to c8ec8f0 Compare February 11, 2024 23:15

fintelia added 9 commits February 11, 2024 15:25

Sketch possible V2 for ImageDecoder

ee906a6

Remove unused code in TGA decoder

b9186de

Fix warnings

8b55c3b

Remove another unused method

9e17d1e

Fix tests

070905c

cargo fmt

ba67729

Clippy fixes

c87d1a5

more clippy fixes

2a18d72

Update avif module

a014adf

fintelia force-pushed the decoder-v2 branch from c8ec8f0 to a014adf Compare February 11, 2024 23:26

cargo fmt

f25d318

fintelia changed the title ~~Sketch possible V2 for ImageDecoder~~ Improve ImageDecoder trait Feb 11, 2024

fintelia marked this pull request as ready for review February 12, 2024 00:02

fintelia added 2 commits February 11, 2024 16:10

Revert changes to .github/workflows/rust.yml

09f1c38

But dxt feature is removed

e9c78b7

fintelia merged commit 79ecd0f into image-rs:next-version-0.25 Feb 12, 2024
34 checks passed

fintelia deleted the decoder-v2 branch February 12, 2024 00:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve ImageDecoder trait #1869

Improve ImageDecoder trait #1869

fintelia commented Mar 3, 2023 •

edited

sophie-h commented May 5, 2023

fintelia commented May 6, 2023

HeroicKatora commented May 6, 2023

johannesvollmer commented Sep 22, 2023

fintelia commented Sep 22, 2023 •

edited

johannesvollmer commented Sep 22, 2023 •

edited

johannesvollmer commented Sep 22, 2023 •

edited

johannesvollmer commented Sep 25, 2023 •

edited

fintelia commented Sep 27, 2023

Improve ImageDecoder trait #1869

Improve ImageDecoder trait #1869

Conversation

fintelia commented Mar 3, 2023 • edited

sophie-h commented May 5, 2023

fintelia commented May 6, 2023

HeroicKatora commented May 6, 2023

johannesvollmer commented Sep 22, 2023

fintelia commented Sep 22, 2023 • edited

johannesvollmer commented Sep 22, 2023 • edited

johannesvollmer commented Sep 22, 2023 • edited

johannesvollmer commented Sep 25, 2023 • edited

fintelia commented Sep 27, 2023

fintelia commented Mar 3, 2023 •

edited

fintelia commented Sep 22, 2023 •

edited

johannesvollmer commented Sep 22, 2023 •

edited

johannesvollmer commented Sep 22, 2023 •

edited

johannesvollmer commented Sep 25, 2023 •

edited