AugmentationSequential: accept sample dict as input #2119

adamjstewart · 2022-12-28T20:44:59Z

🚀 Feature

I would like AugmentationSequential to support a dictionary as input.

Motivation

I'm a TorchGeo developer. In TorchGeo, every dataset returns a sample dictionary like so:

sample = {
    "input": torch.tensor(...),
    "mask": torch.tensor(...),
    "bbox": torch.tensor(...),
    ...
}

(the exact key names don't match at the moment, but we're working on standardizing those)

Pitch

With the feature I'm envisioning, the following would work:

augs = AugmentationSequential(...)
sample = augs(sample)

The exact implementation details would still need to be worked out, but *args would go from Tensor to Union[Tensor, Dict[str, Tensor]]. The dictionary may contain keys that Kornia doesn't know how to support, and these should be ignored. If a sample dictionary contains a known key that the user doesn't want to transform, they can simply pass data_keys to override the default detection. If the input is a dict, the output should also be a dict. If implemented correctly, this feature will be backwards compatible with the old behavior so people can still pass these inputs in manually if they want to.

Alternatives

At the moment, to use Kornia augmentations, we have to use:

augs = AugmentationSequential(..., data_keys=["input", "mask", "bbox", ...])
sample["input"], sample["mask"], sample["bbox"], ... = augs(sample["input"], sample["mask"], sample["bbox"], ...)

As you can see, this is much more verbose than necessary. There's no reason we need to duplicate the list of keys so many times.
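Until dict input is supported, the duplication can at least be confined to one place with a small user-side helper. This is only a sketch: `apply_to_sample` is a hypothetical name, `fake_augs` stands in for an `AugmentationSequential` instance so the example runs without Kornia or PyTorch, and it assumes the pipeline returns one output per positional input.

```python
from typing import Any, Callable, Dict, Sequence


def apply_to_sample(
    augs: Callable[..., Sequence[Any]],
    sample: Dict[str, Any],
    keys: Sequence[str],
) -> Dict[str, Any]:
    """Call `augs` positionally on sample[keys] and write the results back."""
    outputs = augs(*(sample[key] for key in keys))
    updated = dict(sample)  # shallow copy; keys not listed are left untouched
    updated.update(zip(keys, outputs))
    return updated


def fake_augs(*tensors: Sequence[int]) -> list:
    """Stand-in pipeline (hypothetical): reverse every positional input."""
    return [list(reversed(t)) for t in tensors]


sample = {"input": [1, 2, 3], "mask": [0, 1, 1], "crs": "EPSG:4326"}
sample = apply_to_sample(fake_augs, sample, keys=["input", "mask"])
```

The key list is written once, and entries such as `"crs"` pass through unchanged.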

Additional context

If this is something you would be interested in, I would be happy to submit a PR to support this. Just wanted to gauge interest first before working on it.

@isaaccorley

shijianjian · 2022-12-29T08:12:58Z

It looks acceptable if you have a decent solution, like using override. If we accept a dict, I would like to support something like:

augs = AugmentationSequential(..., data_keys=None)  # so that data_keys do not have to be given up front
sample = augs(sample)

Another question: if we have multiple masks/bboxes for the input, how should we organize the input dict?

adamjstewart · 2022-12-29T16:57:04Z

> It looks acceptable if you have a decent solution, like using override.

Do you mean "overload" instead of "override"? Python doesn't support overloading functions, but you can overload their type hints. It should be possible for the function to be backwards compatible and still support dicts.
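To illustrate the point about overloading type hints, here is a minimal sketch using `typing.overload`. The `augment` function and the list-based `Tensor` alias are hypothetical stand-ins so the example runs without PyTorch; only the overload pattern itself is the point.

```python
from typing import Dict, List, Union, overload

# Stand-in for torch.Tensor (assumption, so the sketch has no dependencies).
Tensor = List[float]


@overload
def augment(sample: Tensor) -> Tensor: ...
@overload
def augment(sample: Dict[str, Tensor]) -> Dict[str, Tensor]: ...


def augment(
    sample: Union[Tensor, Dict[str, Tensor]],
) -> Union[Tensor, Dict[str, Tensor]]:
    """Single runtime implementation; the overloads only guide type checkers."""
    if isinstance(sample, dict):
        return {key: augment(value) for key, value in sample.items()}
    # Hypothetical "augmentation": negate every value.
    return [-x for x in sample]
```

A type checker will infer a dict result for dict input and a tensor result for tensor input, while existing callers that pass tensors keep working.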

> Another question: if we have multiple masks/bboxes for the input, how should we organize the input dict?

I would not bother supporting this for dict input. If a user needs multiple input/mask/bbox, they can simply use the old syntax.

shijianjian · 2023-01-01T06:52:12Z

We can proceed with dict support using overload as our first attempt. But preferably, I would like to see something like a dict with duplicated keys, or the same key with different values. XD

It looks like the latter is much more doable, but I do not want to use list values (since a user can pass a list of tensors).

johnnv1 · 2023-01-02T12:13:59Z

Given that we probably wouldn't have many duplicate values, I don't think it would be a performance problem to match any case with, for example: mask-1, mask-2, maskAnything, ... -- match any mask* case

shijianjian · 2023-01-02T13:24:00Z

The match-any strategy looks reasonable to me. Then a user can mark their inputs as mask-taskA, mask-taskB, etc. We can be more flexible compared to other libs.
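A rough sketch of what the match-any strategy could look like, assuming plain prefix matching. `BASE_KEYS`, `match_data_key`, and `resolve_keys` are hypothetical names, not Kornia API.

```python
from typing import Dict, Optional

# Hypothetical base names a pipeline might recognise.
BASE_KEYS = ("input", "image", "mask", "bbox", "keypoints")


def match_data_key(name: str) -> Optional[str]:
    """Return the base key that `name` starts with, or None if nothing matches."""
    # Try longer base names first so a longer prefix wins over a shorter one.
    for base in sorted(BASE_KEYS, key=len, reverse=True):
        if name.startswith(base):
            return base
    return None


def resolve_keys(sample: Dict[str, object]) -> Dict[str, str]:
    """Map each recognised dict key to its base key, skipping unknown keys."""
    resolved = {}
    for name in sample:
        base = match_data_key(name)
        if base is not None:
            resolved[name] = base
    return resolved


resolved = resolve_keys(
    {"input": 0, "mask-taskA": 1, "mask-taskB": 2, "maskAnything": 3, "crs": 4}
)
```

One trade-off of pure prefix matching is that an unrelated key such as `masked_area` would also be treated as a mask; requiring a separator after the base name would be stricter.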

I will try to finish the first refactor #2117 this week, then start implementing this feature on top of the new base. What do you think? @adamjstewart

edgarriba · 2023-01-06T12:19:40Z

i also envision something like this in the midterm

aug = AugmentationPipeline(...)
gen = DataGenerator(...)
net = MyModel(...)
loss = MyGeometricLoss(...)

gen.output_dict >> aug.input
(gen.image | gen.mask) >> aug.input  # or this to be more selective
aug.output >> net.image
(net.output | aug.transform_matrix) >> loss.input

this is kinda the strategy behind https://github.com/kornia/limbus
which is now under heavy refactor to fully support asyncio

@adamjstewart idk if something like that would be interesting for you guys
/cc @lferraz

adamjstewart · 2023-01-30T22:47:09Z

I tried to take a stab at implementing this but supporting data_keys = None and supporting dicts as input/output of inverse/forward is actually quite a large refactor. I can try to force it in there, but it might be better for the person who originally designed it to redesign it with dicts in mind. Do any core developers have any interest in this, or should I try to redesign it myself and minimize any unrequired API changes?

edgarriba · 2023-01-31T06:54:18Z

> I tried to take a stab at implementing this but supporting data_keys = None and supporting dicts as input/output of inverse/forward is actually quite a large refactor. I can try to force it in there, but it might be better for the person who originally designed it to redesign it with dicts in mind. Do any core developers have any interest in this, or should I try to redesign it myself and minimize any unrequired API changes?

This person is @shijianjian. We just did a huge refactor of the augmentations module, so it makes sense to keep improving it to support more features.

shijianjian · 2023-01-31T07:14:59Z

I think we can ignore the data_keys argument entirely here and just overwrite data_keys at runtime.

augs = AugmentationSequential(...)
sample = augs({"image-a": imagea, "image-b": imageb})

In the implementation, the forward signature would change to:

def forward(self, *input: Union[PREVIOUS_TYPE, Dict[str, Tensor]], data_keys: Optional = None) -> Union[PREVIOUS_TYPE, Dict[str, Tensor]]:
    # `input` is a tuple of positional arguments, so inspect its first element
    if len(input) == 1 and isinstance(input[0], dict):
        if data_keys is not None:
            raise ValueError("data_keys cannot be passed together with a dict input")
        data_keys = read_datakeys_from_dict(input[0])

    input = self._preproc_dict(...)
    RUN_ITERATIONS_HERE
    output = self._postproc_dict(...)

    return output

This should be straightforward to implement.

adamjstewart · 2023-02-06T18:59:47Z

@shijianjian do you want to submit a PR for this or should I try to hack on it? I have something with 10x as many lines of code changed and it still doesn't work because I'm mapping dicts to lists but I'm not yet mapping them back. It's not just data_keys that needs to change, it's also transform_op (which uses data_keys) that will need to be set dynamically. I can open a draft PR if you want to see my current solution but it's pretty ugly.

shijianjian · 2023-02-07T06:15:49Z

@adamjstewart I may not work on this since it is not a critical feature. The only thing you need to implement is the mapping of the data keys. You do not need to handle transform_op; it should read the input data_keys and perform the augmentations automatically.

def forward(self, *input: Union[PREVIOUS_TYPE, Dict[str, Tensor]], data_keys: Optional = None) -> Union[PREVIOUS_TYPE, Dict[str, Tensor]]:
    # `input` is a tuple of positional arguments, so inspect its first element
    if len(input) == 1 and isinstance(input[0], dict):
        if data_keys is not None:
            raise ValueError("data_keys cannot be passed together with a dict input")
        data_keys = read_datakeys_from_dict(input[0])

    input = self._preproc_dict(...)
    RUN_ITERATIONS_HERE
    output = self._postproc_dict(...)

    return output

As shown here, implementing read_datakeys_from_dict, _preproc_dict, and _postproc_dict should be enough to make it work.
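For illustration, the three helpers named above might look roughly like the following. This is a sketch under stated assumptions, not the code that was eventually merged: the helpers are written as free functions, `KNOWN_KEYS` is a hypothetical set of recognised names, and the augmentation loop is replaced by a stand-in that doubles each value so the example runs without Kornia or PyTorch.

```python
from typing import Any, Dict, List, Optional, Tuple

KNOWN_KEYS = ("input", "image", "mask", "bbox")  # hypothetical


def read_datakeys_from_dict(sample: Dict[str, Any]) -> List[str]:
    """Keep, in dict order, the keys the pipeline recognises."""
    return [key for key in sample if key in KNOWN_KEYS]


def preproc_dict(sample: Dict[str, Any], data_keys: List[str]) -> Tuple[Any, ...]:
    """Flatten the dict into the positional form the existing code expects."""
    return tuple(sample[key] for key in data_keys)


def postproc_dict(
    sample: Dict[str, Any], data_keys: List[str], outputs: Tuple[Any, ...]
) -> Dict[str, Any]:
    """Write transformed values back, leaving unrecognised keys untouched."""
    result = dict(sample)
    result.update(zip(data_keys, outputs))
    return result


def forward(*inputs: Any, data_keys: Optional[List[str]] = None) -> Any:
    is_dict = len(inputs) == 1 and isinstance(inputs[0], dict)
    if is_dict:
        if data_keys is not None:
            raise ValueError("data_keys cannot be combined with a dict input")
        sample = inputs[0]
        data_keys = read_datakeys_from_dict(sample)
        inputs = preproc_dict(sample, data_keys)

    # Stand-in for the real augmentation loop: double every value.
    outputs = tuple(value * 2 for value in inputs)

    # Dict in, dict out; positional in, positional out.
    return postproc_dict(sample, data_keys, outputs) if is_dict else outputs
```

Calling `forward({"image": 1, "mask": 2, "name": "a"})` transforms only the recognised keys, while `forward(1, 2)` keeps the old positional behaviour.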

adamjstewart added the "help wanted" label Dec 28, 2022

adamjstewart mentioned this issue Jan 30, 2023

DataKey: add 'image' as alias of 'input' #2193

Merged

adamjstewart mentioned this issue Feb 2, 2023

RasterDataset support for nodata masks. microsoft/torchgeo#1078

Open

adamjstewart mentioned this issue Jun 22, 2023

USAVars Augmentation maps to 0 microsoft/torchgeo#1432

Open

johnnv1 mentioned this issue Feb 11, 2024

Aug: Add simple support to dict for AugSequential #2799

Merged

johnnv1 closed this as completed in #2799 Feb 23, 2024
