Format image by lhparker1 · Pull Request #14 · PolymathicAI/AION

lhparker1 · 2025-05-22T21:04:00Z

Update AION code to have image preprocessing native in the codec

TODO: Clarify the difference between encode and quantize

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

fixing a bunch of things

EiffL · 2025-05-23T10:22:39Z

I've made a few changes, it's important to respect and not modify the base Codec class.

It's not yet passing the tests but almost

aion/codecs/tokenizers/base.py

LTMeyer

It's getting shapes! Thank you. I made a couple of comments.

aion/codecs/preprocessing/image.py

aion/codecs/preprocessing/band_to_index.py

aion/codecs/preprocessing/image.py

aion/codecs/tokenizers/base.py

LTMeyer · 2025-05-23T15:22:02Z

aion/codecs/tokenizers/base.py

-        return self.encode(x, channel_mask)
-
-
-class QuantizedCodec(Codec):


Why removing this class?
It made the distinction simpler between codec that rely on quantization and tokenizer that don't. It avoids the if statements in several methods. Do we actually have any tokenizer that is not using a quantizer?

I wanted to do minimal changes. I didn't want to change the codec APIs, but I guess that ship has sailed ^^'

can you just give me this one?

ok, I'll change ^^. But just to explain why I didn't want to change the API here:

My objective was not to make a clean code base here, for two reasons:

The more we refactor, the more work it is to refactor all tokenizers + the higher the chance of introducing difference in behavior

This code is going to stay 100% frozen, we are not going to reuse any of it moving forward. So, this is not the right place to do a lot of refactoring, that would be on the MMOMA side.

Making small edits on what relates to the directly relevant user-facing API (i.e. encode/decode) that makes sense from the perspective of simplifying the user experience, but the rest I would have preferred not to change anything.

Hummm actually, looking at the code, I'd rather not change it.

All the codecs we have for AION-1 are QuantizedCodec, no point in adding subclasses. I'll remove the 'is_quantized' statements if you want, but no need for more complexity

Co-authored-by: Lucas Meyer <LTMeyer@users.noreply.github.com>

LTMeyer and others added 30 commits April 7, 2025 12:04

Add base class for quantizers

83b5582

Add base class for tokenizers

23c75bf

Change arborescence for tokenizers

b250bf4

Add utils functions

dc75b8c

Add MagVitAE model

8389823

Add subsampler module

b51e963

Add FiniteScale quantizer

57926ab

Use quantizer encode instead of quantize

2733ce7

TODO: Clarify the difference between encode and quantize

Add MagVitAE image tokenizer

c190989

Add test for MagVitAE image tokenizer

cd1f705

Add tests to CI

71ea594

Fix github CI

2f531e0

Remove unnecessary package listing to fix tests

c22e5fb

Update aion/utils.py

7ec31f9

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Update aion/tokenizers/base.py

f4c3bff

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

Add ruff cache to gitignore

806da4b

Move tokenizers to dedicated codecs module

0b5c6e4

add notebooks

993bfd5

minor tweak

359c635

Merge branch 'main' into format_image

9b0c2ba

add image padder

7cdb176

fixes

0150f74

fixes in notebook

112b75d

port fixes to script

27d740a

minor updates

841ba42

add crop and rescaler

d3ab8bf

update minor tweaks

f016837

cleaning up range compression

791ca32

minor tweaks

08e32fd

remove test notebook

d8b8921

lhparker1 requested a review from EiffL May 22, 2025 21:04

EiffL and others added 7 commits May 23, 2025 01:37

fixing a bunch of things

eb3b6fe

fixing formatting

77b92b3

fixing dict

75ae1c2

fix

e394079

fix issue

b9b60d1

fix

de5c075

Merge pull request #15 from PolymathicAI/format_image_eiffl

d3f501c

fixing a bunch of things

improve things

1abb1f0

LTMeyer reviewed May 23, 2025

View reviewed changes

aion/codecs/tokenizers/base.py Outdated Show resolved Hide resolved

EiffL added 4 commits May 23, 2025 16:23

Removing dictionaries in favor of structured data

5ba2c1c

fix documentation

ca4ab14

adding dependency

7c25f1b

adjusting tests

d4a555f

LTMeyer requested changes May 23, 2025

View reviewed changes

EiffL and others added 4 commits May 23, 2025 17:30

Update aion/codecs/preprocessing/image.py

95b4550

Co-authored-by: Lucas Meyer <LTMeyer@users.noreply.github.com>

renamed things

b146695

Update aion/codecs/preprocessing/image.py

1887e17

Co-authored-by: Lucas Meyer <LTMeyer@users.noreply.github.com>

remove non-quantization

89f5158

EiffL merged commit 0ce9b47 into main May 23, 2025
2 checks passed

EiffL deleted the format_image branch May 23, 2025 22:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Format image#14

Format image#14
EiffL merged 46 commits intomainfrom
format_image

lhparker1 commented May 22, 2025

Uh oh!

EiffL commented May 23, 2025

Uh oh!

Uh oh!

LTMeyer left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LTMeyer May 23, 2025

Uh oh!

EiffL May 23, 2025

Uh oh!

EiffL May 23, 2025

Uh oh!

EiffL May 23, 2025

Uh oh!

EiffL May 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		return self.encode(x, channel_mask)


		class QuantizedCodec(Codec):

Conversation

lhparker1 commented May 22, 2025

Uh oh!

EiffL commented May 23, 2025

Uh oh!

Uh oh!

LTMeyer left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

LTMeyer May 23, 2025

Choose a reason for hiding this comment

Uh oh!

EiffL May 23, 2025

Choose a reason for hiding this comment

Uh oh!

EiffL May 23, 2025

Choose a reason for hiding this comment

Uh oh!

EiffL May 23, 2025

Choose a reason for hiding this comment

Uh oh!

EiffL May 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants