Disallow invalid pointers in arrays and tuples #226
Conversation
@@ -131,6 +132,13 @@ def __call__(self, stream: ContextFramesBytesIO) -> Any:


class HeadTailDecoder(BaseDecoder):
    """
💯
@to_tuple  # type: ignore[misc]  # untyped decorator
def decode(self, stream: ContextFramesBytesIO) -> Generator[Any, None, None]:
    self.validate_pointers(stream)
I could use more context here. Could this be called in the loop below, and maybe allow removal of the inner decoder loops inside validate_pointers? I'm also curious whether the validation is necessary before decoding. Could validation just be part of the decode in HeadTailDecoder?
There is no way to know how long the head section of a dynamic tuple will be until you have stepped through each decoder: if the decoder is for a dynamic type, it will be 32 bytes every time (because it's a pointer), but if it's for a non-dynamic array, there will be a single decoder for multiple chunks of 32 bytes.
I think it would be possible to take the logic from validate_pointers and put it in decode, to eliminate the second loop through the decoders (where it actually checks the pointer values against the end_of_offsets). I like the current clarity and separation of concerns, but I can try it if you like.
The validation needs to be in the tuple and array decoders, because only they have the context for how long they are. A HeadTailDecoder only has the info for a single dynamic value.
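The head-section point above can be sketched as a toy calculation (a hypothetical helper, not the actual eth-abi code): each dynamic component contributes a single 32-byte pointer slot, while a static fixed-size array is laid out inline, one 32-byte word per element, so the head length is only known after stepping through every component decoder.

```python
# Hypothetical sketch, not the eth-abi implementation: compute the
# head-section length of a tuple by stepping through its components.
def head_section_length(components):
    """components: list of (is_dynamic, element_count) pairs."""
    total = 0
    for is_dynamic, element_count in components:
        if is_dynamic:
            total += 32  # a single 32-byte pointer slot, regardless of value size
        else:
            total += 32 * element_count  # static data is laid out inline
    return total

# e.g. a (bool, bytes, uint256[3]) tuple -> 32 + 32 + 96 = 160 bytes of head
print(head_section_length([(False, 1), (True, 1), (False, 3)]))  # 160
```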
I see now what the difference means. Assuming there may never be more than a few decoders at a time, I don't have any concerns.
end_of_offsets = current_location + 32 * len_of_head
total_stream_length = len(stream.getbuffer())
for decoder in self.decoders:
    if isinstance(decoder, HeadTailDecoder):
Nit: It would be nice to share this logic across decoders. Maybe this could become a utility function that takes the stream and an array_size, which could be called from here with array_size=1.
Nit heard and politely declined. There is enough required difference in how tuples and arrays are checked that any extracted logic would have a lot of if tuple / elif array branching. And I don't foresee any future data structures being created that would make use of such shared base methods, so I accept code that is ~repeated twice.
Looks good to me! Nice work tracking it down! 🐞 I like the comments you made in the decoder too. Very helpful.
What was wrong?
Incorrect pointer values can cause problems. If a pointer value is too small, i.e. it points to an area of the payload that is still within the pointers section, the encoding is malformed. In certain situations, ~infinite loops can occur.
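As a toy illustration of the malformed case (hypothetical numbers and helper, not the library's actual check): for a dynamic tuple whose head holds two 32-byte offsets, any offset smaller than 64 points back into the head section itself, and any offset past the payload end is equally invalid.

```python
# Toy illustration of the malformed-pointer cases described above.
HEAD_SIZE = 2 * 32  # e.g. a (bytes, bytes) tuple head: two 32-byte offset slots

def is_valid_offset(offset, total_length):
    # A pointer must land at or past the head/tail boundary and
    # must not point beyond the end of the payload.
    return HEAD_SIZE <= offset <= total_length

payload_length = 192
print(is_valid_offset(64, payload_length))   # True: first byte of the tail
print(is_valid_offset(0, payload_length))    # False: points back into the head
print(is_valid_offset(500, payload_length))  # False: past the end of the payload
```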
How was it fixed?
When decoding pointers, determine the location in the stream that divides pointers and values and make sure all pointers point past that location. Also check for pointers that point beyond the end of the payload.
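The fix described above might be sketched roughly like this (hypothetical names and simplified logic, not the actual PR code): compute the boundary between the pointers section and the values section, then reject any pointer that lands before that boundary or beyond the end of the stream.

```python
import io

# Simplified sketch of the validation pass; offsets are assumed to be
# relative to the start of this tuple's encoding in the stream.
def validate_pointers(stream: io.BytesIO, num_pointers: int) -> None:
    start = stream.tell()
    end_of_offsets = start + 32 * num_pointers  # head/tail boundary
    total_length = len(stream.getbuffer())
    for _ in range(num_pointers):
        offset = int.from_bytes(stream.read(32), "big")
        # Reject pointers into the head section or past the payload end.
        if not end_of_offsets <= start + offset <= total_length:
            raise ValueError(f"Invalid pointer: {offset}")
    stream.seek(start)  # rewind so decoding proceeds from the head

# Well-formed two-pointer head: offsets 64 and 128 both land in the tail.
good = io.BytesIO((64).to_bytes(32, "big") + (128).to_bytes(32, "big") + b"\x00" * 128)
validate_pointers(good, 2)  # passes silently
```

A payload whose first offset were 0 would point back into the head section and raise ValueError up front, instead of sending the decoder into a loop.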
Added some code comments to make it easier to remember how HeadTailDecoder works.
Added pytest-timeout to dependencies; if the new tests are run without the added offset checking, they'll spin for a long time before failing.
Todo:
Clean up commit history
Clear any breakpoints
Clean up testing
Add or update documentation related to these changes
Add entry to the release notes
Cute Animal Picture