Fix LZO decompression #28

Schamper · 2023-06-20T13:41:44Z

codecov · 2023-06-20T13:43:07Z

Codecov Report

Merging #28 (8351190) into main (1e6dcd2) will increase coverage by 0.03%.
The diff coverage is 86.00%.

@@            Coverage Diff             @@
##             main      #28      +/-   ##
==========================================
+ Coverage   84.97%   85.00%   +0.03%     
==========================================
  Files          16       16              
  Lines        1118     1107      -11     
==========================================
- Hits          950      941       -9     
+ Misses        168      166       -2

Flag	Coverage Δ
unittests	`85.00% <86.00%> (+0.03%)`	⬆️

Flags with carried forward coverage won't be shown.

Impacted Files	Coverage Δ
dissect/util/compression/lzo.py	`82.85% <86.00%> (+0.14%)`	⬆️


Miauwkeru · 2023-06-29T12:29:45Z

dissect/util/compression/lzo.py

@@ -58,54 +46,49 @@ def decompress(src: Union[bytes, BinaryIO], header: bool = True, buflen: int = -
    val = src.read(1)[0]


Wouldn't something like this add a bit more readability to the whole thing?

    LZO_VERSION_1 = 0x10
    LZO_VERSION_2 = 0x11


    def _determine_lzo_version(src: io.BytesIO, dst: bytearray) -> int:
        """Determines the LZO version of the src.

        Returns:
            The next value in LZO Stream
        """
        val = src.read(1)[0]
        if val == LZO_VERSION_1:
            raise ValueError("LZOv1")
        elif val > LZO_VERSION_2:
            # LZO is a stream
            dst += src.read(val - LZO_VERSION_2)
            val = src.read(1)[0]

            if val < LZO_VERSION_1:
                raise ValueError("Invalid LZO stream")

        return val

This turns out to be not quite correct. See also https://docs.kernel.org/staging/lzo.html and https://github.com/torvalds/linux/blob/master/lib/lzo/lzo1x_decompress_safe.c.

It might be good to add these references to the code.
I looked a bit at making the code more readable, but that is far from trivial. Maybe add a comment to each if statement noting which byte encoding it handles.

Added them as references
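
For reference while commenting the branches, here is a rough sketch of how the first byte of a stream can be classified, based on a reading of the linked kernel documentation and `lzo1x_decompress_safe.c`. The function name and the returned labels are hypothetical and not part of `dissect.util`. It may also help explain why naming 0x10 and 0x11 as versions is not quite right: 17 acts as the threshold for an initial literal run rather than as an LZO version number.

```python
def classify_first_byte(val: int) -> str:
    """Label the first byte of an LZO1X stream (hypothetical helper, for illustration only)."""
    if not 0 <= val <= 255:
        raise ValueError("expected a single byte value")

    if val <= 16:
        # Decoded like any other instruction byte by the main loop.
        return "instruction"

    if val == 17:
        # In the kernel decoder, 17 can mark a versioned bitstream (the next
        # byte is the version) when the input is at least 5 bytes long;
        # otherwise it is treated as a regular instruction.
        return "version-marker"

    # 18..255: an initial run of (val - 17) literal bytes follows.
    return f"literal-run:{val - 17}"
```

Labels like these could be turned into one-line comments above the corresponding `if` branches in `decompress`.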

pyrco

see comment

Miauwkeru

Would it be an idea to add some comments about which instruction encoding is used?
Including the first byte encodings

Miauwkeru · 2023-07-04T07:40:50Z

dissect/util/compression/lzo.py


-    trailing = 0
+    if val > 17:
+        dst += src.read(val - 17)


Wouldn't this need an additional state variable?
According to the documentation you provided, state is already assigned at the first byte.

state looks unused there: https://github.com/FFmpeg/FFmpeg/blob/master/libavutil/lzo.c#L157
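
To make the `state` question concrete, here is a minimal sketch of what the kernel decoder appears to do with a first byte above 17: it copies `val - 17` literals and remembers how many were copied, capped at 4, as the state for the next instruction. This is an assumption based on reading `lzo1x_decompress_safe.c`, not the code in this PR, and `read_initial_literals` is a hypothetical name.

```python
import io
from typing import BinaryIO


def read_initial_literals(src: BinaryIO, dst: bytearray) -> int:
    """Copy the initial literal run into dst and return the resulting state (0..4 scale, here 1..4)."""
    first = src.read(1)
    if not first:
        raise EOFError("empty LZO stream")

    val = first[0]
    if val <= 17:
        # Not a literal-run first byte; a real decoder would fall through
        # to regular instruction decoding instead of raising.
        raise ValueError("first byte does not start a literal run")

    length = val - 17
    literals = src.read(length)
    if len(literals) != length:
        raise EOFError("truncated literal run")

    dst += literals
    # Runs of 1..3 literals leave state = length; longer runs leave state = 4.
    return min(length, 4)


dst = bytearray()
state = read_initial_literals(io.BytesIO(bytes([19]) + b"ab"), dst)
# state == 2, dst == bytearray(b"ab")
```

Whether a port needs to carry this value explicitly depends on how it structures the main decoding loop.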

Schamper · 2023-07-04T16:56:01Z

Would it be an idea to add some comments about which instruction encoding is used?
Including the first byte encodings

You seem to understand the algorithm better than me by now so feel free to make some suggestions 😄

Co-authored-by: Miauwkeru <Miauwkeru@users.noreply.github.com>

Fix LZO decompression

6210c13

Schamper requested a review from pyrco June 20, 2023 13:41

Schamper self-assigned this Jun 20, 2023

Schamper requested a review from Miauwkeru June 22, 2023 13:16

Miauwkeru reviewed Jun 29, 2023

Address comments

27f8734

pyrco reviewed Jul 4, 2023

Add more references

978ffd4

Schamper requested review from pyrco and Miauwkeru July 4, 2023 10:32

Miauwkeru reviewed Jul 4, 2023

Miauwkeru reviewed Jul 5, 2023

Schamper and others added 2 commits July 5, 2023 14:35

Apply suggestions from code review

377e26d

Co-authored-by: Miauwkeru <Miauwkeru@users.noreply.github.com>

Tweaks

2ffd5df

Schamper requested a review from Miauwkeru July 5, 2023 12:35

Miauwkeru approved these changes Jul 5, 2023

Merge branch 'main' into fix-lzo

8351190

Schamper merged commit 931f5ca into main Jul 5, 2023
18 checks passed

Schamper deleted the fix-lzo branch July 5, 2023 12:41
