Refactored report format to use Pydantic #1258

garrett-zetier · 2025-09-09T12:50:59Z

Resolves #1250

Overview

These changes update the JSON report format (--report option) so that report data is serialized to JSON with Pydantic models rather than attrs. This makes deserialization easier in the future, and a Pydantic TypeAdapter is provided to read the JSON files back into Python objects. A test has been added to cover deserialization.
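As a rough illustration of the round trip described above, here is a minimal sketch assuming Pydantic v2. The model names (ChunkReport, ErrorReport), their fields, and the adapter name are hypothetical stand-ins, not the classes defined in python/unblob/report.py.

```python
from typing import Annotated, List, Literal, Union

from pydantic import BaseModel, Field, TypeAdapter


# Hypothetical report models; the real ones live in python/unblob/report.py.
class ChunkReport(BaseModel):
    report_type: Literal["chunk"] = "chunk"
    start_offset: int
    end_offset: int


class ErrorReport(BaseModel):
    report_type: Literal["error"] = "error"
    message: str


# The report_type field tells Pydantic which model to build when reading JSON.
AnyReport = Annotated[
    Union[ChunkReport, ErrorReport], Field(discriminator="report_type")
]

# A TypeAdapter serializes and validates types that are not models themselves,
# such as a list of mixed report objects.
report_list_adapter = TypeAdapter(List[AnyReport])

reports = [
    ChunkReport(start_offset=0, end_offset=512),
    ErrorReport(message="truncated input"),
]

json_bytes = report_list_adapter.dump_json(reports, indent=2)
decoded = report_list_adapter.validate_json(json_bytes)
```

After the round trip, decoded holds typed model instances again rather than plain dictionaries, which is the property the new deserialization test is meant to check.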

Testing

Update your development environment first, since pydantic has been added as a dependency.

The tests have been updated to match the new report format, so running them verifies these changes. Specifically, tests/test_models.py (run uv run pytest tests/test_models.py -v) verifies both serialization and deserialization.

You can run unblob with the --report option to manually verify the JSON report. For example:

uv run unblob --report report.json my.bin

qkaiser

first quick pass over the code, some of my colleagues will also have a look :)

tests/test_models.py

python/unblob/report.py

qkaiser · 2025-09-09T13:18:38Z

Code looks ok, let's run the pipeline to see what fails.

qkaiser

Don't forget to run pre-commit before pushing code:

pre-commit install
pre-commit run -a

You can also check typing by running pyright from the repo root.

vulture_whitelist.py

python/unblob/report.py

vlaci · 2025-09-11T14:43:44Z

I went over the changes and I am mostly happy with them, good job!

The base64 encoding of process output is cool, as it is more portable than Python's string escapes.
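To illustrate the portability point, here is a small standard-library sketch. The byte string and the "stderr" key are made up for the example and are not taken from the actual report schema.

```python
import base64
import json

# Process output is raw bytes and is not guaranteed to be valid UTF-8.
raw_stderr = b"error: bad header \xff\xfe\x00\n"

# Base64 maps arbitrary bytes to plain ASCII, which any JSON parser accepts.
encoded = base64.b64encode(raw_stderr).decode("ascii")
payload = json.dumps({"stderr": encoded})

# Any language with a base64 decoder can recover the exact original bytes.
restored = base64.b64decode(json.loads(payload)["stderr"])
```

A Python repr-style escape such as b'\xff' would instead require a Python-aware parser on the reading side.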

I disliked that we need to fight with the type checker over the discriminator values, which introduces a somewhat convoluted inheritance hierarchy. My suggestion would be to use callable discriminators, eliminating the need for the report_type field on the models themselves.

I've played around with how it would look here: garrett-zetier/unblob@1250-pydantic-report...onekey-sec:unblob:pull/1250/review
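For readers unfamiliar with callable discriminators, the following is a minimal sketch of the general technique, assuming Pydantic 2.5 or later. It is not the code from the linked branch: the model names, the "__typename__" key, and the helper report_tag are all hypothetical.

```python
from typing import Annotated, Any, List, Optional, Union

from pydantic import BaseModel, Discriminator, Tag, TypeAdapter, model_serializer


class ReportBase(BaseModel):
    # Hypothetical base class: adds the class name to the serialized output,
    # so the models need no explicit report_type field.
    @model_serializer(mode="wrap")
    def _add_type_name(self, handler):
        data = handler(self)
        data["__typename__"] = type(self).__name__
        return data


class ChunkReport(ReportBase):
    start_offset: int
    end_offset: int


class ErrorReport(ReportBase):
    message: str


def report_tag(value: Any) -> Optional[str]:
    """Pick the union member for raw JSON data (dict) or a model instance."""
    if isinstance(value, dict):
        return value.get("__typename__")
    return type(value).__name__


AnyReport = Annotated[
    Union[
        Annotated[ChunkReport, Tag("ChunkReport")],
        Annotated[ErrorReport, Tag("ErrorReport")],
    ],
    Discriminator(report_tag),
]

adapter = TypeAdapter(List[AnyReport])
reports = [
    ChunkReport(start_offset=0, end_offset=512),
    ErrorReport(message="truncated input"),
]
decoded = adapter.validate_json(adapter.dump_json(reports))
```

The discriminator function runs on both raw input and model instances, so the type information lives in one place instead of being repeated as a literal field on every model.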

tests/test_models.py

-        }
+        assert decoded_report == report
+
+        decoded_report = json.loads(json_text)


garrett-zetier · 2025-09-11T17:25:13Z

I like your implementation of the report models much better. It is much cleaner. Let me know if you want me to continue with those changes on my fork.

One other thing that seems to be missing is that the CI tests are failing because the pydantic package is missing from some list of dependencies. I'm not sure where this needs to be added (package.nix?).

vlaci · 2025-09-12T08:47:41Z

> I like your implementation of the report models much better. It is much cleaner. Let me know if you want me to continue with those changes on my fork.

If you are up to it, then please do so. If you say that you've done enough, that's fine as well, we can take over.

> One other thing that seems to be missing is that the CI tests are failing because the pydantic package is missing from some list of dependencies. I'm not sure where this needs to be added (package.nix?).

Yes, it is missing from the dependencies list in package.nix.

garrett-zetier · 2025-09-12T11:20:24Z

@vlaci I'll take a look and finish it up. Thank you guys for assisting!

garrett-zetier · 2025-09-12T14:47:12Z

I believe everything should be addressed. All tests and pipelines should pass now.

tests/test_report.py

vlaci

I am happy with the current state of the code.

garrett-zetier · 2025-09-15T11:13:35Z

Thanks everyone for their assistance and time reviewing! I am looking forward to these changes in a future release.

garrett-zetier · 2025-09-15T11:41:59Z

I have rebased from the latest on the upstream main branch. Is there anywhere I should document usage of the updated report with Pydantic?

vlaci · 2025-09-15T12:48:31Z

> I have rebased from the latest on the upstream main branch. Is there anywhere I should document usage of the updated report with Pydantic?

We don't publish API docs, and this change is API-only, so I don't think that this change needs extra documentation.

qkaiser requested review from qkaiser and vlaci September 9, 2025 13:06

qkaiser self-assigned this Sep 9, 2025

qkaiser added the enhancement, python, and dependencies labels Sep 9, 2025

qkaiser reviewed Sep 9, 2025

qkaiser reviewed Sep 10, 2025

vulture_whitelist.py

python/unblob/report.py (outdated)

python/unblob/report.py (outdated)

python/unblob/report.py (outdated)

github-advanced-security bot found potential problems Sep 11, 2025

tests/test_models.py

}

assert decoded_report == report

decoded_report = json.loads(json_text)

Check notice: Code scanning / CodeQL

Unused local variable (Note, test)

Variable decoded_report is not used.

e3krisztian reviewed Sep 12, 2025

tests/test_report.py

vlaci approved these changes Sep 15, 2025

garrett-zetier and others added 9 commits September 15, 2025 07:35

Refactored report format to use Pydantic.

dd53b1e

Added TypeAdapter for deserialization from JSON report. Added test for deserialization.

204d1a7

Minor update to documentation on TypeAdapter. Fixed Pydantic definitions to support Python 3.9.

b1858f1

Refactored encode and decode for ExtractCommandFailedReport.

26d58ea

Updated vulture whitelist with exceptions for Pydantic models in report.

93baab8

Fixed minor formatting issues related to pre-commit checks and refactored vulture whitelisting for report.

7300fc4

Resolved typing issues in file_utils.py

8193124

Review suggestions

e637ab6

Updates related to report model refactor.

965fa39

garrett-zetier force-pushed the 1250-pydantic-report branch from fbacc68 to 965fa39 Compare September 15, 2025 11:37

vlaci added this pull request to the merge queue Sep 15, 2025

Merged via the queue into onekey-sec:main with commit 460b835 Sep 15, 2025
28 checks passed

4 participants