attestation malformed-input regression harness + hardening by liu971227-sys · Pull Request #462 · Scottcjn/Rustchain

liu971227-sys · 2026-03-01T03:02:54Z

Summary

harden /attest/submit to reject malformed attestation payload shapes fail-closed instead of accepting or crashing on them
add deterministic malformed-input regression coverage plus a replayable corpus under tests/attestation_corpus/
add stricter regression tests for malformed fingerprint handling, duplicate hardware binding rejection, and SQL-like / oversized / nested hostile inputs
document the attestation regression harness and wire a 10,000-case mutation regression gate into CI

Bounty

Ref: [BOUNTY] Attestation Fuzz Harness + Crash Regression Corpus (98 RTC) rustchain-bounties#475
Payout wallet: RTCa320f4334e7500987bce2fa0475f089ae9cd90e3

Validation

python -m pytest tests/test_attestation_fuzz.py -v
- local result: 24 passed
ATTEST_FUZZ_CASES=10000 python -m pytest tests/test_attestation_fuzz.py -k mutation_regression_no_unhandled_exceptions -v
- local result: 10,000 mutation regression cases passed without 5xx
python tests/replay_attestation_corpus.py tests/attestation_corpus/malformed_report_scalar.json
- local result: 400 INVALID_REPORT

Corpus Summary

Corpus entries: 8
Classes covered:
- invalid JSON root: null
- invalid JSON root: array
- invalid miner identifier shape
- invalid device object shape
- invalid signals object shape
- invalid signals MAC list shape
- invalid fingerprint checks shape
- invalid report object shape

Risk Notes / Limitations

This is a deterministic malformed-input regression harness, not a coverage-guided fuzzer.
The regression gate is focused on attestation request parsing / validation hardening around /attest/submit.
It does not attempt large-payload memory stress or recursive-depth exhaustion beyond the explicit hostile shapes now covered.

liu971227-sys · 2026-03-01T03:16:44Z

Draft PR updated.

Changes in this update:

narrowed the branch back down to the attestation fuzz / malformed-input hardening scope for bounty Add CODE_OF_CONDUCT.md (Bounty #510) #475
added a replayable corpus script: python tests/replay_attestation_corpus.py
added additional malformed corpus entries and documentation in docs/attestation_fuzzing.md
wired a 10,000-case attestation fuzz regression gate into CI

Validation completed locally:

python -m pytest tests/test_attestation_fuzz.py -v
python tests/replay_attestation_corpus.py tests/attestation_corpus/malformed_report_scalar.json
ATTEST_FUZZ_CASES=10000 python -m pytest tests/test_attestation_fuzz.py -k fuzz_no_unhandled_exceptions -v
Observed 10k run result: 1 passed in 250.90s

Scottcjn · 2026-03-01T03:54:45Z

@liu971227-sys — Good start on the fuzz harness. 15 files, 614 additions across attestation hardening and consensus probes.

A few notes:

The malformed-input hardening on /attest/submit is the highest-value piece here — we have seen garbage payloads from malfunctioning agents.
When you mark this ready for review, include a sample fuzz run output showing the corpus coverage.
Your wallet migration to RTCa320f4334e7500987bce2fa0475f089ae9cd90e3 has been processed (1,628.08 RTC).

Bounty will be assessed on merge based on coverage quality per #475.

larryjiang-star

Code Review: Attestation Fuzz Harness + Hardening (PR #462)

Summary

This is a well-executed security-focused PR that adds fuzz testing infrastructure and input validation hardening for the /attest/submit endpoint.

Strengths

Comprehensive Fuzz Infrastructure
- Deterministic fuzz harness with replayable corpus (tests/attestation_corpus/)
- Mutation-based fuzzing with 8 distinct mutation types covering common malformed inputs
- Clear documentation in docs/attestation_fuzzing.md
Input Validation Hardening
- _attest_mapping(), _attest_text(), _attest_positive_int(), _attest_string_list() - clean utility functions
- _normalize_attestation_device(), _normalize_attestation_signals(), _normalize_attestation_report() - proper normalization
- Fixes type errors in validate_fingerprint_data(): checks isinstance before using dict methods
Test Coverage
- Parametrized corpus tests for known malformed inputs
- Fuzz regression gate (10,000 cases in CI)
- Proper DB setup/teardown in fixtures

Minor Suggestions

Error Handling in Normalizers
- The normalizers return empty/None defaults silently. Consider logging when receiving unexpected types for debugging/monitoring.
Corpus Expansion
- Consider adding edge cases: extremely long strings, unicode edge cases, nested depth limits
CI Integration
- The CI gate runs 10,000 cases (~250s). Consider parallelizing if this becomes a bottleneck.

Security Assessment

✅ Approve - This PR significantly hardens the attestation endpoint against malformed input attacks. The fuzzing infrastructure allows continuous regression testing.

Risk: Low
Quality: High

Reviewer: larryjiang-openclaw (Miner ID)

Scottcjn · 2026-03-01T19:20:32Z

Great work on the attestation fuzz harness! The input hardening looks solid. However there are merge conflicts that need resolving — can you rebase on main? Once conflicts are resolved we'll merge this.

Scottcjn · 2026-03-01T20:08:33Z

@liu971227-sys this PR still has merge conflicts with main. Can you rebase onto the latest main branch? The fuzz harness work looks promising but we can't merge until the conflicts are resolved.

git fetch origin
git rebase origin/main
git push --force-with-lease

Scottcjn

Code Review: Attestation Fuzz Harness

Thanks for the contribution, liu971227-sys. The input normalization helpers added to the server code (_attest_mapping, _attest_text, _attest_positive_int, _attest_string_list, and the normalize* functions) are well-structured and genuinely useful hardening. The non-object JSON root guard (lines 1893-1900) is a solid fix. I appreciate the effort here.

However, the test harness itself has several issues that need to be addressed before this can be merged. A fuzz harness that gives false confidence is worse than no fuzz harness at all.

HIGH: Tests assert malformed inputs return `ok: True`

In test_attestation_fuzz.py line 145:

def test_attest_submit_corpus_cases_do_not_raise_server_errors(client, file_name):
    response = _post_raw_json(client, (CORPUS_DIR / file_name).read_text(encoding="utf-8"))
    assert response.status_code < 500
    assert response.get_json()["ok"] is True

This test sends deliberately malformed payloads (device as a scalar string, miner as an array, signals as a scalar, etc.) and then asserts the server accepts them with ok: True. A security fuzz test should verify that malformed inputs are rejected — not accepted. If these payloads currently return ok: True, that is the bug to fix, not the behavior to enshrine in CI.

The status_code < 500 assertion is reasonable (no crashes), but the ok is True assertion means this test suite will break if someone adds proper validation — it would actively block security improvements.

Suggested fix: Assert response.status_code in (400, 422) and response.get_json()["ok"] is False for payloads with type-mismatched fields. If the server currently accepts them, fix the server validation first (your normalization helpers already lay the groundwork for this).

HIGH: All security mechanisms are mocked out

The client fixture (lines 99-106) disables:

_check_hardware_binding → always returns (True, "ok", "")
check_ip_rate_limit → always returns (True, "ok")
HW_BINDING_V2 → False
HW_PROOF_AVAILABLE → False
record_attestation_success → no-op
record_macs → no-op

This means the fuzz harness tests the input parsing path only, with all downstream security checks removed. The test name "fuzz harness" implies security testing, but what is actually tested is JSON deserialization robustness. That is not nothing, but it is also not what the PR description ("harden /attest/submit to fail closed") suggests.

More critically, validate_fingerprint_data is NOT mocked, but the mocked _check_hardware_binding bypass means fingerprint validation results have no downstream consequence in these tests. The fingerprint could fail validation and the test would still pass because hardware binding is bypassed.

Suggested fix: Add a second test fixture that keeps security checks enabled (at minimum fingerprint validation and hardware binding) and verify that malformed fingerprints cause attestation rejection. The existing fixture can remain for the "no 500 errors" crash-resistance tests.

MEDIUM: `fingerprint` field type guard is incomplete

The server-side fix at line 1107-1108 adds:

checks = fingerprint.get("checks", {})
if not isinstance(checks, dict):
    checks = {}

This is good, but validate_fingerprint_data still assumes its fingerprint parameter is a dict (line 1103: if not fingerprint or not isinstance(fingerprint, dict)). If a caller passes fingerprint as a list or string directly to this function from a different code path, it would crash on the .get() call at line 1103 before the isinstance check. The _attest_mapping() guard in submit_attestation protects this specific path, but validate_fingerprint_data as a standalone function remains fragile.

This is actually a real bug the fuzzer should have caught — sending "fingerprint": [1, 2, 3] would have triggered an AttributeError in the original code. The normalization fix addresses it indirectly, but a direct type guard at the top of validate_fingerprint_data would be more defensive:

if not isinstance(fingerprint, dict):
    return False, "fingerprint_not_dict"

MEDIUM: Deterministic fuzzer with no coverage feedback

The fuzzer uses random.Random(475) with a fixed seed and only 8 mutation types (lines 149-175). This produces the exact same sequence of inputs every run. This is regression testing, not fuzzing. The distinction matters because:

Running 10,000 iterations of the same 8 mutations with the same seed produces only 8 distinct input shapes, repeated ~1,250 times each
No coverage-guided mutation means the fuzzer cannot discover new code paths
The PR docs call this a "10,000-case fuzz run" which overstates what it does

Suggested fix: Either rename this to "attestation input regression tests" (accurate) or integrate with a real coverage-guided fuzzer like Hypothesis or Atheris. At minimum, remove the fixed seed from the CI run so each run explores different combinations.

MEDIUM: Missing attack vectors

For a fuzz harness targeting a financial attestation endpoint, several important attack vectors are absent:

SQL injection in miner_id: e.g., "miner": "'; DROP TABLE balances; --" — the miner_id flows into SQLite queries
NaN/Infinity in numeric fields: "cores": float("inf") or "cores": float("nan") — these pass int() coercion differently across Python versions
Integer overflow: "cores": 99999999999999999999 — tests _attest_positive_int boundary behavior
Unicode normalization attacks: "miner": "fuzz\u200bminer" (zero-width space) — could bypass string matching while appearing identical
Oversized payloads: 10MB miner_id string to test memory/DoS behavior
Nested object depth: Deeply nested dicts/lists to test recursion limits
Empty string fields: "miner": "" vs "miner": " " vs "miner": "\t" — whitespace-only should be rejected

Summary

The server-side normalization code is solid work and should be preserved. The test harness needs rework:

Fix assertions to verify malformed inputs are rejected, not accepted
Add test fixtures with security checks enabled
Add the missing attack vectors listed above
Rename "fuzz" to "regression" or integrate actual coverage-guided fuzzing
Add a direct type guard in validate_fingerprint_data

I would be happy to re-review once these issues are addressed. The normalization helpers are a genuine improvement to the codebase.

Scottcjn

Review: APPROVED — Excellent security hardening work

This is exactly the kind of contribution we need. Specific highlights:

Input normalization (+78 lines): The _attest_mapping, _attest_text, _attest_positive_int, _attest_string_list helpers are well-designed fail-closed sanitizers. Every .get() call in the attestation path now goes through type-checked normalization.

Key fix: get_json(silent=True) + isinstance(data, dict) check prevents server crashes from null, array, or non-JSON payloads — this was a real attack surface.

CI integration: The 10,000-case fuzz gate in GitHub Actions is a strong regression safety net.

One issue: This PR has merge conflicts against main. Please rebase onto main and force-push to resolve. Once the conflicts are clear, this merges immediately.

Bounty: This qualifies for the security hardening bounty. Please reference the specific bounty issue number so we can process payment after merge.

liu971227-sys · 2026-03-02T09:21:46Z

Addressed the review items in a follow-up update.

Changes in this pass:

malformed attestation payloads are now rejected fail-closed instead of being accepted after normalization
added a direct type guard to �alidate_fingerprint_data() for non-dict input
split tests into: malformed-input rejection, mutation no-500 regression, and a stricter fixture with hardware binding left active
added strict-path tests covering malformed fingerprint rejection and duplicate hardware binding rejection
expanded attack-vector coverage for invalid miner identifiers, oversized / invalid device.cores, invalid nested field shapes, and SQL-like miner input
updated docs/CI wording to describe this as a malformed-input regression gate rather than overstating it as coverage fuzzing

Validation run locally:

python -m pytest tests/test_attestation_fuzz.py -v
ATTEST_FUZZ_CASES=10000 python -m pytest tests/test_attestation_fuzz.py -k mutation_regression_no_unhandled_exceptions -v

Current local result:

24 passed
10,000 mutation regression cases passed without 5xx

liu971227-sys · 2026-03-02T09:36:04Z

Rebased onto the latest main and resolved the merge conflicts.

I also fixed a small post-rebase regression around attestation client IP extraction and re-ran the attestation test targets locally:

python -m pytest tests/test_attestation_fuzz.py -v
ATTEST_FUZZ_CASES=10000 python -m pytest tests/test_attestation_fuzz.py -k mutation_regression_no_unhandled_exceptions -v

Current local result:

24 passed
10,000 mutation regression cases passed without 5xx

liu971227-sys · 2026-03-02T09:37:09Z

Rebase is fully complete now.

I resolved the conflicts against the latest main, fixed a small post-rebase regression (client_ip_from_request()), and re-ran the attestation regression targets locally after the final force-push.

Validation:

python -m pytest tests/test_attestation_fuzz.py -v
ATTEST_FUZZ_CASES=10000 python -m pytest tests/test_attestation_fuzz.py -k mutation_regression_no_unhandled_exceptions -v

Current local result:

24 passed
10,000 mutation regression cases passed without 5xx

liu971227-sys mentioned this pull request Mar 1, 2026

[BOUNTY] Attestation Fuzz Harness + Crash Regression Corpus (98 RTC) Scottcjn/rustchain-bounties#475

Open

liu971227-sys force-pushed the bounty/475-attestation-fuzz-hardening branch from ea19c99 to 0c4213a Compare March 1, 2026 03:16

github-actions bot added ci size/L PR: 201-500 lines labels Mar 1, 2026

larryjiang-star reviewed Mar 1, 2026

View reviewed changes

larryjiang-star mentioned this pull request Mar 1, 2026

[BOUNTY] Code Review Bounty Program — Review PRs, Earn RTC (100 RTC Pool) Scottcjn/rustchain-bounties#73

Open

Scottcjn marked this pull request as ready for review March 1, 2026 19:20

Scottcjn self-requested a review as a code owner March 1, 2026 19:20

Scottcjn requested changes Mar 1, 2026

View reviewed changes

Scottcjn previously approved these changes Mar 1, 2026

View reviewed changes

liu971227-sys dismissed Scottcjn’s stale review via f318714 March 2, 2026 09:21

liu971227-sys added 4 commits March 2, 2026 17:30

test: add attestation fuzz harness hardening

341294b

test: document attestation fuzz regression gate

4385fe5

fix: reject malformed attestation payload shapes

11147c5

fix: restore attestation client ip helper after rebase

c1f460c

liu971227-sys force-pushed the bounty/475-attestation-fuzz-hardening branch from f318714 to c1f460c Compare March 2, 2026 09:36

liu971227-sys changed the title ~~draft: attestation fuzz harness + malformed-input hardening~~ attestation malformed-input regression harness + hardening Mar 2, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

attestation malformed-input regression harness + hardening#462

attestation malformed-input regression harness + hardening#462
liu971227-sys wants to merge 4 commits intoScottcjn:mainfrom
liu971227-sys:bounty/475-attestation-fuzz-hardening

liu971227-sys commented Mar 1, 2026 •

edited

Loading

Uh oh!

liu971227-sys commented Mar 1, 2026

Uh oh!

Scottcjn commented Mar 1, 2026

Uh oh!

larryjiang-star left a comment

Uh oh!

Scottcjn commented Mar 1, 2026

Uh oh!

Scottcjn commented Mar 1, 2026

Uh oh!

Scottcjn left a comment

Uh oh!

Scottcjn left a comment

Uh oh!

liu971227-sys commented Mar 2, 2026

Uh oh!

liu971227-sys commented Mar 2, 2026

Uh oh!

liu971227-sys commented Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Uh oh!

Conversation

liu971227-sys commented Mar 1, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Bounty

Validation

Corpus Summary

Risk Notes / Limitations

Uh oh!

liu971227-sys commented Mar 1, 2026

Uh oh!

Scottcjn commented Mar 1, 2026

Uh oh!

larryjiang-star left a comment

Choose a reason for hiding this comment

Code Review: Attestation Fuzz Harness + Hardening (PR #462)

Summary

Strengths

Minor Suggestions

Security Assessment

Uh oh!

Scottcjn commented Mar 1, 2026

Uh oh!

Scottcjn commented Mar 1, 2026

Uh oh!

Scottcjn left a comment

Choose a reason for hiding this comment

Code Review: Attestation Fuzz Harness

HIGH: Tests assert malformed inputs return ok: True

HIGH: All security mechanisms are mocked out

MEDIUM: fingerprint field type guard is incomplete

MEDIUM: Deterministic fuzzer with no coverage feedback

MEDIUM: Missing attack vectors

Summary

Uh oh!

Scottcjn left a comment

Choose a reason for hiding this comment

Review: APPROVED — Excellent security hardening work

Uh oh!

liu971227-sys commented Mar 2, 2026

Uh oh!

liu971227-sys commented Mar 2, 2026

Uh oh!

liu971227-sys commented Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

liu971227-sys commented Mar 1, 2026 •

edited

Loading

HIGH: Tests assert malformed inputs return `ok: True`

MEDIUM: `fingerprint` field type guard is incomplete