Add size limits to untrusted input parsing and replace unwrap on external responses by wksantiago · Pull Request #110 · privkeyio/keep

wksantiago · 2026-01-12T20:05:00Z

Summary

Add size limits to bincode deserialization in storage and hidden volume
Add size validation to FROST transport frame assembly
Add size limits to Nostr event parsing
Replace unwrap() with proper error handling on server responses

Test plan

All keep-core tests pass (67 tests)
All integration tests pass (7 tests)
Clippy passes with no warnings

Summary by CodeRabbit

Bug Fixes
- Enforced input size limits: event JSON capped at 64 KiB, assembled frames capped at 64 KiB, records capped at 1 MiB.
- Improved error reporting for serialization and deserialization failures.
- Added duplicate-frame detection, frame index bounds checks, stricter frame structure validation, and cumulative-size/overflow protections for multi-frame assembly.

_{✏️ Tip: You can customize this high-level summary in your review settings.}

…rnal responses

coderabbitai · 2026-01-12T20:05:10Z

Walkthrough

Adds defensive input validation and size limits: caps Nostr sign_event JSON, enforces FROST animated-frame count/assembled-size and frame sanity checks, and replaces bincode deserialization with Options using fixed-int encoding, allow_trailing_bytes, and per-record size limits.

Changes

Cohort / File(s)	Summary
Event JSON Size Limits `keep-cli/src/server.rs`	Adds `MAX_EVENT_JSON_SIZE` (64 KiB) and rejects oversize sign_event JSON. Replaces unconditional serialization unwraps with error-returning serialization handling for sign_event paths.
FROST Transport Validation `keep-core/src/frost/transport.rs`	Adds `MAX_FRAME_COUNT` (100) and `MAX_ASSEMBLED_SIZE` (64 KiB). Enforces frame count limit, duplicate and bounds checks for frame indices, per-frame JSON field validation, cumulative size tracking with overflow checks before assembling frames.
Bincode Deserialization Options `keep-core/src/hidden/volume.rs`, `keep-core/src/storage.rs`	Introduces `MAX_RECORD_SIZE` and switches from `bincode::deserialize` to `bincode::options().with_fixint_encoding().allow_trailing_bytes().with_limit(...)` for serialize/deserialize calls to enforce fixed-int encoding and size bounds across hidden storage and general storage paths.

Sequence Diagram(s)

(omitted)

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related issues

Add size limits to untrusted input parsing #106: Implements size limits and guarded parsing for untrusted inputs across sign_event, FROST transport frames, and bincode deserialization.

Poem

🐰 I hop through bytes with careful paws,

I count the frames and check the laws,
No jumbo JSON sneaks inside,
Fixed-int gates swing open wide,
Hooray for safe and bounded code! 🥕

🚥 Pre-merge checks | ✅ 2 | ❌ 1

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	Write docstrings for the functions missing them to satisfy the coverage threshold.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The title accurately captures the two main objectives: adding size limits to untrusted input parsing and replacing unwrap calls on external responses, which aligns with all file changes across the codebase.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing touches

📝 Generate docstrings

📜 Recent review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2984362 and efb7a01.

📒 Files selected for processing (1)

keep-core/src/hidden/volume.rs

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build

🔇 Additional comments (4)

keep-core/src/hidden/volume.rs (4)

45-54: LGTM! Appropriate defensive size limit for untrusted input.

The import and constant are correctly defined. The 1MB limit provides protection against malicious or corrupted data while being generous enough for legitimate key records.

458-460: LGTM! Consistent serialization with fixed-int encoding.

Using with_fixint_encoding() ensures a predictable binary format that will correctly roundtrip with the corresponding deserialization options.

545-547: LGTM! Consistent serialization options.

The serialization options match the pattern used in store_key_outer and will correctly roundtrip with load_hidden_records.

600-604: LGTM! Appropriate size limit for individual record deserialization.

The 1MB limit is well-suited for single KeyRecord deserialization. The options chain correctly matches the serialization format used in store_key_outer.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In @keep-core/src/hidden/volume.rs:
- Around line 519-523: Deserialization in the records path uses
bincode::options().with_fixint_encoding(), but the corresponding serialization
calls in store_key_outer and write_hidden_records (and the other serialize call
around lines 596-600) use plain bincode::serialize() with default varint
encoding, causing a mismatch; fix by changing those bincode::serialize(...)
calls to use the same options (e.g.,
bincode::options().with_fixint_encoding().serialize(&value)) so serialization
and deserialization use identical encoding; update every serialize call
referenced (store_key_outer, write_hidden_records, and the other serialize
instances) to use bincode::options().with_fixint_encoding() to ensure
consistency with the existing deserialize(...) usage.

📜 Review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e9b5abd and 2984362.

📒 Files selected for processing (4)

keep-cli/src/server.rs
keep-core/src/frost/transport.rs
keep-core/src/hidden/volume.rs
keep-core/src/storage.rs

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build

🔇 Additional comments (7)

keep-core/src/storage.rs (2)

15-21: Good defensive additions for size-limited deserialization.

The MAX_RECORD_SIZE constant (1 MiB) and bincode::Options import set up proper bounds for untrusted input parsing.

292-296: This review comment is based on an incorrect assumption about the bincode encoding behavior.

The code shows that bincode::serialize() (lines 261, 355) stores data, and bincode::options().with_fixint_encoding().deserialize() (lines 292-296, 315-319, 388-392) retrieves it. However, the test test_storage_key_operations (lines 474-500) explicitly validates this round-trip pattern by storing a KeyRecord and then loading it back with assertions that the data matches. This test passes, which demonstrates that no encoding mismatch exists in practice. If the encoding claim were correct, the test would fail during deserialization.

Likely an incorrect or invalid review comment.

keep-cli/src/server.rs (2)

17-18: Good size validation for untrusted input.

The 64 KiB limit for event JSON is reasonable and the check happens before parsing, which prevents potential DoS via malformed large payloads.

Also applies to: 293-296

317-320: Proper error handling for serialization.

Replacing unwrap with explicit error handling ensures graceful failure instead of panicking on serialization errors from external event data.

keep-core/src/frost/transport.rs (2)

12-13: Good constants for bounding frame assembly.

MAX_FRAME_COUNT (100) and MAX_ASSEMBLED_SIZE (64 KiB) provide reasonable limits for the animated frames protocol.

253-296: Comprehensive validation for multi-frame assembly.

The implementation correctly:

Limits total frame count upfront

Detects duplicate frame indices

Validates frame index bounds

Uses checked_add for overflow-safe size accumulation

Checks cumulative size limit incrementally

This provides robust protection against malformed or malicious frame data.

keep-core/src/hidden/volume.rs (1)

45-54: Size limit constant and Options import added correctly.

Same pattern as storage.rs for defensive deserialization bounds.

keep-core/src/hidden/volume.rs

coderabbitai

Actionable comments posted: 0

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

keep-core/src/hidden/volume.rs (1)

521-535: Document the hidden volume capacity limit and clarify the distinction between outer and hidden volume deserialization.

The MAX_RECORD_SIZE (1MB) limit applies differently depending on the volume type:

Outer volume (list_keys_outer): Deserializes individual KeyRecord items from the redb table—1MB per record is reasonable

Hidden volume (load_hidden_records): Deserializes the entire Vec<KeyRecord> in a single operation—1MB for the entire collection could be restrictive if users store thousands of keys

Additionally, the write_hidden_records method does not enforce a size limit on the serialized output, creating a potential mismatch where data could be written that exceeds the deserialization limit.

The use of allow_trailing_bytes() is consistent across the codebase and intentionally enables forward compatibility, but it does mask data corruption. Add a comment explaining this trade-off.

Recommendations:

Document the implicit hidden volume capacity (e.g., ~2,000–5,000 keys depending on key name and secret size)

Add validation in write_hidden_records to check serialized size against the limit, or use a larger limit for collection deserialization

Add comments explaining why allow_trailing_bytes() is used despite the corruption-masking trade-off

📜 Review details

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2984362 and efb7a01.

📒 Files selected for processing (1)

keep-core/src/hidden/volume.rs

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: build

🔇 Additional comments (4)

keep-core/src/hidden/volume.rs (4)

45-54: LGTM! Appropriate defensive size limit for untrusted input.

The import and constant are correctly defined. The 1MB limit provides protection against malicious or corrupted data while being generous enough for legitimate key records.

458-460: LGTM! Consistent serialization with fixed-int encoding.

Using with_fixint_encoding() ensures a predictable binary format that will correctly roundtrip with the corresponding deserialization options.

545-547: LGTM! Consistent serialization options.

The serialization options match the pattern used in store_key_outer and will correctly roundtrip with load_hidden_records.

600-604: LGTM! Appropriate size limit for individual record deserialization.

The 1MB limit is well-suited for single KeyRecord deserialization. The options chain correctly matches the serialization format used in store_key_outer.

Add size limits to untrusted input parsing and replace unwrap on exte…

2984362

…rnal responses

wksantiago requested a review from kwsantiago January 12, 2026 20:05

wksantiago self-assigned this Jan 12, 2026

This was linked to issues Jan 12, 2026

Replace unwrap() on untrusted external responses #105

Closed

Add size limits to untrusted input parsing #106

Closed

coderabbitai bot reviewed Jan 12, 2026

View reviewed changes

keep-core/src/hidden/volume.rs Show resolved Hide resolved

Fix bincode serialization/deserialization encoding mismatch

efb7a01

coderabbitai bot reviewed Jan 12, 2026

View reviewed changes

kwsantiago approved these changes Jan 16, 2026

View reviewed changes

kwsantiago merged commit bcf07d5 into main Jan 16, 2026
2 checks passed

kwsantiago deleted the Replace-unwrap-and-Add-size-limits branch January 16, 2026 00:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add size limits to untrusted input parsing and replace unwrap on external responses#110

Add size limits to untrusted input parsing and replace unwrap on external responses#110
kwsantiago merged 2 commits intomainfrom
Replace-unwrap-and-Add-size-limits

wksantiago commented Jan 12, 2026 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Jan 12, 2026 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

wksantiago commented Jan 12, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jan 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related issues

Poem

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

wksantiago commented Jan 12, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jan 12, 2026 •

edited

Loading