pdf-parser: Store Name as Vec<u8> and skip comments in arrays/dicts by Velli20 · Pull Request #90 · Velli20/safe-pdf

Velli20 · 2026-02-17T19:04:55Z

Change ObjectVariant::Name from String to Vec<u8> to preserve raw bytes and avoid lossy Latin-1 interpretation of #HH hex escapes. Dictionary keys are converted to String at the parsing boundary, since the PDF spec requires them to be ASCII.
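To illustrate why raw bytes matter here, the following is a minimal sketch of #HH decoding into a Vec<u8>, plus a key conversion at the dictionary boundary. The helper names `hex_val`, `decode_name`, and `dict_key` are hypothetical and are not taken from the safe-pdf code. Returning `None` for a non-ASCII key is an assumption made for the sketch, not necessarily what the parser does.

```rust
/// Convert one ASCII hex digit to its numeric value.
fn hex_val(b: u8) -> Option<u8> {
    match b {
        b'0'..=b'9' => Some(b - b'0'),
        b'a'..=b'f' => Some(b - b'a' + 10),
        b'A'..=b'F' => Some(b - b'A' + 10),
        _ => None,
    }
}

/// Decode the bytes that follow the leading `/` of a PDF name token.
/// Each well-formed `#HH` escape becomes exactly one output byte;
/// anything else is copied through unchanged.
/// Hypothetical helper, not the function used in safe-pdf.
fn decode_name(raw: &[u8]) -> Vec<u8> {
    let mut out = Vec::with_capacity(raw.len());
    let mut i = 0;
    while i < raw.len() {
        if raw[i] == b'#' && i + 2 < raw.len() {
            if let (Some(hi), Some(lo)) = (hex_val(raw[i + 1]), hex_val(raw[i + 2])) {
                out.push((hi << 4) | lo);
                i += 3;
                continue;
            }
        }
        out.push(raw[i]);
        i += 1;
    }
    out
}

/// Convert a decoded name to a String for use as a dictionary key.
/// Assumption for this sketch: non-ASCII names are rejected with `None`.
fn dict_key(name: &[u8]) -> Option<String> {
    if name.is_ascii() {
        String::from_utf8(name.to_vec()).ok()
    } else {
        None
    }
}
```

With a String-based name, the escape `#E9` read as Latin-1 becomes the character U+00E9, which a Rust String stores as the two bytes 0xC3 0xA9. The Vec<u8> form keeps the single original byte 0xE9, so the name can be compared or written back without change.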

Replace skip_whitespace() with skip_whitespace_and_comments() in array and dictionary parsers so that PDF comments between elements are properly consumed instead of causing parse errors.
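A rough sketch of what such a skipping routine might look like is given below. Only the name `skip_whitespace_and_comments` comes from this PR. The signature (byte slice plus position, returning the new position), the body, and the `is_pdf_whitespace` helper are assumptions for illustration. It relies on the PDF rules that whitespace is NUL, HT, LF, FF, CR, or SP, and that a comment runs from `%` to the end of the line.

```rust
/// PDF whitespace bytes: NUL, HT, LF, FF, CR, SP.
fn is_pdf_whitespace(b: u8) -> bool {
    matches!(b, 0x00 | 0x09 | 0x0A | 0x0C | 0x0D | 0x20)
}

/// Advance `pos` past any run of whitespace and `%` comments and return
/// the index of the next significant byte (or `input.len()` at the end).
/// Assumed signature; the real parser may track position differently.
fn skip_whitespace_and_comments(input: &[u8], mut pos: usize) -> usize {
    loop {
        // Consume plain whitespace first.
        while pos < input.len() && is_pdf_whitespace(input[pos]) {
            pos += 1;
        }
        if pos < input.len() && input[pos] == b'%' {
            // Consume the comment up to, but not including, the line end;
            // the next loop iteration treats CR/LF as whitespace.
            while pos < input.len() && input[pos] != b'\n' && input[pos] != b'\r' {
                pos += 1;
            }
        } else {
            return pos;
        }
    }
}
```

Called between array elements or dictionary entries, an input such as `[1 % first\n 2]` resumes parsing at `2` instead of failing on `%`. The sketch is meant only for positions between tokens, since a `%` inside a string literal is not a comment.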

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Velli20 force-pushed the parser-fixes-4 branch from 8a4012b to 5db118f on February 17, 2026 19:10

Velli20 merged commit 4d281f3 into main Feb 17, 2026
3 checks passed
