efritz
Contributor

@efritz efritz commented Sep 11, 2025

The added test case contains an escaped multi-byte string that is valid UTF-8 and caused a panic before this fix. The fix advances past the entire multi-byte sequence instead of capping the advance at a single byte.

That change caused a few other test failures, because one code path validated unescaped identifiers as UTF-8 while other code paths assume they are ASCII. The follow-up fix treats them uniformly.

Please let me know if I've misunderstood the original purpose of the removed min guard.

@varungandhi-src varungandhi-src merged commit ba7cc5f into sourcegraph:main Sep 23, 2025
2 checks passed
@varungandhi-src
Contributor

Thanks!

@efritz efritz deleted the ef/multi-byte-utf8-sequences branch October 6, 2025 14:28