Fix #204. #211

BurntSushi · 2016-04-23T01:17:57Z

The DFA handles word boundaries by tagging each state with an is_word
flag that lets us determine whether the next byte in the haystack should
cause a word boundary instruction to match. We were mishandling how this
tagging happened for start states. In particular, the tag was not used as
an index into the start state cache, and therefore could wind up choosing
an incorrect but previously computed start state with the wrong flags set.
This leads to incorrect matches.

We fix this by using the right flags to generate an index.

The DFA handles word boundaries by tagging each state with an `is_word` flag that lets us determine whether the next byte in the haystack should cause a word boundary instruction to match. We were mishandling how this tagging happened for start states. In particular, the tag was not used as an index into the start state cache, and therefore could wind up choosing an incorrect but previously computed start state with the wrong flags set. This leads to incorrect matches. We fix this by using the right flags to generate an index.

BurntSushi mentioned this pull request Apr 23, 2016

Word borders don't seem to be working with split #204

Closed

BurntSushi merged commit 4332c9c into master Apr 23, 2016

BurntSushi deleted the fix-204 branch April 23, 2016 01:34

This was referenced Oct 13, 2025

chore(deps): bump regex from 1.11.1 to 1.12.1 dirvine/sb#40

Closed

chore(deps): bump regex from 1.11.1 to 1.12.2 dirvine/sb#42

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix #204. #211

Fix #204. #211

Uh oh!

BurntSushi commented Apr 23, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Fix #204. #211

Fix #204. #211

Uh oh!

Conversation

BurntSushi commented Apr 23, 2016

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants