Fix software renderer line breaker without the unicode feature #1606

ogoffart · 2022-09-07T09:42:08Z

This change makes the test pass without the unicode feature.

Note that it is not possible to run the test without the unicode feature unless one changes the Cargo.toml, so I did that locally to run the tests

This change makes the test pass without the unicode feature. Note that it is not possible to run the test without the unicode feature unless one changes the Cargo.toml, so I did that locally to run the tests

tronical · 2022-09-07T09:55:17Z

internal/core/textlayout/linebreak_simple.rs

            if let Some(opportunity) = maybe_opportunity {
-                return Some((byte_offset, opportunity));
+                return Some((byte_offset + 1, opportunity));


This worries me a tiny little bit, as this could return a byte offset that's out of the string bounds, if the last character is a break opportunity.

"one past the end" is ok, not? otherwise i can change that to not return that if this is the end.

I think that would be good, yes.

This is a test that passes with unicode and should pass with both (but doesn't!):

#[test] fn test_linebreak_opportunity_at_eot() { let mut it = LineBreakIterator::new("Hello World\n\n"); assert_eq!(it.next(), Some((6, BreakOpportunity::Allowed))); assert_eq!(it.next(), Some((12, BreakOpportunity::Mandatory))); assert_eq!(it.next(), None); }

I added a test that should cover this use case, does it?. The use of this function in fragments.rs seems to anyway create a break at the end if there isn't one (but the difference is that trailing_mandatory_break will be set.)

Yes, that's fine as well. The above is a little more localised, but I think your change is good as well :)

your test will indeed not pass with this change. But should it?

I understand that now the linebreak_simple and linebreak_unicode will behave differently with trailing \n, but checking for the last codepoint is a bit of work which i think is unnessery since the consumer won't bother.

Unless i'm missing something

Right now the consumer doesn't bother. But as the issue as shown, subtle differences between the two line break iterators are difficult to spot and may cause bugs that are hard to find.

So I see three options:

We ignore this difference for now, because the current consumers doesn't care.

We avoid this known difference now (but there may be more, apart from the unicode line break algorithm of course).

We switch to always using unicode line break algorithm, at the expense of code size.

I'm fine with either. What's your take?

i'll go with 1. :-)

Fix software renderer line breaker without the unicode feature

b58fd03

This change makes the test pass without the unicode feature. Note that it is not possible to run the test without the unicode feature unless one changes the Cargo.toml, so I did that locally to run the tests

ogoffart requested a review from tronical September 7, 2022 09:42

tronical reviewed Sep 7, 2022

View reviewed changes

tronical approved these changes Sep 7, 2022

View reviewed changes

Test that trailing newline don't add a line

9576f23

ogoffart merged commit 9d0e90c into master Sep 7, 2022

ogoffart deleted the olivier/swrenderer branch September 7, 2022 11:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix software renderer line breaker without the unicode feature #1606

Fix software renderer line breaker without the unicode feature #1606

ogoffart commented Sep 7, 2022

tronical Sep 7, 2022

ogoffart Sep 7, 2022

tronical Sep 7, 2022 •

edited

Loading

ogoffart Sep 7, 2022

tronical Sep 7, 2022

ogoffart Sep 7, 2022

ogoffart Sep 7, 2022

tronical Sep 7, 2022

ogoffart Sep 7, 2022

Fix software renderer line breaker without the unicode feature #1606

Fix software renderer line breaker without the unicode feature #1606

Conversation

ogoffart commented Sep 7, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tronical Sep 7, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

tronical Sep 7, 2022 •

edited

Loading