Problem with text offset and Linebreak #52
Comments
I'd like to latch some related issues onto this one, as I've seen them in practice:
<t>De
FoLiA developers zijn:</t>
So there is a newline in the XML but not an explicit newline, meaning no newline as far as FoLiA is concerned. But it is still whitespace. So I think this is:
And not (I've seen this happen):
And what about?
<t>De\s\s\s\s\s\s\s
FoLiA developers zijn:</t>
I'm not entirely sure how we handle that currently; I'd say it's still offset 3. I do agree there is a good argument for considering the offset to be 4 in your above case of an explicit linebreak.
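To make the "still offset 3" reading concrete, here is a minimal sketch that assumes whitespace normalization means collapsing every run of whitespace (spaces and raw newlines alike) into a single space. The helper `normalize_ws` is hypothetical and is not taken from libfolia or folia.py; it only illustrates the rule being discussed.

```python
import re


def normalize_ws(text: str) -> str:
    """Collapse each run of whitespace into one space (assumed rule, not a library API)."""
    return re.sub(r"\s+", " ", text).strip()


# A raw newline in the XML source, with no explicit linebreak element.
plain = normalize_ws("De\nFoLiA developers zijn:")

# The same text with several trailing spaces before the raw newline.
padded = normalize_ws("De       \nFoLiA developers zijn:")

# Under this assumption both variants yield the same text and the same offset.
offset_plain = plain.index("FoLiA")
offset_padded = padded.index("FoLiA")
```

Under that assumption, both inputs normalize to the same string, so the word FoLiA sits at offset 3 regardless of how much whitespace follows "De".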
well, libfolia's folialint happily accepts this:
Also with offset 3. Considering:
Regarding this (where there are 6 spaces after 'De')
folialint is really happy ...
So this seems to have been solved a long time ago
example:
C++'s libfolia accepts this, as it sees every <br/> as 1 character, so the offset of FoLiA is 7.
Python's folia.py rejects this, as it ignores all <br/> symbols and requires an offset of 3.
I think libfolia is right here, but this is very tricky indeed.
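The two counting conventions can be sketched as follows. This is only an illustration using Python's standard `xml.etree.ElementTree`, not the actual code of libfolia or folia.py. The input string is a hypothetical example chosen so that the two conventions give 7 and 3; it is not necessarily the example from this issue, and the `flatten` helper is likewise made up for the sketch.

```python
import xml.etree.ElementTree as ET


def flatten(elem: ET.Element, br_as: str) -> str:
    """Concatenate the text of elem, substituting br_as for each <br/> child.

    Simplification for this sketch: the text inside other child elements is
    skipped; only their tails are kept.
    """
    parts = [elem.text or ""]
    for child in elem:
        if child.tag == "br":
            parts.append(br_as)
        parts.append(child.tail or "")
    return "".join(parts)


# Hypothetical input: "De", one space, then four explicit linebreaks.
t = ET.fromstring("<t>De <br/><br/><br/><br/>FoLiA developers zijn:</t>")

# Convention A: every <br/> counts as one character.
counted = flatten(t, "\n")
offset_counted = counted.index("FoLiA")

# Convention B: <br/> elements are ignored entirely.
ignored = flatten(t, "")
offset_ignored = ignored.index("FoLiA")
```

For this input, convention A puts FoLiA at offset 7 and convention B at offset 3, so a document written for one convention fails offset validation under the other.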