fix for FS#2676, inserting zero length spaces into long sequences of non... by Chris--S · Pull Request #165 · dokuwiki/dokuwiki

Chris--S · 2013-01-26T17:08:53Z

...-breaking characters in diffs

post process the html content string returned by Diff->format to locate long, unbroken strings of characters. Examine those strings and insert zero length (zl) spaces after certain characters (e.g. /#!,:;). When there are sequences of the 'special' characters only insert the zl space after the last character in the sequence.

Also, don't modify content within html tags and keep html entities together.

…non-breaking characters in diffs

michitux · 2013-01-26T18:12:39Z

Should this be used in the diff mails, too? Or are (possibly mobile) mail clients better at that?

michitux · 2013-01-26T18:23:56Z

inc/html.php

I think that entities in the form &#xHEX; (where HEX is a hex value) are valid, too.

for simplicity, do you think changing to my later simplified pattern?

&#?\w{1,4};

I don't think its a good idea to make it overly complicated or accurate. I think its ok to catch more than the set of valid html entities. So saying, do any have more than 4 chars?

Yes, many, have a look at http://htmlentities.com/html/entities/

selfthinker · 2013-01-27T10:56:15Z

Although the original was about URLs because URLs are by far the longest string, I wonder if it also makes sense to do something about other potentially long strings. E.g. _ could be in long variable names or - could be in long product numbers...

Chris--S · 2013-01-27T12:24:02Z

The fix is for long strings, long being 12 characters without a breaking character.

selfthinker · 2013-01-27T13:00:01Z

Yes, I get that. But because the idea came because of URLs, we only looked for typical characters in URLs to break a string. That's why we didn't think of - or _.

Chris--S · 2013-01-27T13:38:20Z

I don't think those two characters should be followed by zero length spaces as they tend to indicate full words. They could be followed by  (is that necessary for '-').

Thinking out loud ... we could do a second parse for long unbroken strings looking for '-' after the first. That would avoid breaking at '-' and '_' except when they were involved in long strings without the other break characters.

selfthinker · 2013-01-27T18:21:55Z

Using  for - and _ would be fine with me as well. Maybe we should leave it as it is and only add other characters when we encounter problems with words including them.

splitbrain · 2013-01-27T21:15:03Z

I'd say let's keep it simple for now.

Would a shy add a hyphen when the browser wraps it? If yes, I'd not use that for diffs as an additional character might be confusing.

fix for FS#2676, inserting zero length spaces into long sequences of non...

Validation Refactor

fix for FS#2676, inserting zero length spaces into long sequences of …

fcfecb6

…non-breaking characters in diffs

michitux reviewed Jan 26, 2013
View reviewed changes

update pattern to catch more html entities

298a7e0

Chris--S added a commit that referenced this pull request Feb 3, 2013

Merge pull request #165 from Chris--S/FS#2676

1061759

fix for FS#2676, inserting zero length spaces into long sequences of non...

Chris--S merged commit 1061759 into dokuwiki:master Feb 3, 2013

micgro42 mentioned this pull request Jun 24, 2015

Zero-Width-Whitespace introduced in revision diff after / within <code> tags #1208

Closed

bleistivt mentioned this pull request Dec 3, 2018

Fix zero width spaces in diffs #2618

Merged

splitbrain added a commit that referenced this pull request Apr 9, 2020

Merge pull request #165 from cosmocode/validationrefactor

6e70451

Validation Refactor

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix for FS#2676, inserting zero length spaces into long sequences of non...#165

fix for FS#2676, inserting zero length spaces into long sequences of non...#165
Chris--S merged 2 commits intodokuwiki:masterfrom
Chris--S:FS#2676

Chris--S commented Jan 26, 2013

Uh oh!

michitux commented Jan 26, 2013

Uh oh!

michitux Jan 26, 2013

Uh oh!

Chris--S Jan 26, 2013

Uh oh!

michitux Jan 26, 2013

Uh oh!

selfthinker commented Jan 27, 2013

Uh oh!

Chris--S commented Jan 27, 2013

Uh oh!

selfthinker commented Jan 27, 2013

Uh oh!

Chris--S commented Jan 27, 2013

Uh oh!

selfthinker commented Jan 27, 2013

Uh oh!

splitbrain commented Jan 27, 2013

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

Chris--S commented Jan 26, 2013

Uh oh!

michitux commented Jan 26, 2013

Uh oh!

michitux Jan 26, 2013

Choose a reason for hiding this comment

Uh oh!

Chris--S Jan 26, 2013

Choose a reason for hiding this comment

Uh oh!

michitux Jan 26, 2013

Choose a reason for hiding this comment

Uh oh!

selfthinker commented Jan 27, 2013

Uh oh!

Chris--S commented Jan 27, 2013

Uh oh!

selfthinker commented Jan 27, 2013

Uh oh!

Chris--S commented Jan 27, 2013

Uh oh!

selfthinker commented Jan 27, 2013

Uh oh!

splitbrain commented Jan 27, 2013

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants