Attempting to prevent multiple indentions of notes #72

colinmford · 2020-07-08T16:44:19Z

This should fix #71 if approved.

It uses regex to remove the first tab in each line of a multi-paragraph text block.

Test added to make sure

<note>
	Line1
	Line2
	Line3
</note>

does not change when it goes through _normalizeGlifNote into something like

<note>
	Line1
		Line2
		Line3
</note>

etc.

colinmford · 2020-07-08T17:46:33Z

Update:

Instead of using regex to remove the first tab, instead the text is split on this regex:

r"[\n|\r|\r\n]\t*"

Should match cross-platform newlines, plus a tab character if it's present. Results in a nice clean list:

NEWLINE_RE = re.compile(r"[\n|\r|\r\n]\t*")
text = "List1\n\tList2\n\tList3"
NEWLINE_RE.split(text)
>>> ("List1", "List2", "List3")

It replaces a .splitlines() function, however, which technically matches all of these below. I thought it was unlikely, but if we need to support any or all of the additional characters we could add them to the regex.

Representation	Description
\n	Line Feed
\r	Carriage Return
\r\n	Carriage Return + Line Feed
\v or \x0b	Line Tabulation
\f or \x0c	Form Feed
\x1c	File Separator
\x1d	Group Separator
\x1e	Record Separator
\x85	Next Line (C1 Control Code)
\u2028	Line Separator
\u2029	Paragraph Separator

… test

colinmford · 2020-07-08T19:03:16Z

Update based on @alerque suggestion:

Removes Regex and uses textwrap.dedent() to fix the multiple indention issue, but breaks this test

ufoNormalizer/tests/test_ufonormalizer.py

Lines 831 to 837 in 5b67c7a

    
           element = ET.fromstring( 
        
               "<note>   Line1  \t\n\n    Line3\t  </note>") 
        
           writer = XMLWriter(declaration=None) 
        
           _normalizeGlifNote(element, writer) 
        
           self.assertEqual( 
        
               writer.getText(), 
        
               "<note>\n\tLine1\n\t\n\t    Line3\n</note>")

…tions Ben Kiel noted in the issue discussion

colinmford · 2020-08-06T18:52:49Z

I made some revisions, following the advice that @benkiel made here. Now all tests are passing.

I added a dedent_tabs method that works like textwrap.dedent, but it will only dedent tabs and 4-space indentions.
https://github.com/colinmford/ufoNormalizer/blob/440f6a53f1e30be597393f76967f8ac1e528da27/src/ufonormalizer.py#L1367-L1416

This and a few extra lines of stripping allow all the tests to pass:
https://github.com/colinmford/ufoNormalizer/blob/440f6a53f1e30be597393f76967f8ac1e528da27/src/ufonormalizer.py#L1179-L1182

Finally, added additional tests to test for the case @benkiel brought up and other related cases:
https://github.com/colinmford/ufoNormalizer/blob/440f6a53f1e30be597393f76967f8ac1e528da27/tests/test_ufonormalizer.py#L848-L884

moyogo · 2020-08-31T15:41:45Z

Thanks @colinmford!

colinmford · 2020-09-02T19:02:14Z

Thanks @benkiel and @moyogo!

Preventing multiple indentions of notes

ab16bd0

colinmford mentioned this pull request Jul 8, 2020

Trouble normalizing the indention of new lines in Glyph Notes #71

Closed

Perhaps a better way of normalizing incoming paragraph lines

8b9bb06

Uses textwrap.dedent to fix notes indention issue, but breaks another…

b37afac

… test

colinmford added 3 commits August 6, 2020 14:40

Revised method for normalizing notes, new tests to test for the condi…

5b2f355

…tions Ben Kiel noted in the issue discussion

Removing Prints

ba6a1e1

Cleaning up some left over code

440f6a5

benkiel approved these changes Aug 31, 2020

View reviewed changes

benkiel merged commit ece58bc into unified-font-object:master Aug 31, 2020

benkiel mentioned this pull request Jul 30, 2021

Thoughts on <note> and other string data #84

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Attempting to prevent multiple indentions of notes #72

Attempting to prevent multiple indentions of notes #72

colinmford commented Jul 8, 2020 •

edited

Loading

colinmford commented Jul 8, 2020

colinmford commented Jul 8, 2020 •

edited

Loading

colinmford commented Aug 6, 2020 •

edited

Loading

moyogo commented Aug 31, 2020

colinmford commented Sep 2, 2020

Attempting to prevent multiple indentions of notes #72

Attempting to prevent multiple indentions of notes #72

Conversation

colinmford commented Jul 8, 2020 • edited Loading

colinmford commented Jul 8, 2020

colinmford commented Jul 8, 2020 • edited Loading

colinmford commented Aug 6, 2020 • edited Loading

moyogo commented Aug 31, 2020

colinmford commented Sep 2, 2020

colinmford commented Jul 8, 2020 •

edited

Loading

colinmford commented Jul 8, 2020 •

edited

Loading

colinmford commented Aug 6, 2020 •

edited

Loading