Some Recorder-generated text asserts need improved unicode normalization #1128

mdmintz · 2021-12-16T19:27:48Z

Some Recorder-generated text asserts need improved unicode normalization.
(https://unicode.org/reports/tr15/#Norm_Forms has details for anyone reading this that's confused.)

Essentially, some foreign language characters can be represented by more than one unicode format, and the Recorder isn't being consistent with character generation, which may cause text-based asserts to fail after the Recorder generates self.assert_text(TEXT, SELECTOR) lines because even if the TEXT appears to match the visible text on the web page, different unicode formats could cause the assertion to fail.

Here's an example of that:

ipdb> 'й'.encode()
b'\xd0\xb9'
ipdb> 'й'.encode()
b'\xd0\xb8\xcc\x86'

The text was updated successfully, but these errors were encountered:

mdmintz added the bug Uh oh... Something needs to be fixed label Dec 16, 2021

mdmintz self-assigned this Dec 16, 2021

mdmintz mentioned this issue Dec 17, 2021

The Recorder Mode Desktop App: Deluxe Edition (and more) #1133

Merged

mdmintz closed this as completed in #1133 Dec 17, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Some Recorder-generated text asserts need improved unicode normalization #1128

Some Recorder-generated text asserts need improved unicode normalization #1128

mdmintz commented Dec 16, 2021

Some Recorder-generated text asserts need improved unicode normalization #1128

Some Recorder-generated text asserts need improved unicode normalization #1128

Comments

mdmintz commented Dec 16, 2021