Unicode escaping for method names is sometimes broken #2651

bradwilson · 2023-01-22T07:24:35Z

Discussed in #2583

^{Originally posted by tsawyer999 September 15, 2022}
Hello,

I am trying to display the name for a test:

Parameter "RETURN VALUE" is always added

I tried those two ways and was not able to accomplish the expected result.

   [Fact]
   public void Parameter_X22RETURN_VALUEX22_is_always_added()

Result:

Parameter "RETURN VALUEX22 is always added

   [Fact]
   public void Parameter_X22RETURN_VALUE_X22_is_always_added()

Result:

Parameter "RETURN VALUE_" is always added

Can someone give me a hand to achieve the desired result?

Thank you very much!

This is a bug in the encoding routine in DisplayNameFormatter:

xunit/src/xunit.v3.core/Sdk/Frameworks/DisplayNameFormatter.cs

Lines 198 to 221 in 2b1f75b

    
           static void TryConsumeEscapeSequence( 
        
           	FormatContext context, 
        
           	char @char, 
        
           	int allowedLength) 
        
           { 
        
           	var escapeSequence = new char[allowedLength]; 
        
           	var consumed = 0; 
        
           	while (consumed < allowedLength && context.HasMoreText) 
        
           	{ 
        
           		var nextChar = context.ReadNext(); 
        
           		escapeSequence[consumed++] = nextChar; 
        
           		if (IsHex(nextChar)) 
        
           			continue; 
        
           		context.Buffer.Append(@char); 
        
           		context.Buffer.Append(escapeSequence, 0, consumed); 
        
           		return; 
        
           	} 
        
           	context.Buffer.Append(char.ConvertFromUtf32(HexToInt32(escapeSequence))); 
        
           }

Lines 216 adds the literal characters that it already consumed into the output string. Unfortunately, the logic is failing because of these steps:

It sees the U in VALUEX22 and tries to interpret that as a Unicode escape (4 character)
It then sees the E, which is a valid hex digit, and goes back for another
It then sees the X, which is not a valid hex digit; it adds UEX to the output, then loops back to look for more escapes.

Unfortunately, because it consumed the X (and put it literally into the output), the algorithm now starts up at 22, which of course is not an escape sequence, so it puts those into the output as literal values.

The expected encoding is Parameter "RETURN VALUE" is always added

The actual encoding is Parameter "RETURN VALUEX22 is always added

The text was updated successfully, but these errors were encountered:

koenigst · 2023-01-29T12:44:32Z

I will create a PR for this. Any thoughts on how the look-ahead should be implemented?

bradwilson added type: Bug area: Core framework target: 3.0 labels Jan 22, 2023

bradwilson mentioned this issue Jan 22, 2023

v3 Roadmap #2133

Open

koenigst added a commit to koenigst/xunit that referenced this issue Jan 29, 2023

xunit#2651 Fix unicode escaping of test names.

03ed084

koenigst added a commit to koenigst/xunit that referenced this issue Jan 29, 2023

xunit#2651 Fix unicode escaping of test names.

b34ccbf

koenigst added a commit to koenigst/xunit that referenced this issue Jan 29, 2023

xunit#2651 Fix unicode escaping of test names.

404adb0

koenigst mentioned this issue Jan 29, 2023

#2651 Fix unicode escaping of test names. #2657

Merged

bradwilson closed this as completed in #2657 Feb 22, 2023

bradwilson pushed a commit that referenced this issue Feb 22, 2023

#2651 Fix unicode escaping of test names. (#2657)

5f13e40

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unicode escaping for method names is sometimes broken #2651

Unicode escaping for method names is sometimes broken #2651

bradwilson commented Jan 22, 2023

koenigst commented Jan 29, 2023

Unicode escaping for method names is sometimes broken #2651

Unicode escaping for method names is sometimes broken #2651

Comments

bradwilson commented Jan 22, 2023

Discussed in #2583

koenigst commented Jan 29, 2023