Remove `\xXX` char escapes from the language #12769

Valloric · 2014-03-08T21:19:45Z

\xXX is very misleading in Rust since it actually works exactly like \u00XX instead of the way it works in C, C++ and other languages. Example:

// this FAILS because left is [195, 191]
assert_eq!( bytes!( "\xFF"), bytes!( 255 ) ); 
// this SUCCEEDS
assert_eq!( bytes!( "\xFF"), bytes!( "\u00FF" ) );

I understand the reasoning behind this (Rust strings are always UTF-8), but then \xXX shouldn't exist in the language. It brings nothing but confusion and it's functionality as implemented is the same as \u00XX.

The text was updated successfully, but these errors were encountered:

alexcrichton · 2014-03-09T01:02:44Z

Closing, this was previously decided in #2800 to be working as intended.

Valloric · 2014-03-09T01:38:09Z

#2800 was about changing \xXX to mean utf8 code unit instead of unicode codepoint. I agree with the conclusion in that issue that that change is not a good idea since it isn't useful.

But this issue is separate; it's about removing \xXX from the language entirely. \uXXXX has to exist because it covers a larger range of values and means "unicode codepoint" in every language. \xXX in Rust has no use because it's equivalent to \u00XX but causes confusion because \xXX in C and C++ means raw byte hex.

So in aggregate it's worse than useless, it's a net negative. It addresses no use case and provides no benefit but comes with a cost; the only thing it does successfully is confuse users coming from Rust's primary market, C & C++ developers.

I honestly can't see why it's being kept.

lilyball · 2014-05-20T00:35:57Z

I'm strongly in favor of a modified form of this, where we allow \xXX for ASCII characters but disallow it for non-ASCII. This was suggested recently in rust-lang/rfcs#69, and apparently was also suggested back in #2800.

Keeping \xXX for codepoints U+0000 through U+007F seems like a good idea, because it means the same thing regardless of whether \xXX is interpreted as a codepoint or a code unit, and it's a convenient syntax for referring to ASCII characters. But interpreting \x80-\xFF as unicode only serves to be confusing. And not just to C/C++ programmers; even though I've been using Rust for quite a while, the other day I caught myself using \x80 in a string and expecting to get the byte 0x80.

If we restrict it to ASCII characters now, that also makes the behavior of the proposed byte string literals (rust-lang/rfcs#69) make more sense, where \x80 will definitely want to refer to the byte 0x80.

chris-morgan · 2014-05-20T01:32:15Z

I am in favour of restricting it for non-bytestring-literals also.

emberian · 2014-06-03T20:50:27Z

This is surprising, I assumed this was only a byte literal. I agree with @kballard here.

Valloric · 2014-06-03T21:40:55Z

As I mentioned on rust-lang/rfcs#69, I agree with @kballard's proposal.

rust-highfive · 2014-09-24T05:00:44Z

This issue has been moved to the RFCs repo: rust-lang/rfcs#312

thestinger added the B-RFC label Mar 8, 2014

alexcrichton closed this as completed Mar 9, 2014

Valloric mentioned this issue May 13, 2014

RFC: Add byte and byte string literals rust-lang/rfcs#69

Merged

lilyball reopened this May 20, 2014

rust-highfive mentioned this issue Sep 24, 2014

Remove \xXX char escapes from the language rust-lang/rfcs#312

Closed

rust-highfive closed this as completed Sep 24, 2014

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove `\xXX` char escapes from the language #12769

Remove `\xXX` char escapes from the language #12769

Valloric commented Mar 8, 2014

alexcrichton commented Mar 9, 2014

Valloric commented Mar 9, 2014

lilyball commented May 20, 2014

chris-morgan commented May 20, 2014

emberian commented Jun 3, 2014

Valloric commented Jun 3, 2014

rust-highfive commented Sep 24, 2014

Remove \xXX char escapes from the language #12769

Remove \xXX char escapes from the language #12769

Comments

Valloric commented Mar 8, 2014

alexcrichton commented Mar 9, 2014

Valloric commented Mar 9, 2014

lilyball commented May 20, 2014

chris-morgan commented May 20, 2014

emberian commented Jun 3, 2014

Valloric commented Jun 3, 2014

rust-highfive commented Sep 24, 2014

Remove `\xXX` char escapes from the language #12769

Remove `\xXX` char escapes from the language #12769