New Rule: disallow unicode confusable identifiers

### Rule details

Compute the Unicode skeleton of declared identifiers and disallow if similar to an identifier already in scope

### Related CVE

CVE-2021-42694

### Example code

```js
const loremIpsum = "latin only";
const lоrеmIрsum = "with Cyrillic ";
const lorem‍Ipsum = "with ZWJ";
```


### Participation

- [ ] I am willing to submit a pull request to implement this rule.

### Additional comments

The Zero-Width Joiner (`\u200d`) is a [valid identifier character](https://mathiasbynens.be/notes/javascript-identifiers), even though some parsers like the ones used by typescript or Webpack fail to parse correctly.

Cyrillic characters in the example code is one case of confusable unicode character with latin character, but there are a lot of other possibilities, including confusion between non-latin characters. Unicode defines an algorithm to compute the [skeleton](http://www.unicode.org/reports/tr39/#Confusable_Detection) of text, which we could apply to identifiers, and base the comparison on the skeleton instead of the identifier string.

First reported in https://github.com/eslint/eslint/issues/15240#issuecomment-961535750

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

New Rule: disallow unicode confusable identifiers #117

Rule details

Related CVE

Example code

Participation

Additional comments

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Uh oh!

New Rule: disallow unicode confusable identifiers #117

Description

Rule details

Related CVE

Example code

Participation

Additional comments

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions