Add new state: Unicode compatibility normalization

Hey ,

I noticed that you are considering only two states:
1. One regarding the path normalization if it is done or not before the safe check
2. Second concerns the safe check.

as shown next:

https://github.com/github/codeql/blob/c1c0a705b9f14c0f577a9ae56a9d699e8b6e67d6/python/ql/lib/semmle/python/security/dataflow/PathInjectionQuery.qll#L20-L28

However, there is a third state that is a required one: `Unicode normalized`. If ever a Unicode normalization is performed with a compatibility algorithm (NFKC or NFKD), the query would miss some cases precisely those ones where the Unicode normalization is not performed before the path normalization and the safe check. I draw a little chart to depict my saying:

<img width="667" alt="Image" src="https://github.com/user-attachments/assets/4f45fe2a-49af-4cd4-ac9f-6274f95ec8f0" />

The previous chart shows that when you consider a potential Unicode compatibility normalization, it is a required step before path normalization and safe check. If ever placed between the first two steps or after the last one, that would yield a vulnerable case that got missed due to the fact that the Unicode normalization may reintroduce unexpected special characters such as `..` and `/`.

Regards
@Sim4n6 


	/** A state signifying that the file path has not been normalized. */
	class NotNormalized extends NormalizationState {
	NotNormalized() { this = "NotNormalized" }
	}

	/** A state signifying that the file path has been normalized, but not checked. */
	class NormalizedUnchecked extends NormalizationState {
	NormalizedUnchecked() { this = "NormalizedUnchecked" }
	}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add new state: Unicode compatibility normalization #19706

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Add new state: Unicode compatibility normalization #19706

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions