ConstExprParser: support numeric literal separator #189

jiripudil · 2023-04-20T16:29:07Z

This PR adds support for an underscore separator in numeric literals, e.g.

/** @var int<0, 999_999_999> */

The logic should be the same as supported by PHP (https://wiki.php.net/rfc/numeric_literal_separator). To maintain BC, this is also implemented similarly to how PHP does it: the lexer accepts the separator, and the parser then strips it from the value.
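
For illustration, the "accept in the lexer, strip afterwards" idea for plain decimal integers can be sketched as below. This is a minimal sketch, not the code from this PR: the pattern DECIMAL_INT_PATTERN and the helper parseDecimalIntLiteral() are hypothetical names, and the real lexer also covers floats, hex, octal and binary literals.

```php
<?php

// Hypothetical pattern (not the actual phpdoc-parser lexer regex):
// decimal digits, with a single underscore allowed only between two digits.
const DECIMAL_INT_PATTERN = '/\A[0-9]+(?:_[0-9]+)*\z/';

// Hypothetical helper: validate the literal first, then strip the separators
// so that consumers receive a plain numeric string.
function parseDecimalIntLiteral(string $literal): ?int
{
    if (preg_match(DECIMAL_INT_PATTERN, $literal) !== 1) {
        return null; // rejects e.g. "_1", "1_" and "1__0"
    }

    return (int) str_replace('_', '', $literal);
}
```

With this sketch, '999_999_999' is accepted and yields 999999999, while leading, trailing or doubled underscores are rejected, which matches the rules in the PHP RFC.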

mvorisek · 2023-04-26T16:01:20Z

doc/grammars/type.abnf


 ConstantFloatExp
-	= "e" ["-"] 1*ByteDecDigit
+	= "e" ["-"] 1*ByteDecDigit *("_" 1*ByteDecDigit)


Please see phpstan/phpstan-src#2358 (comment): a "+" before the exponent should be supported and, for consistency, also before any number.

Yep, it makes sense to support the same range of expressions as PHP does. But it feels out of scope for this PR; feel free to follow up with a fix :)

I will; this PR is merged, so there will be no conflicts. One question: why is the grammar defined in two places, doc/grammars/type.abnf and src/Lexer/Lexer.php? Is one file generated from the other?

doc/grammars/type.abnf is mostly for documentation purposes and it's very likely out of sync. There's also FuzzyTest using it but I'm not sure about its purpose.
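
As a rough sketch of the follow-up discussed in this thread, the ConstantFloatExp rule from the diff can be written as a regex and extended with an optional "+" sign. The names EXPONENT_PATTERN and isFloatExponent() are hypothetical, and this is not the pattern used in src/Lexer/Lexer.php.

```php
<?php

// Hypothetical regex mirroring the ABNF rule
//   "e" ["-"] 1*ByteDecDigit *("_" 1*ByteDecDigit)
// with the sign widened from ["-"] to an optional "+" or "-".
// ABNF string literals are case-insensitive, hence the "i" flag.
const EXPONENT_PATTERN = '/\Ae[+-]?[0-9]+(?:_[0-9]+)*\z/i';

// Hypothetical helper: true if $text is a well-formed exponent part.
function isFloatExponent(string $text): bool
{
    return preg_match(EXPONENT_PATTERN, $text) === 1;
}
```

Under these assumptions, "e10", "E-1_000" and "e+5" are accepted, while "e_5" and "e5_" are not.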

ondrejmirtes · 2023-04-28T07:31:37Z

Thank you very much! I'm gonna release this as part of phpdoc-parser 1.21 but it's not going to be right away, because I just did this massive amount of work 308c57c which I still need to improve a bit :)

jiripudil · 2023-04-28T17:29:18Z

Sure, no rush :)

mvorisek · 2023-04-29T11:31:53Z

src/Parser/ConstExprParser.php

@@ -47,7 +48,7 @@ public function parse(TokenIterator $tokens, bool $trimStrings = false): Ast\Con

 			return $this->enrichWithAttributes(
 				$tokens,
-				new Ast\ConstExpr\ConstExprFloatNode($value),
+				new Ast\ConstExpr\ConstExprFloatNode(str_replace('_', '', $value)),


What is the reason for str_replace here? It seems there are no other text transformations done in general, for example \d\. is not normalized to \d\.\d etc.

Keeping the underscore here would be a BC break because phpstan-src simply casts this value to int or float, and (int) '1_000_000' evaluates to 1.
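
A short sketch of why the cast matters: PHP's string-to-number conversion stops at the first character that is not part of a number, so the underscore truncates the value. The variable names are for illustration only.

```php
<?php

// A literal as it might appear in a PHPDoc type, separators included.
$raw = '1_000_000';

// Casting the raw string stops parsing at the first underscore.
$castRaw = (int) $raw;

// Stripping the separators first preserves the intended value.
$castStripped = (int) str_replace('_', '', $raw);

// The same applies to floats: only the leading "1" is read.
$floatRaw = (float) '1_000.5';
$floatStripped = (float) str_replace('_', '', '1_000.5');

var_dump($castRaw, $castStripped, $floatRaw, $floatStripped);
// int(1), int(1000000), float(1), float(1000.5)
```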

ondrejmirtes force-pushed the 1.9.x branch 2 times, most recently from d25945f to 57f6787 Compare April 22, 2023 09:06

mvorisek reviewed Apr 26, 2023

ConstExprParser: support numeric literal separator

65e237e

ondrejmirtes force-pushed the numeric-literal-separator branch from 9785f59 to 65e237e Compare April 28, 2023 07:30

ondrejmirtes merged commit 0b4de96 into phpstan:1.9.x Apr 28, 2023

jiripudil deleted the numeric-literal-separator branch April 28, 2023 17:20

mvorisek reviewed Apr 29, 2023

jiripudil commented Apr 20, 2023

mvorisek Apr 26, 2023

jiripudil Apr 28, 2023

mvorisek Apr 28, 2023

ondrejmirtes Apr 29, 2023

ondrejmirtes commented Apr 28, 2023

jiripudil commented Apr 28, 2023

mvorisek Apr 29, 2023

jiripudil May 1, 2023
