Add token index #12

instabledesign · 2017-11-01T14:35:43Z

Hi, i recently work on new project Xpression

My need is to resetPosition at token index but the $token['position'] was the string position of this token in the input string.

My actual workaround was to keep the lexerI index in my Parser code and reset it each time i need it.

So i think if the token index was store in the token i can get it easily with $token['index']

Thank to read.

instabledesign · 2017-11-30T10:27:52Z

Ping @beberlei @stof just to have answer to process or close it..

stof · 2017-11-30T11:44:16Z

I'm not maintainer on this project

instabledesign · 2017-11-30T12:43:51Z

ok sorry

jwage · 2018-04-11T04:58:40Z

Thanks for the change. I think this makes sense and it should be BC. Can you add a unit test?

jwage · 2018-04-16T21:25:18Z

Thanks for making this change and adding the tests! I will merge a little later and I am going to follow this PR up with another to add Travis CI.

Majkl578 · 2018-05-13T22:05:54Z

Unfortunately this change is a BC break and needs to be reverted. :/
It has completely broken egeloen/ivory-serializer: https://gist.github.com/Majkl578/8c1500fd14884091e65e6af3ddef5c84

Thanks @goetas for spotting this!

jwage · 2018-05-13T23:05:30Z

@Majkl578 I reverted this here #18

I kept the other changes from the PR and just reverted the changed functionality.

jwage · 2018-05-13T23:10:41Z

Looking at the code in https://github.com/egeloen/ivory-serializer/tree/master/src/Type and I don't immediately see what it was depending on that caused the break. I will look more later.

instabledesign · 2018-05-14T07:00:55Z

Hi, i think we get drop the

$this->tokens[$index] = array(

but we can keep

'index' => $index,

jwage · 2018-05-14T13:58:33Z

@instabledesign If you have time, can you look at https://github.com/egeloen/ivory-serializer and see why it broke after this change so that we can add tests to cover it?

instabledesign · 2018-05-14T15:32:17Z

Yes.

instabledesign · 2018-05-14T20:42:35Z

Investigation report:

the TypeLexer has a catchablePattern with group capture '([a-z0-9\\\\]+)' so the captured token is store twice
in second time the TypeLexer do a strict comparison between following token

Working solution :
change the egeloen/ivory-serializer for(...)

for ($i = 0; ($i < $count) || ($token === $nextToken); ++$i) {

for ($i = 0; ($i < $count) || ($token['value'] === $nextToken['value'] && $token['type'] === $nextToken['type'] && $token['position'] === $nextToken['position']); ++$i) {

I'll try to fix the AbstractLexer::$index in order to increment only when the match is a not a capture of previous one but without succeed, and i dont think is a good solution.

instabledesign · 2018-05-18T06:52:27Z

I try to fix the 2 problem from above but theire is some logic to build the fixtures with some private method, so this is not easy to reproduce and fix correctly what is going on! I continue to work on it on my free time.

instabledesign · 2018-06-10T07:19:40Z

tests fixed i ping him in order to merge

instabledesign · 2018-06-13T08:56:16Z

egeloen/ivory-serializer look like not active anymore with only one release (from jan 2017)

@jwage did you plan to create new version with this modification?

jwage · 2018-06-13T14:33:50Z

Did we figure out a way to make the change in this repo so it doesn't break existing implementations? (even if their regex is "wrong")

instabledesign · 2018-06-14T08:41:32Z

First i try with group naming but the group naming doesn't work with preg_split

preg_split('/(?<FOO>=|>|<)|(?<BAR>[a-z]+)|(?<BAZ>\d+)/i', 'price>5', -1, PREG_SPLIT_NO_EMPTY | PREG_SPLIT_DELIM_CAPTURE | PREG_SPLIT_OFFSET_CAPTURE);
/*
array(3) {
  [0]=>
  array(2) {
    [0]=>
    string(5) "price"
    [1]=>
    int(0)
  }
  [1]=>
  array(2) {
    [0]=>
    string(1) ">"
    [1]=>
    int(5)
  }
  [2]=>
  array(2) {
    [0]=>
    string(1) "5"
    [1]=>
    int(6)
  }
}
*/

The second way is to deduplicate the matched element with offset

$matches = preg_split('/((=|>|<)|([a-z]+)|(\d+))/i', 'price>5', -1, PREG_SPLIT_NO_EMPTY | PREG_SPLIT_DELIM_CAPTURE | PREG_SPLIT_OFFSET_CAPTURE);
$offset = null;
$matchesDeduplicate = array_filter($matches, function($item)use(&$offset){
    if (null === $offset) {
        $offset = $item[1];
        return true;
    }
    $filter = $offset !== $item[1];
    $offset = $item[1];
    
    return $filter;
});
/*
array(3) {
  [0]=>
  array(2) {
    [0]=>
    string(5) "price"
    [1]=>
    int(0)
  }
  [2]=>
  array(2) {
    [0]=>
    string(1) ">"
    [1]=>
    int(5)
  }
  [4]=>
  array(2) {
    [0]=>
    string(1) "5"
    [1]=>
    int(6)
  }
}
*/

With 300 tokens match 10 each (9000 tokens) we already have 1Mo memory more consuption
preg_split without deduplicate
preg_split with deduplicate

instabledesign · 2018-07-12T07:43:56Z

what did you think about it @jwage

jwage · 2018-07-12T16:43:44Z

I don't think we can make this change without breaking BC or increasing memory usage as you noted.

instabledesign · 2023-11-17T14:03:01Z

Hi can you consider apply this change on the v2 ?
it was originally revert because of breaking unmaintained libs.
@greg0ire

greg0ire · 2023-11-17T20:11:23Z

If there is a breaking change, then it should go into v4 I'm afraid.

instabledesign · 2023-12-15T21:10:30Z

Im not completely sure it was a BC because it only add a new value in the token details.
Like a explain earlier, the egeloen/ivory-serializer not use properly the lexer, so my change alter his behavior.
For me it was ok is this change go in V3 or at least in V4.

greg0ire · 2023-12-15T21:19:38Z

If it's not a breaking change then you should target v3.1

Add token index

f434145

instabledesign force-pushed the master branch from 7776029 to 2acfa7c Compare April 16, 2018 21:03

implement test with ConcreteLexer example

2808f29

instabledesign force-pushed the master branch from 2acfa7c to 2808f29 Compare April 16, 2018 21:04

jwage approved these changes Apr 16, 2018

View reviewed changes

jwage added the enhancement label Apr 16, 2018

jwage added this to the v1.0.2 milestone Apr 16, 2018

jwage requested a review from guilhermeblanco April 16, 2018 21:37

guilhermeblanco approved these changes Apr 17, 2018

View reviewed changes

guilhermeblanco merged commit 0eda1aa into doctrine:master Apr 17, 2018

instabledesign mentioned this pull request Jun 7, 2018

Fix TypeParser + TypeLexer egeloen/ivory-serializer#26

Open

alcaeus modified the milestones: v1.1.0, 1.0.2 Jun 8, 2019

instabledesign mentioned this pull request Nov 17, 2023

ResetPosition doesn't work by token position #53

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add token index #12

Add token index #12

instabledesign commented Nov 1, 2017

instabledesign commented Nov 30, 2017

stof commented Nov 30, 2017

instabledesign commented Nov 30, 2017

jwage commented Apr 11, 2018 •

edited

Loading

jwage commented Apr 16, 2018

Majkl578 commented May 13, 2018

jwage commented May 13, 2018

jwage commented May 13, 2018

instabledesign commented May 14, 2018 •

edited

Loading

jwage commented May 14, 2018

instabledesign commented May 14, 2018

instabledesign commented May 14, 2018

instabledesign commented May 18, 2018

instabledesign commented Jun 10, 2018

instabledesign commented Jun 13, 2018

jwage commented Jun 13, 2018 •

edited

Loading

instabledesign commented Jun 14, 2018

instabledesign commented Jul 12, 2018

jwage commented Jul 12, 2018

instabledesign commented Nov 17, 2023

greg0ire commented Nov 17, 2023

instabledesign commented Dec 15, 2023

greg0ire commented Dec 15, 2023

Add token index #12

Add token index #12

Conversation

instabledesign commented Nov 1, 2017

instabledesign commented Nov 30, 2017

stof commented Nov 30, 2017

instabledesign commented Nov 30, 2017

jwage commented Apr 11, 2018 • edited Loading

jwage commented Apr 16, 2018

Majkl578 commented May 13, 2018

jwage commented May 13, 2018

jwage commented May 13, 2018

instabledesign commented May 14, 2018 • edited Loading

jwage commented May 14, 2018

instabledesign commented May 14, 2018

instabledesign commented May 14, 2018

instabledesign commented May 18, 2018

instabledesign commented Jun 10, 2018

instabledesign commented Jun 13, 2018

jwage commented Jun 13, 2018 • edited Loading

instabledesign commented Jun 14, 2018

instabledesign commented Jul 12, 2018

jwage commented Jul 12, 2018

instabledesign commented Nov 17, 2023

greg0ire commented Nov 17, 2023

instabledesign commented Dec 15, 2023

greg0ire commented Dec 15, 2023

jwage commented Apr 11, 2018 •

edited

Loading

instabledesign commented May 14, 2018 •

edited

Loading

jwage commented Jun 13, 2018 •

edited

Loading