Node diff #25

edubkendo · 2013-09-04T05:52:14Z

There is a lot here, so please ask about anything. @enebo please look especially at my tests. I feel like they are brittle and I could really use some pointers on how to improve them.

In a twist of irony, just adding isSame() to all the nodetypes ended up being all I really needed to fix up rsense. Have not yet implemented equals() but that should be easy enough, since it is just adding a comparison of position info and then checking isSame().

…ode_diff

Undoes hours of overthinking things.

Finally found the right data structure for collecting and returning the diff information.

Begin adding `#isSame()` methods to all Node classes which need it for Diff. These two were put in place to test out some prototype Diff code, but will work.

Wrote the main body of the diffing algorithm. There are many conditionals and it may be possible to simplify or refactor, but it provides the framework for building out a semantic diffing utility.

We cycle through the diff to see if any of the Changes stored were actually just moved. Will probably actually store these in a separate object, but for now we are not.

This is a big improvement, but I've only just realized that a part of my approach has been wrongheaded. I need a way to measure the distance between two nodes, and can probably discard these ideas about complexity and depth. A simple levenshtein distance taken on the raw strings may be enough, or I may have to come up with some measurement specific to an AST.

It was not well thought out, was over complicating things, and producing inaccurate results. Easier to just go ahead and toss them in as an insertion and deletion, and then match up matches later.

This should have been 3 separate commits but the earlier commits didn't go through for some reason. The isSame stuff is fairly minor, but the NodeDiff and SequenceMatcher changes bring major improvements to the results.

Node diff. I will merge and tweak/edit as needed. Cool stuff.

edubkendo added 30 commits July 22, 2013 15:52

Initial work on NodeDiff

e6261ca

Moves NodeDiff, creates SequenceMatcher

b841e8b

Merge branch 'master' of https://github.com/jruby/jruby-parser into n…

fc6e68c

…ode_diff

Skeleton for Diff functionality.

cc2a9de

Begins documenting new classes.

ccb58c9

Merge branch 'master' of https://github.com/jruby/jruby-parser into n…

aa1e35a

…ode_diff

Begin adding specs for NodeDiff and SequenceMatcher

8089537

Flatten children methods and test.

4499068

Undoes hours of overthinking things.

Count node complexity.

7ef8b38

Creates Change class.

565f909

Finally found the right data structure for collecting and returning the diff information.

Adds isSame

5f5c699

Begin adding `#isSame()` methods to all Node classes which need it for Diff. These two were put in place to test out some prototype Diff code, but will work.

Diffing algorithm.

5240136

Wrote the main body of the diffing algorithm. There are many conditionals and it may be possible to simplify or refactor, but it provides the framework for building out a semantic diffing utility.

Check our diff for moved changes

53fcc59

We cycle through the diff to see if any of the Changes stored were actually just moved. Will probably actually store these in a separate object, but for now we are not.

Some cleanup based on @enebo 's recommendations.

092656a

Continue filling out diff functionality. Getting closer.

494e7a0

More diffing based changes.

a0362b7

Changes to SequenceMatcher for Diff

4b7bef8

Begin filling in isSame() method for each nodetype.

c3d7009

Merge remote-tracking branch 'remotes/upstream/master' into node_diff

e76c18a

More isSame().

7c6a3c9

A whole bunch more nodes with isSame() methods.

39eb122

Finishes adding isSame().

0fd39d2

Integration work + cleans up toString()

b8c1cf3

Remove the findDistance() stuff.

baf8c71

It was not well thought out, was over complicating things, and producing inaccurate results. Easier to just go ahead and toss them in as an insertion and deletion, and then match up matches later.

isSame() tweaks, better subdiff, and match methods up

b35bbfc

This should have been 3 separate commits but the earlier commits didn't go through for some reason. The isSame stuff is fairly minor, but the NodeDiff and SequenceMatcher changes bring major improvements to the results.

Merge branch 'break-and-return' into node_diff

7ea4785

Refactors NodeDiff and improves the Documentation

94b7bcd

Specify cast.

9418af3

Nullcheck.

7c63a02

enebo added a commit that referenced this pull request Sep 4, 2013

Merge pull request #25 from edubkendo/node_diff

d64fb6d

Node diff. I will merge and tweak/edit as needed. Cool stuff.

enebo merged commit d64fb6d into jruby:master Sep 4, 2013

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Node diff #25

Node diff #25

Uh oh!

edubkendo commented Sep 4, 2013

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Node diff #25

Node diff #25

Uh oh!

Conversation

edubkendo commented Sep 4, 2013

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants