Add index to XPath information of XDocument equivalence failure message #1181

lg2de · 2019-11-07T21:51:06Z

I think the XDocument equivalence failure message is not helpful when comparing documents with repeated element names, such as HTML documents. A plain XPath can make it very difficult to identify the element that differs:
/html/body/table/tr/td

This PR proposes improving the failure message by adding an index (following XPath syntax) whenever an element name occurs for at least the second time on the same level:
/html/body/table[2]/tr[17]/td[5]
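
To illustrate the indexing rule, here is a minimal sketch that computes such a path for an element of an in-memory `XDocument`. It is an assumption-laden illustration, not the code of this PR: the PR tracks the position while reading the document, whereas this sketch walks an already loaded tree, and `XPathSketch` and `IndexedPath` are hypothetical names.

```csharp
using System;
using System.Linq;
using System.Xml.Linq;

// Hypothetical helper, not part of FluentAssertions.
public static class XPathSketch
{
    // Builds an XPath-like location for the element. An index is only
    // written when the element is at least the second one with that name
    // among its siblings, as proposed in the PR description.
    public static string IndexedPath(XElement element)
    {
        var segments = element
            .AncestorsAndSelf()   // element first, root last
            .Reverse()            // root first, element last
            .Select(e =>
            {
                // 1-based position among preceding siblings with the same name.
                int position = e.ElementsBeforeSelf(e.Name).Count() + 1;
                return position > 1
                    ? $"{e.Name.LocalName}[{position}]"
                    : e.Name.LocalName;
            });

        return "/" + string.Join("/", segments);
    }
}
```

For the document `<html><body><table/><table><tr><td/><td/></tr></table></body></html>`, calling `XPathSketch.IndexedPath` on the last `td` element yields `/html/body/table[2]/tr/td[2]`.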

Additionally:

- I recommend a new editorconfig setting that matches the current code style.
- I renamed a test to match its content.

Src/FluentAssertions/Xml/XmlReaderValidator.cs

jnyrup

Great idea 👍
I haven't been using the xml assertions much, but having the index adds value.

Src/FluentAssertions/Xml/XmlReaderValidator.cs

lg2de · 2019-11-09T13:31:07Z

Do you think that this change to the error message is a breaking change?
If users assert on the XPath using WithMessage, existing tests may fail.

jnyrup · 2019-11-09T23:17:26Z

While it technically can be a breaking change, as in tests failing after upgrading the package, I don't recall we've considered improvements/changes to failure messages as breaking changes.

Tests/Shared.Specs/XDocumentAssertionSpecs.cs

dennisdoomen · 2019-11-12T01:36:21Z

> While it technically can be a breaking change, as in tests failing after upgrading the package, I don't recall we've considered improvements/changes to failure messages as breaking changes.

Why would a test fail? Because somebody observes FA's failure messages in an exception assertion?

lg2de · 2019-11-12T10:35:26Z

> > While it technically can be a breaking change, as in tests failing after upgrading the package, I don't recall we've considered improvements/changes to failure messages as breaking changes.
>
> Why would a test fail? Because somebody observes FA's failure messages in an exception assertion?

While creating an example I found that my question is not relevant.
Only tests that assert on FA's failure message can fail because of this change, and that is not a normal use case.

jnyrup · 2019-11-15T16:11:08Z

I came to think of another way to combine the Stack and Dictionary into a single graph that keeps track of the xpath indices.

jnyrup@711a7f9

Some very primitive benchmarks:

N is the depth of the xml tree.
xml tags are named a0...aN

10 children per node.

This PR

|                     Method | N |         Mean |      Error |     StdDev | Ratio |    Gen 0 | Gen 1 | Gen 2 |  Allocated |
|--------------------------- |-- |-------------:|-----------:|-----------:|------:|---------:|------:|------:|-----------:|
| BeEquivalentTo             | 1 |     5.234 us |  0.1092 us |  0.1214 us |  1.00 |   1.3351 |     - |     - |    4.11 KB |
|                            |   |              |            |            |       |          |       |       |            |
| BeEquivalentTo             | 2 |    25.672 us |  0.2558 us |  0.2393 us |  1.00 |   4.6082 |     - |     - |   14.17 KB |
|                            |   |              |            |            |       |          |       |       |            |
| BeEquivalentTo             | 5 | 3,828.342 us | 30.7972 us | 24.0445 us |  1.00 | 648.4375 |     - |     - | 1995.18 KB |

My branch

|                     Method | N |         Mean |      Error |     StdDev | Ratio |    Gen 0 | Gen 1 | Gen 2 | Allocated |
|--------------------------- |-- |-------------:|-----------:|-----------:|------:|---------:|------:|------:|----------:|
| BeEquivalentTo             | 1 |     2.987 us |  0.0256 us |  0.0227 us |  1.00 |   1.0529 |     - |     - |   3.24 KB |
|                            |   |              |            |            |       |          |       |       |           |
| BeEquivalentTo             | 2 |    12.788 us |  0.2393 us |  0.2122 us |  1.00 |   2.9602 |     - |     - |   9.14 KB |
|                            |   |              |            |            |       |          |       |       |           |
| BeEquivalentTo             | 5 | 1,558.388 us | 16.2188 us | 13.5434 us |  1.00 | 289.0625 |     - |     - | 890.32 KB |

4 children per node.

This PR

|                     Method |  N |            Mean |         Error |        StdDev | Ratio |       Gen 0 | Gen 1 | Gen 2 |     Allocated |
|--------------------------- |--- |----------------:|--------------:|--------------:|------:|------------:|------:|------:|--------------:|
| BeEquivalentTo             |  3 |        71.90 us |      1.375 us |      1.351 us |  1.00 |     11.8408 |     - |     - |      36.59 KB |
|                            |    |                 |               |               |       |             |       |       |               |
| BeEquivalentTo             |  5 |     1,346.27 us |     36.026 us |     30.084 us |  1.00 |    224.6094 |     - |     - |     694.78 KB |
|                            |    |                 |               |               |       |             |       |       |               |
| BeEquivalentTo             | 10 | 1,871,360.79 us | 27,158.727 us | 24,075.504 us |  1.00 | 328000.0000 |     - |     - | 1009116.03 KB |

My branch

|                     Method |  N |          Mean |         Error |       StdDev |        Median | Ratio |       Gen 0 | Gen 1 | Gen 2 |    Allocated |
|--------------------------- |--- |--------------:|--------------:|-------------:|--------------:|------:|------------:|------:|------:|-------------:|
| BeEquivalentTo             |  3 |      34.01 us |      0.678 us |     1.170 us |      33.47 us |  1.00 |      7.0190 |     - |     - |     21.63 KB |
|                            |    |               |               |              |               |       |             |       |       |              |
| BeEquivalentTo             |  5 |     534.87 us |      6.195 us |     5.492 us |     535.04 us |  1.00 |    101.5625 |     - |     - |    312.93 KB |
|                            |    |               |               |              |               |       |             |       |       |              |
| BeEquivalentTo             | 10 | 592,578.85 us | 10,082.794 us | 8,419.590 us | 594,102.50 us |  1.00 | 103000.0000 |     - |     - | 317702.62 KB |

dennisdoomen · 2019-11-22T08:06:39Z

@jnyrup do you want to do anything with those performance results? Or can this be merged?

lg2de · 2019-11-22T08:09:27Z

> @jnyrup do you want to do anything with those performance results? Or can this be merged?

Good question.

So far, I was unsure what to do.
I have not had time to analyze in detail what @jnyrup has changed.
If performance is better, then I'm fine.
Should I merge it into "my" PR?

jnyrup · 2019-11-22T09:07:50Z

@lg2de You can merge the commit into this PR.
Its commit message (hopefully) describes the idea behind the visit count graph.
In the end it does the same as your algorithm, but more lazily to avoid stack iterations, string allocations and dictionary lookups.

Use a counting graph to keep track of xpath index

Currently we use a stack to keep track of the location and a dictionary to keep track of the xpath index.
That has multiple downsides:

* We need to keep two separate data structures in sync.
* We're doing a lot of string allocations on each iteration when computing the path as the key for the dictionary.
* When creating the failure message we're doing lookups in two data structures.

Instead we can combine their purposes in a single graph where each node has a `Name` and a `Count`, where the latter is updated each time we visit a node with the same path.

On each xml traversal we now only need to find the matching sibling or insert a new child.

Creating the failure message is simply the traversal from the current node to the root node, in reverse order.

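
The idea in the commit message can be sketched roughly as follows. This is not the code from the referenced commit; `PathNode`, `Enter` and `GetXPath` are hypothetical names, and the sketch assumes that the counts below a node can be reset each time an element with that path starts again.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical sketch of a visit-count graph; all names are illustrative.
public sealed class PathNode
{
    private readonly List<PathNode> children = new List<PathNode>();

    private PathNode(PathNode parent, string name)
    {
        Parent = parent;
        Name = name;
    }

    public static PathNode CreateRoot() => new PathNode(null, null);

    public PathNode Parent { get; }
    public string Name { get; }
    public int Count { get; private set; }

    // Called when an element called `name` starts below this node:
    // find the matching child or insert a new one, then bump its count.
    public PathNode Enter(string name)
    {
        PathNode child = children.Find(c => c.Name == name);
        if (child is null)
        {
            child = new PathNode(this, name);
            children.Add(child);
        }

        // Assumption: a repeated element starts with fresh counts below it.
        child.children.Clear();
        child.Count++;
        return child;
    }

    // Walks from the current node up to the root and emits the segments
    // in reverse order (root first).
    public string GetXPath()
    {
        var segments = new Stack<string>();
        for (PathNode current = this; current.Parent is object; current = current.Parent)
        {
            segments.Push(current.Count > 1
                ? $"{current.Name}[{current.Count}]"
                : current.Name);
        }

        return "/" + string.Join("/", segments);
    }
}
```

A reader loop would call `Enter` on the current node for each start element and move back to `Parent` on each end element. Entering `html`, `body`, `table`, `table` again (after leaving the first), `tr`, `td` and `td` again gives `/html/body/table[2]/tr/td[2]` from `GetXPath()`, without building a string key per element.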
lg2de · 2019-11-29T11:40:13Z

@jnyrup why do you prefer `while (current.parent is object)` over `while (current.parent != null)`?

dennisdoomen · 2019-11-29T11:55:43Z

The second one can invoke an overloaded != operator. The first one is a newer C# construct that makes it immediately clear what you're expecting (provided you know the construct ;-))

jnyrup · 2019-11-29T11:58:58Z

https://intellitect.com/check-for-null-not-null/ has a nice write-up of different ways to compare against null.

By using `is object`, `is null` or `is {}` I don't have to care whether a potentially overloaded == or != operator does proper null checking.

Here's an example of the differences for value and reference types:
(The `{}` is an empty property pattern.)
https://sharplab.io/#v2:CYLg1APgAgTAjAWAFBQMwAJboMLoN7LpGYYBGA9uQDboBqAhlQK4CmAKgJ4AOLcAFAEsAdgBd0AgJToAvAD5x4gM7pypAFYsAxiIDchYvqJp0FanUatOPGINHipchQEJp6IUypU9SYukMkTShoGZnZuFlRbMUkZeQElfABfb19/f2NTGgAlFgAzFgAnFiFNMJ5+XBjHeIFlVQ1tFIMfYgyg9Bz8opKylhtKhzj0FzcPLzSWozJ2zsLi0qsIvgHYhVqkpqJ/AHpt9AApJkUxEXJ0FgAPFgBbLioBXI50ehMmAHM3p4FbqhvikXoIgE5CE6QwUDgADZAmZyDwCoDyAVpNJls8ADQ4EyDdAiAAWBXIAHc3CwSQA5MazbqlACiF1KXCBIL4Ek2AQh0MyKnhiIKLjR9ExuFIOKcfBeKOx3kSQA===
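
As a small illustration of the point about overloaded operators, the following sketch uses a deliberately broken class. `Weird` and `NullCheckDemo` are hypothetical names made up for this example; they do not come from FluentAssertions.

```csharp
using System;

// Hypothetical class whose equality operators ignore their operands.
public sealed class Weird
{
    public static bool operator ==(Weird left, Weird right) => true;
    public static bool operator !=(Weird left, Weird right) => false;

    public override bool Equals(object obj) => true;
    public override int GetHashCode() => 0;
}

public static class NullCheckDemo
{
    // Returns the result of three ways to test the same reference.
    public static (bool viaOperator, bool viaIsObject, bool viaIsNull) Check(Weird value) =>
        (
            value != null,    // calls the overloaded != operator
            value is object,  // pattern match, ignores the overloads
            value is null     // pattern match, ignores the overloads
        );
}
```

For a non-null instance, `value != null` returns `false` here because the overload says so, while `value is object` returns `true`. The pattern-based checks never call the user-defined operators.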

lg2de · 2019-11-29T15:59:19Z

Mmh, understood. It feels overengineered to me.

I've applied jnyrup's improvement (leaving the mentioned statement) and did some cleanup.
Would you like to merge now? Or are you still missing something?

jnyrup · 2019-11-29T16:13:24Z

Is that

> leaving the mentioned statement

referring to this?

// starting new element, add local name to location stack
// to build XPath info

If so, everything looks good to me 👍

lg2de · 2019-11-29T18:32:03Z

No, sorry.
I was referring to the null check.
;)

jnyrup · 2019-12-05T16:14:28Z

@dennisdoomen whenever you're ready ;)

dennisdoomen

This class needs a bit of love. I'll take a look after merging this one.

lg2de · 2019-12-07T12:06:41Z

> This class needs a bit of love. I'll take a look after merging this one.

What do you mean?

I just want to note that I am thinking about providing a new implementation for XDocument comparison.
Recently I have worked a lot with XDocument/XElement and FluentAssertions, and I find the output is still sometimes not helpful. I would like to get results like those from BeEquivalentTo on complex classes. I hope I'll find time for this soon...

add index to XPath information of XDocument equivalence failure message

aa1fcad

dennisdoomen requested changes Nov 8, 2019

dennisdoomen changed the title ~~add index to XPath information of XDocument equivalence failure message~~ Add index to XPath information of XDocument equivalence failure message Nov 8, 2019

jnyrup reviewed Nov 8, 2019

Src/FluentAssertions/Xml/XmlReaderValidator.cs

jnyrup reviewed Nov 10, 2019

Tests/Shared.Specs/XDocumentAssertionSpecs.cs

lg2de added 2 commits November 11, 2019 22:15

Merge remote-tracking branch 'upstream/master' into XElementDescriptive

7f409b2

rework according to the review

59be23e

lg2de requested a review from dennisdoomen November 11, 2019 21:46

jnyrup approved these changes Nov 12, 2019

cleanup

ad407d4

dennisdoomen approved these changes Dec 6, 2019

dennisdoomen merged commit fbb9f17 into fluentassertions:master Dec 6, 2019

lg2de deleted the XElementDescriptive branch December 7, 2019 11:58
