Respect CDATA[[ sections when parsing HTML #298
Merged
This changes the HTML parser behaviour to properly respect
CDATA sections (<![CDATA[ ... ]]>) and to ignore link tags inside JavaScript code.
The old behaviour can be restored by passing undef as the "marked_sections"
option when creating the WWW::Mechanize object:
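A minimal sketch of that call, assuming WWW::Mechanize is installed and that no other constructor options are needed (the variable name $mech is only illustrative):

```perl
use strict;
use warnings;
use WWW::Mechanize;

# Passing undef for "marked_sections" asks for the old (pre-patch)
# parser behaviour described in this pull request.
my $mech = WWW::Mechanize->new( marked_sections => undef );
```

Leaving the option out keeps the new default, in which CDATA sections are respected.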
The patch also includes a (nasty) test file that checks both the old and
the new behaviour.
See also the discussion at https://perlmonks.org/?node_id=11116478 and https://gist.github.com/haukex/fd76efa16f0b07ce6a7441d9b2265b2a for more context.