[WIP] fix xmliter namespace on selected node #861

nramirezuy · 2014-08-19T18:46:08Z

This PR was triggered by scrapy-users

Actually xmliter populates a Selector with everything from the position 0 to the tag start, so if we had 100mb before the tag we want to iter it copy those 100mb across all the Selector objects. Also it just extract this info for the first tag and embed the rest on that, this can cause info crossing.

In this PR I kept the regex stuff even tho I think we should use something like iterparse.

Currently xmliter_lxml tests are failing due to it has a different API.

kmike · 2014-08-29T01:54:47Z

+1 to iterparse

pablohoffman · 2014-10-29T21:16:12Z

what's left for this WIP to become MRG?

nramirezuy · 2014-10-29T21:31:00Z

@pablohoffman Know if we keep this approach or we move to iterparse.

fix xmliter namespace on selected node

2a54020

redapple mentioned this pull request Jul 20, 2017

XMLFeedSpider iternodes iterator does not work on XML document with namespace #2842

Closed

Gallaecio mentioned this pull request Aug 21, 2020

Fix iternodes #4746

Merged

wRAR closed this in #4746 Oct 6, 2020

This was referenced Oct 6, 2020

xmliter_lxml cannot find namespaced node names #4833

Closed

Switch xmliter to iterparse #4834

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] fix xmliter namespace on selected node #861

[WIP] fix xmliter namespace on selected node #861

nramirezuy commented Aug 19, 2014

kmike commented Aug 29, 2014

pablohoffman commented Oct 29, 2014

nramirezuy commented Oct 29, 2014

[WIP] fix xmliter namespace on selected node #861

[WIP] fix xmliter namespace on selected node #861

Conversation

nramirezuy commented Aug 19, 2014

kmike commented Aug 29, 2014

pablohoffman commented Oct 29, 2014

nramirezuy commented Oct 29, 2014