@kalessin
Contributor

Allow defining field extractors over text content only, without the need to re-parse. Added tests and fixed the existing ones, since a special flag was added to HtmlDataFragment objects.

Member

Why call text_all.strip() instead of simply passing text_all? Do we need to remove whitespace?

Contributor Author

Yes, strictly speaking there is no need to strip. I will remove it.
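
For readers following the strip() question, here is a minimal sketch of what the call changes. It is not the code from this pull request: `apply_text_extractor`, `first_number`, and `identity` are hypothetical names made up for illustration, and only `text_all` comes from the discussion above. The point is that `str.strip()` removes leading and trailing whitespace only, so many extractors return the same value either way, while a whitespace-sensitive extractor does not.

```python
import re


def apply_text_extractor(text_all, extractor):
    """Hypothetical helper: run an extractor on already-extracted text,
    with no HTML re-parsing step in between."""
    return extractor(text_all)


def first_number(text):
    """Illustrative extractor: return the first run of digits, or None."""
    match = re.search(r"\d+", text)
    return match.group(0) if match else None


def identity(text):
    """Illustrative whitespace-sensitive extractor: return the text unchanged."""
    return text


text_all = "  Price: 42 USD \n"

# A regex-based extractor is unaffected by surrounding whitespace.
raw_number = apply_text_extractor(text_all, first_number)
stripped_number = apply_text_extractor(text_all.strip(), first_number)

# A whitespace-sensitive extractor sees a difference.
raw_text = apply_text_extractor(text_all, identity)
stripped_text = apply_text_extractor(text_all.strip(), identity)
```

Passing `text_all` unmodified leaves the decision about whitespace to each extractor, which appears to be the motivation for dropping the call.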

…d to re parsing. Added tests and fixed current ones, as a special flag was added on HtmlDataFragment objects
@kalessin
Contributor Author

I made the changes you suggested. Please check.

shaneaevans added a commit that referenced this pull request Feb 16, 2012
allow defining of field extractors over text content only, without the need to re-parse.
@shaneaevans shaneaevans merged commit 741340a into scrapy:master Feb 16, 2012

2 participants