Documentation speaks of re.match while meaning re.search #17

shaneaevans · 2011-09-09T05:21:33Z

Reported by Vasily Alexeev on Trac http://dev.scrapy.org/ticket/328

In link extractor reference we see passages like

"allow (str or list) – a single regular expression (or list of regular expressions) that the (absolute) urls must match in order to be extracted. If not given (or empty), it will match all links."

There's two quite different methods for working with regexps: matching and searching. A quick look in sources reveals that in this case we deal with searching, not matching:

_matches = lambda url, regexs: any((r.search(url) for r in regexs))

So documentation is clearly misleading and should be corrected.

pablohoffman · 2012-09-07T21:39:46Z

This ticket has been open for too long without any specific suggestion on the improvement. I personally don't think that "match" is misleading there, as it's well know in the context of regular expressions, and in no place it refers to the re function itself.

pablohoffman closed this as completed Sep 7, 2012

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Documentation speaks of re.match while meaning re.search #17

Documentation speaks of re.match while meaning re.search #17

shaneaevans commented Sep 9, 2011

pablohoffman commented Sep 7, 2012

Documentation speaks of re.match while meaning re.search #17

Documentation speaks of re.match while meaning re.search #17

Comments

shaneaevans commented Sep 9, 2011

pablohoffman commented Sep 7, 2012