Urls surrounded by parenthesis with a path are being parsed incorrectly. #10

lextoumbourou · 2015-05-06T07:31:16Z

This works (and you've got a test for it ❤️):

In [1]: from ttp import ttp
In [2]: p = ttp.Parser()
In [3]: res = p.parse("test (http://example.com)")
In [4]: res.urls
Out[4]: ['http://example.com']
In [5]: res.html
Out[5]: u'test (<a href="http://example.com">http://example.com</a>)'

However, this does not:

In [6]: res = p.parse("test (http://example.com/directory)")
In [7]: res.urls
Out[7]: ['http://example.com/directory)']
In [8]: res.html
Out[8]: u'test (<a href="http://example.com/directory)">http://example.com/directory)</a>'

Note that the closing parenthesis is being treated as part of the URL, in both `res.urls` and the generated link.

Thanks,

Lex

docapotamus · 2015-07-03T15:36:06Z

This isn't necessarily a bug, as `)` is a valid character in a URL path.
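To illustrate the point, here is a small check using only the standard library (Python 3's `urllib.parse`, not ttp itself). The Wikipedia-style URL is just an example chosen for illustration; it shows why a parser cannot blindly drop a trailing `)`.

```python
from urllib.parse import urlsplit

# A trailing ")" is kept as part of the path by the standard library.
print(urlsplit("http://example.com/directory)").path)
# /directory)

# Real-world URLs sometimes end in a parenthesis that belongs to the path.
wiki = "https://en.wikipedia.org/wiki/Python_(programming_language)"
print(urlsplit(wiki).path)
# /wiki/Python_(programming_language)
```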

edmondburnett · 2019-04-17T23:16:59Z

Since parentheses are valid within URLs, perhaps we could strip the closing paren only in one specific case: when there is an opening parenthesis immediately before the protocol, i.e. `(http://`, AND a closing parenthesis at the end of the URL. In that case we can probably assume the user intended to enclose the URL in parens, so the trailing `)` is not part of it.
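A minimal sketch of that rule, assuming we have the text and the start/end offsets of a matched URL. The names `strip_enclosing_paren`, `extract_urls`, and `URL_RE` are hypothetical, and the regex is a deliberately simplified stand-in, not ttp's actual URL pattern.

```python
import re

# Simplified stand-in for a URL matcher; NOT the pattern ttp uses.
URL_RE = re.compile(r"https?://\S+")


def strip_enclosing_paren(text, start, end):
    """Return the end offset of the URL text[start:end], dropping one
    trailing ")" only when "(" sits immediately before the URL."""
    preceded_by_open = start > 0 and text[start - 1] == "("
    ends_with_close = end > start and text[end - 1] == ")"
    if preceded_by_open and ends_with_close:
        return end - 1
    return end


def extract_urls(text):
    """Find URLs in text and apply the enclosing-paren rule to each."""
    urls = []
    for match in URL_RE.finditer(text):
        end = strip_enclosing_paren(text, match.start(), match.end())
        urls.append(text[match.start():end])
    return urls


print(extract_urls("test (http://example.com/directory)"))
# ['http://example.com/directory']
print(extract_urls("see http://example.com/a_(b)"))
# ['http://example.com/a_(b)']
```

With this rule a URL that legitimately ends in `)` is left alone unless it directly follows `(`. It would not help with text like `(see http://example.com/x)`, where the opening paren is not adjacent to the protocol; handling that would need a different heuristic, such as counting unbalanced parens.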

docapotamus · 2019-07-03T21:03:58Z

@edmondburnett I think that sounds like a sensible solution.

lextoumbourou changed the title from ~~Urls surrounded by parenthesis with a path are being parsed correctly.~~ to "Urls surrounded by parenthesis with a path are being parsed incorrectly." on May 6, 2015
