Trailing period causes DomainNotAllowed exception #7

paulhammond · 2010-11-15T17:08:49Z

With public_suffix_service 0.7.0 parsing a standard domain works well:

ruby-1.8.7-p249 > PublicSuffixService.parse('example.com')
 => #<PublicSuffixService::Domain:0x101905cc0 @tld="com", @trd=nil, @sld="example">

Adding trailing punctuation correctly returns a DomainInvalid error:

ruby-1.8.7-p249 > PublicSuffixService.parse('example.com,')
PublicSuffixService::DomainInvalid: `example.com,' is not a valid domain
ruby-1.8.7-p249 > PublicSuffixService.parse('example.com:')
PublicSuffixService::DomainInvalid: `example.com:' is not a valid domain

But if the trailing punctuation is a period, the error returned is instead DomainNotAllowed

ruby-1.8.7-p249 > PublicSuffixService.parse('example.com.')
PublicSuffixService::DomainNotAllowed: `example.com.' is not allowed according to Registry policy
ruby-1.8.7-p249 > PublicSuffixService.parse('*.example.com.')
PublicSuffixService::DomainNotAllowed: `*.example.com.' is not allowed according to Registry policy

It's sometimes useful to handle miskeyed input data (DomainInvalid) differently to domains that shouldn't exist (DomainInvalid). For example, in an application I'm working on we ignore DomainInvalid because some of the hostnames are on private networks (eg: host.bigcompany). Without extra error checking code our application will fail to handle a common data input mistake.

Also, example.com. is actually a valid hostname - the trailing . implies a fully qualified domain name.

The text was updated successfully, but these errors were encountered:

weppos · 2010-11-15T18:15:06Z

It's sometimes useful to handle miskeyed input data (DomainInvalid) differently to domains that shouldn't exist (DomainInvalid).

I'm not sure I understood your point. Could you please add a more concrete example?

Also, example.com. is actually a valid hostname - the trailing . implies a fully qualified domain name.

You're right, but this kind of validation goes beyond the purpose of the public suffix list.

paulhammond · 2010-11-15T23:23:51Z

It's sometimes useful to handle miskeyed input data (DomainInvalid) differently to domains that shouldn't exist (DomainInvalid).

I'm not sure I understood your point. Could you please add a more concrete example?

I'm working on improving some code that validates domains where we want to exclude known public suffixes like 'co.uk' but allow things like 'department.bigcompany' for internal-only TLDs inside a company. To do this we're pre-filtering domains, sending it to PublicSuffixService.parse() and catching some of the exceptions raised. Thanks to your code it handles everything we've thrown at it, except for trailing periods.

In debugging the problem I got the exceptions confused, forgetting that PublicSuffixService.parse('host.nonexistant') will throw DomainInvalid not DomainNotAllowed - as a result that whole paragraph doesn't make sense. Sorry for the confusion.

Still, I think it's a bug that this code raises DomainNotAllowed:

PublicSuffixService.parse('example.com.')

Doing this suggests that '.com.' is a known TLD, but 'example' isn't allowed to be registered underneath it, which isn't true.

Tracing the code shows that RuleList.default.find() is finding the com rule, but calling allow('example.com.') on that rule returns false. One piece of the code is flexible about trailing periods, the other isn't.

It seems to me it should either raise DomainInvalid or be valid, depending on whether you think parsing 'example.com.' is within the scope of this library. Raising DomainNotAllowed seems inconsistent and confusing.

weppos · 2010-11-18T12:53:24Z

Raising DomainNotAllowed seems inconsistent and confusing.

I totally agree.

It seems to me it should either raise DomainInvalid or be valid, depending on whether you think parsing 'example.com.'

This is the key point. I need to understand whether the library should be clever enough to handle this, or if I prefer to follow the basic principles of the Public Suffix List.

It seems to me you believe PublicSuffixService should handle it. Is it true?
My only concern is to avoid users' confusion.

paulhammond · 2010-11-29T16:50:53Z

I thought about this and I think PublicSuffixService should not handle example.com. but this should be noted somewhere in the documentation.

I think this is the behavior most people would expect - either because they're unaware that domains can in some cases have trailing periods, or because they're aware of it and want to normalize their domains before checking them against the Public Suffix List (which is what the browsers appear to do).

Also, as you mentioned above, doing domain validation is beyond the scope of the PublicSuffixService, just as you'd expect a developer to remove a trailing comma before passing a domain to the library, it's reasonable for them to remove a trailing period too.

weppos · 2010-12-05T18:42:17Z

Add support for Fully Qualified Domain Names (closed by 4b3f695)

weppos · 2010-12-05T18:44:45Z

Due to the way how the library works, it was much more flexible to add the support for FQDN other than raising some kind of error. The release 0.8.0 correctly detects and parses FQDN.

Thanks for your feedback.

camilo pushed a commit to camilo/public_suffix_service that referenced this issue May 26, 2011

Add support for Fully Qualified Domain Names (closes weppos#7)

4b3f695

This issue was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Trailing period causes DomainNotAllowed exception #7

Trailing period causes DomainNotAllowed exception #7

paulhammond commented Nov 15, 2010

weppos commented Nov 15, 2010

paulhammond commented Nov 15, 2010

weppos commented Nov 18, 2010

paulhammond commented Nov 29, 2010

weppos commented Dec 5, 2010

weppos commented Dec 5, 2010

Trailing period causes DomainNotAllowed exception #7

Trailing period causes DomainNotAllowed exception #7

Comments

paulhammond commented Nov 15, 2010

weppos commented Nov 15, 2010

paulhammond commented Nov 15, 2010

weppos commented Nov 18, 2010

paulhammond commented Nov 29, 2010

weppos commented Dec 5, 2010

weppos commented Dec 5, 2010