Investigate how publicsuffix.org can be used for the URL regex #14
Also relevant: https://mathiasbynens.be/demo/url-regex and the only code that passed all the tests: https://gist.github.com/dperini/729294
Here's dperini's regex (as of today):
Simplified version (no IPs, no username:password, only http/https):
The above regex (visualization) catches all domain suffixes included in the public suffix list as of today.
This (pseudo-code) regex captures all TLDs in the IANA list:
This was built using [this list of Unicode ranges per script](https://en.wikipedia.org/wiki/Plane_(Unicode)#Basic_Multilingual_Plane) as reference.
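For comparison, here is a rough sketch of a different approach from the Unicode-range regex described above: generating a plain alternation straight from the text of the IANA TLD list. The function name `build_tld_pattern` and the sample data are made up for illustration. It assumes the list has one TLD per line with `#` comment lines, and it keeps internationalized TLDs in their ASCII `xn--` form rather than matching Unicode.

```python
import re

def build_tld_pattern(tld_list_text):
    """Compile a regex matching a trailing '.<tld>' for every TLD in the text."""
    tlds = []
    for line in tld_list_text.splitlines():
        line = line.strip()
        # Skip blank lines and '#' comment lines (e.g. a version header).
        if not line or line.startswith("#"):
            continue
        tlds.append(line.lower())
    # Escape each entry so characters such as '-' are matched literally.
    alternation = "|".join(re.escape(tld) for tld in tlds)
    return re.compile(r"\.(?:" + alternation + r")$", re.IGNORECASE)

# Hypothetical excerpt, not the real list.
SAMPLE_TLDS = """\
# sample only
COM
CO
XN--P1AI
"""

tld_pattern = build_tld_pattern(SAMPLE_TLDS)
print(bool(tld_pattern.search("example.com")))       # True
print(bool(tld_pattern.search("example.xn--p1ai")))  # True
print(bool(tld_pattern.search("example.notatld")))   # False
```

An alternation like this has to be regenerated whenever the list changes, which is the trade-off against a range-based regex.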
The list could be used either to replace the domain identification regex or to test the current implementation against possible edge cases. See https://www.publicsuffix.org/list/
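A minimal sketch of the testing option, assuming the list text has already been downloaded: it parses the rules (lines starting with `//` are comments, and a rule ends at the first whitespace) and reports which plain suffixes a candidate regex fails to match. The helper names, the `SAMPLE` excerpt, and the `ASCII_DOMAIN` regex are hypothetical stand-ins, not the project's actual code.

```python
import re

def parse_public_suffix_list(text):
    """Return the rules from Public Suffix List text, skipping comments and blanks."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("//"):
            continue
        # A rule ends at the first whitespace.
        rules.append(line.split()[0])
    return rules

def unmatched_suffixes(rules, domain_pattern):
    """Return plain suffixes for which 'example.<suffix>' is not fully matched."""
    missed = []
    for rule in rules:
        # Wildcard (*.) and exception (!) rules need separate handling; skipped here.
        if rule.startswith("*.") or rule.startswith("!"):
            continue
        if not domain_pattern.fullmatch("example." + rule):
            missed.append(rule)
    return missed

# Hypothetical excerpt in the list's format, not the real file.
SAMPLE = """\
// sample comment line
com
co.uk
*.ck
!www.ck
рф
"""

# Hypothetical ASCII-only domain regex standing in for the one under test.
ASCII_DOMAIN = re.compile(r"(?:[a-z0-9-]+\.)+[a-z]{2,}")

rules = parse_public_suffix_list(SAMPLE)
print(unmatched_suffixes(rules, ASCII_DOMAIN))  # ['рф']
```

Running this against the full list would surface the non-ASCII suffixes and any other edge cases the current regex misses.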