New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

Sign up for GitHub

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Jump to bottom

What happens when your crawler meets an https: link ex: <a href-"https://www.test.com> #8

Closed

satish28 opened this issue Sep 22, 2014 · 1 comment

Contributor

satish28 commented Sep 22, 2014

I found the following in your "re.match(r'http+', link)" code. Does this mean you ignore https links if found?

dragonfist453 commented Jul 4, 2020

http+ in regular expression also by default includes https because that's how re works. So it will be fine!

Dineshs91 closed this as completed

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment