Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Ignoring links in link tag ending with .css or .js #112

Merged
merged 1 commit into from
Aug 2, 2017

Conversation

anudeepti2004
Copy link
Collaborator

Currently it is ignoring all links in the link tag ending with .css and .js.
I noticed that the script tag links were not getting added to the array list in the PaginaUrl class but links were getting added via link tag so made a check there.
Currently the code checks for ".css" rather than just "css". Hence, we have definitely reduced a lot of links which end with .css but we will still get links like "http://fonts.googleapis.com/css?family=Oswald%3A400%2C700%2C300" in the frontier. Should I block such links too?

@coveralls
Copy link

Coverage Status

Coverage decreased (-0.01%) to 46.448% when pulling c8164ea on anudeepti2004:ignoring_non_html_pages into 61c627d on ViDA-NYU:master.

@aecio aecio merged commit 685af4d into VIDA-NYU:master Aug 2, 2017
@anudeepti2004 anudeepti2004 deleted the ignoring_non_html_pages branch August 2, 2017 14:47
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants