Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

just some ideas #14

Closed
digitalist opened this issue Jul 24, 2018 · 5 comments
Closed

just some ideas #14

digitalist opened this issue Jul 24, 2018 · 5 comments

Comments

@digitalist
Copy link

collecting of /ads.txt
and https:// certs info

may be useful

@s0md3v
Copy link
Owner

s0md3v commented Jul 24, 2018

Thanks for the suggestions, I really appreciate it but I have two questions to ask :')

  1. How do you find out if a javascript file/code runs ads and how is that relevant?
  2. Don't you think that falls out of the scope for a crawler?

@digitalist
Copy link
Author

digitalist commented Jul 24, 2018

nah, just checking for ads.txt file, as in 'robots.txt' - it's an RTB standard (example: https://www.gazeta.ru/ads.txt)
checking for https cert is useful for some activities (antifraud etc.).

It's not hard to implement, maybe I should find time for a pull request

@s0md3v
Copy link
Owner

s0md3v commented Jul 24, 2018

It's not about the difficulty of implementation, it's about relevance. A crawler isn't supposed to check SSL certificates.

@digitalist
Copy link
Author

digitalist commented Jul 24, 2018

but as a crawler it CAN check for their presence/grab them - as an important metadata [intel]

@s0md3v
Copy link
Owner

s0md3v commented Jul 24, 2018

Sorry but I am closing this issue, I don't think it's relevant at all.

  1. ads.txt isn't that common
  2. Grabbing SSL certificates is out of scope

Thanks for the suggestions tho, let me know if you have some more ^_^

@s0md3v s0md3v closed this as completed Jul 24, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants