Set explicit limits on document retrieval #12

Open
mwatts15 opened this issue Dec 4, 2014 · 0 comments

Currently, the only limits on document retrieval are the defaults imposed by the Java libraries or by the runtime environment. These limits must be made explicit and documented for users.

At a minimum, the following limits should be set:

  • Timeouts for all documents retrieved over HTTP
  • Download rate limitations
  • Request count limitations (possibly tied to an API key)
  • A blacklist of certain domains
  • Content-type restrictions (e.g., ensure that documents fetched are schema-validated XML)
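
A minimal sketch of how some of these limits might be made explicit in Java. The `RetrievalLimits` class, its method names, and all values are hypothetical and not part of the project. It covers HTTP timeouts, a request cap, a domain blacklist, and a content-type allowlist; download rate limiting and XML schema validation are left out.

```java
import java.io.IOException;
import java.net.HttpURLConnection;
import java.net.URI;
import java.util.HashSet;
import java.util.Locale;
import java.util.Set;
import java.util.concurrent.atomic.AtomicInteger;

// Hypothetical holder for explicit retrieval limits; names and values are illustrative.
final class RetrievalLimits {
    private final int connectTimeoutMillis;
    private final int readTimeoutMillis;
    private final int maxRequests;
    private final Set<String> blockedHosts;        // expected in lower case
    private final Set<String> allowedContentTypes; // expected in lower case
    private final AtomicInteger requestCount = new AtomicInteger();

    RetrievalLimits(int connectTimeoutMillis, int readTimeoutMillis, int maxRequests,
                    Set<String> blockedHosts, Set<String> allowedContentTypes) {
        this.connectTimeoutMillis = connectTimeoutMillis;
        this.readTimeoutMillis = readTimeoutMillis;
        this.maxRequests = maxRequests;
        this.blockedHosts = new HashSet<>(blockedHosts);
        this.allowedContentTypes = new HashSet<>(allowedContentTypes);
    }

    // Rejects URIs without a host and hosts on the blacklist.
    boolean isHostAllowed(URI uri) {
        String host = uri.getHost();
        return host != null && !blockedHosts.contains(host.toLowerCase(Locale.ROOT));
    }

    // Compares only the media type, ignoring parameters such as "; charset=UTF-8".
    boolean isContentTypeAllowed(String contentTypeHeader) {
        if (contentTypeHeader == null) {
            return false;
        }
        String mediaType = contentTypeHeader.split(";", 2)[0].trim().toLowerCase(Locale.ROOT);
        return allowedContentTypes.contains(mediaType);
    }

    // Returns true until maxRequests requests have been granted.
    boolean tryAcquireRequest() {
        while (true) {
            int current = requestCount.get();
            if (current >= maxRequests) {
                return false;
            }
            if (requestCount.compareAndSet(current, current + 1)) {
                return true;
            }
        }
    }

    // Opens a connection with explicit timeouts after the scheme, host, and quota checks pass.
    HttpURLConnection open(URI uri) throws IOException {
        String scheme = uri.getScheme();
        if (scheme == null || !(scheme.equalsIgnoreCase("http") || scheme.equalsIgnoreCase("https"))) {
            throw new IOException("Only HTTP(S) URIs are supported: " + uri);
        }
        if (!isHostAllowed(uri)) {
            throw new IOException("Host is blacklisted: " + uri.getHost());
        }
        if (!tryAcquireRequest()) {
            throw new IOException("Request limit exceeded");
        }
        HttpURLConnection conn = (HttpURLConnection) uri.toURL().openConnection();
        conn.setConnectTimeout(connectTimeoutMillis);
        conn.setReadTimeout(readTimeoutMillis);
        return conn;
    }
}
```

A caller would check `isContentTypeAllowed(conn.getContentType())` after connecting and before reading the body. Keeping the limits in one object also gives a single place to document them for users.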