-
Notifications
You must be signed in to change notification settings - Fork 89
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Parsing error after X responsive pages #14
Comments
It looks like by default Elasticsearch 2.x limits results to 10K results and has the following to say:
We can increase the number some, but probably not infinitely. If you want to iterate over a very large set of results like this, you are going to be better off using the SQL or EXPORT API endpoints. |
I guess I'm surprised we didn't notice this until now -- maybe I haven't noticed error messages, but it also might be a recent issue. It sounds too basic for it to be recent, but I at least wanted to raise the possibility. Given the issue, I do think Censys could supply a more precise/graceful error message than:
You probably want to point users to a different API instead. Since the REST API is the most user/developer-friendly API to Censys data, I would suggest raising the page limit to whatever Censys.io can reasonably support. |
We recently migrated from Elasticsearch 1.6 to 2.4, which is when this would have changed. I'll update the number of results to 25,000, update the documentation, and change this to be a better error messages. This is the first time this has come up in this particular API. You can also export results through the REST API, there's just a slightly different set of semantics: https://censys.io/api/v1/docs/export. |
Thanks @zakird, and thank you for your efforts on Censys. 👍 |
I just wanted to note that this error message should now describe the problem and paths forward. Example at https://censys.io/certificates?q=*&page=401. |
Thank you! |
domain-scan's
gather
feature queries the Censys API to collect .gov subdomains. The queryparsed.subject.common_name:".gov" or parsed.extensions.subject_alt_name.dns_names:".gov"
works on API pages 1-100, but errors out on any page ≥ 101:This same query on the web indicates 7,661 responsive pages:
...but any page ≥ 401 produces an error:
cc: @konklone
The text was updated successfully, but these errors were encountered: