Recommended way to delete by query? #870

braunsonm · 2018-04-01T03:50:36Z

I often have to delete entries which can sometimes exceed 100,000 entries. Thus I have been using:

response = Search().filter('stuff').delete()

But this causes timeout issues elasticsearch.exceptions.ConnectionTimeout: ConnectionTimeout caused by - ReadTimeoutError(HTTPConnectionPool(host='127.0.0.1', port=9200): Read timed out. (read timeout=10))

Is there perhaps a way to let it run and then query every now and then for the status? Just wondering what the usual recommendation is.

The text was updated successfully, but these errors were encountered:

honzakral · 2018-04-01T13:11:51Z

delete_by_query can take quite some time so the recommendation is to either increase the timeout by calling .params(request_timeout=3600) (or some other number higher than 10) to add the parameter to the method call or .params(wait_for_completion=False) to make the api return immediately instead of blocking and waiting.

Hope this helps!

braunsonm · 2018-04-01T16:44:44Z

Thank you very much @honzakral

Is there any reason you'd want to wait for completion? Elasticsearch should get the job done in the background and I can trust that right?

honzakral · 2018-04-01T18:50:40Z

It should, but there can always be errors if something unexpected happens. You will, however, get a task id back from the initial API call and you can use it later (via the low-level API) to query for the job status/results via the tasks API (0).

0 - https://elasticsearch-py.readthedocs.io/en/master/api.html#tasks

honzakral closed this as completed Apr 1, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Recommended way to delete by query? #870

Recommended way to delete by query? #870

braunsonm commented Apr 1, 2018 •

edited

honzakral commented Apr 1, 2018

braunsonm commented Apr 1, 2018

honzakral commented Apr 1, 2018

Recommended way to delete by query? #870

Recommended way to delete by query? #870

Comments

braunsonm commented Apr 1, 2018 • edited

honzakral commented Apr 1, 2018

braunsonm commented Apr 1, 2018

honzakral commented Apr 1, 2018

braunsonm commented Apr 1, 2018 •

edited