
snapshot should work when cluster is in read_only mode. #8102

Closed
webmstr opened this issue Oct 15, 2014 · 11 comments
Assignee: imotov
Labels: >bug, :Distributed/Snapshot/Restore, v2.0.0-beta1

Comments

webmstr commented Oct 15, 2014

I was trying to make a full, consistent backup before an upgrade. A snapshot is taken at a single moment in time, which doesn't work if clients are still updating your indexes.

I tried putting the cluster into read_only mode by setting cluster.blocks.read_only: true, but running a snapshot returned this error:

{"error":"ClusterBlockException[blocked by: [FORBIDDEN/6/cluster read-only (api)];]","status":403}

Please consider allowing snapshots to provide a consistent backup by running when in read-only mode.

@clintongormley

@webmstr Snapshots are still point-in-time even while updates are happening. You don't need to lock anything. A snapshot will only back up the state of the index at the point that the backup starts; it won't take any later changes into account.


webmstr commented Oct 16, 2014

As I mentioned, snapshots, as currently implemented, are an unreasonable way to take a consistent backup prior to an upgrade. This enhancement would have allowed that option.

Without the enhancement, snapshots should not be used before an upgrade, because the indexes may have changed while the snapshot was running. As such, the upgrade documentation should be changed so that it no longer proposes snapshots as backups, and a "full" backup procedure should be documented in its place.

@clintongormley

Out of interest, why don't you just stop writing to your cluster? Reopening for discussion.

@clintongormley

@imotov what are your thoughts?


webmstr commented Oct 17, 2014

I could turn off logstash, but that's just one potential client. Someone could be curl'ing, or using an ES plugin (like head), etc. If you need a consistent backup, you have to disconnect and lock out the clients from the server side.


imotov commented Oct 17, 2014

@clintongormley see #5876; I think this one is similar.

@clintongormley

@imotov thanks, so setting index.blocks.write to true on all indices would be a reasonable workaround, at least until #5855 is resolved.
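As a rough sketch of that workaround (the host is illustrative; the flat settings syntax is accepted by the update-settings API):

```
# Block writes on all existing indices before taking the snapshot
curl -XPUT 'localhost:9200/_all/_settings' -d '{
  "index.blocks.write": true
}'

# ...take the snapshot...

# Re-enable writes afterwards
curl -XPUT 'localhost:9200/_all/_settings' -d '{
  "index.blocks.write": false
}'
```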


saahn commented Nov 12, 2014

@clintongormley Actually, I discovered that the index.blocks.write setting only prevents writes to existing indices. If a client tries to create a new index, that request succeeds, which brings us back to the same problem. My workaround was to shut down the proxy node through which our clients access our ES cluster.

I am running into the same issue as @webmstr, but for a different reason: I cannot create a consistent backup for a restore to a secondary datacenter, because each snapshot takes ~1 hour to complete and we cannot afford to block writes from our clients for such a long period of time.

I am still trying to root-cause why snapshots are taking so long; the time required for snapshot completion increases with each snapshot. However, when I restore the same data to a new cluster, snapshotting that data to a new S3 bucket takes less than a minute.

EDIT: I may have a theory on why the snapshots were taking so long: I was taking a snapshot every two hours, and the S3 bucket has a LOT of snapshots now (49). I'm thinking that the calls the ES AWS plugin makes to the S3 endpoint slow down over time as the number of snapshots increases.

Or maybe it's just the number of snapshots that's causing the slowness, i.e. regardless of whether the backend repository is S3 or fs? I guess I should have an additional cron job that deletes older snapshots. Is there a good rule of thumb on the number of snapshots to retain?
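In case it helps, the cleanup I have in mind would be something like this sketch (my_backup is a placeholder repository name; which snapshots to delete is up to your retention policy):

```
# List all snapshots in the repository
curl -XGET 'localhost:9200/_snapshot/my_backup/_all?pretty'

# Delete an old snapshot by name; the repository then prunes any files
# that no remaining snapshot still references
curl -XDELETE 'localhost:9200/_snapshot/my_backup/snapshot_old'
```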

@colings86

@imotov we discussed this issue but were unclear on what the differences between the index.blocks.* options are, and why the snapshot fails with read_only set to true?


imotov commented Feb 20, 2015

@colings86 there is an ongoing effort to resolve this issue in #9203

tlrx added a commit to tlrx/elasticsearch that referenced this issue Apr 9, 2015
This commit splits the current ClusterBlockLevel.METADATA into two distinct blocks, ClusterBlockLevel.METADATA_READ and ClusterBlockLevel.METADATA_WRITE. This makes it possible to distinguish between operations that modify the index or cluster metadata and operations that do not change any metadata.

Before this commit, many operations were blocked when the cluster was read-only: Cluster Stats, Get Mappings, Get Snapshot, Get Index Settings, etc. Now those operations are allowed even when the cluster or the index is read-only.

Related to elastic#8102, elastic#2833

Closes elastic#3703
Closes elastic#5855
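In practical terms, the METADATA_READ/METADATA_WRITE split means a read-only cluster still answers metadata reads while continuing to reject metadata writes. A rough sketch of the observable difference (the host and the my_backup repository name are illustrative):

```
# With cluster.blocks.read_only set to true:

# Metadata reads now succeed (they only require METADATA_READ)
curl -XGET 'localhost:9200/_mapping'
curl -XGET 'localhost:9200/_snapshot/my_backup/_all'

# Metadata writes are still rejected with a ClusterBlockException
curl -XPUT 'localhost:9200/new-index'
```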
tlrx added a commit to tlrx/elasticsearch that referenced this issue Apr 10, 2015
@javanna javanna added the :Distributed/Snapshot/Restore Anything directly related to the `_snapshot/*` APIs label Apr 10, 2015
tlrx added a commit to tlrx/elasticsearch that referenced this issue Apr 23, 2015
tlrx added a commit that referenced this issue Apr 23, 2015
This commit splits the current ClusterBlockLevel.METADATA into two distinct blocks, ClusterBlockLevel.METADATA_READ and ClusterBlockLevel.METADATA_WRITE. This makes it possible to distinguish between operations that modify the index or cluster metadata and operations that do not change any metadata.

Before this commit, many operations were blocked when the cluster was read-only: Cluster Stats, Get Mappings, Get Snapshot, Get Index Settings, etc. Now those operations are allowed even when the cluster or the index is read-only.

Related to #8102

Closes #3703
Closes #5855
Closes #10521
Closes #10522
Closes #2833

imotov commented May 19, 2015

After discussing this with @tlrx, it looks like the best way to address this issue is to move the snapshot and restore elements of the cluster state from the cluster metadata to a custom cluster state part, where they seem to belong (information about currently running snapshots and restores hardly qualifies as metadata).
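For context, the in-progress information referred to here is what the snapshot status API exposes, for example (output omitted):

```
# Show all currently running snapshots, cluster-wide
curl -XGET 'localhost:9200/_snapshot/_status?pretty'
```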

@s1monw s1monw added v1.6.1 and removed v1.6.0 labels Jun 3, 2015
@s1monw s1monw removed the v1.5.3 label Jun 3, 2015
imotov added a commit to imotov/elasticsearch that referenced this issue Jun 4, 2015
…rom custom metadata to custom cluster state part

Information about in-progress snapshot and restore processes is not really metadata and should be represented as a part of the cluster state, similar to discovery nodes, the routing table, and cluster blocks. Since in-progress snapshot and restore information is no longer part of the metadata, this refactoring also enables us to handle cluster blocks in a more consistent manner and allows creating snapshots of a read-only cluster.

Closes elastic#8102
@imotov imotov assigned imotov and unassigned tlrx Jun 4, 2015
imotov added a commit to imotov/elasticsearch that referenced this issue Jun 11, 2015
…rom custom metadata to custom cluster state part