Enable state locking for plan/apply/destroy/refresh/taint/untaint #11686

jbardin · 2017-02-03T21:33:37Z

This enables the locking of state through the command UI.

This does change the behavior of 2 tests. Previously when running a plan with no existing state, the plan would be written out and then backed up on the next WriteState by another BackupState instance. Since we now maintain a single State instance throughout an operation, the backup happens before any state exists so no backup file is created. This shouldn't be a problem, as there really was nothing that required backing up. Now those tests will create the state file before running.

The lock/unlock terraform commands will be added in another PR. Only local state is supported so far, so the commands aren't yet required.

Have the LocalBackend lock the state during operations, and enble this for the apply comand.

This makes it more apparent that the information passed in isn't required nor will it conform to any standard. There may be call sites that can't provide good contextual info, and we don't want to count on that value.

Previously when runnign a plan with no exitsing state, the plan would be written out and then backed up on the next WriteState by another BackupState instance. Since we now maintain a single State instance thoughout an operation, the backup happens before any state exists so no backup file is created. This is OK, as the backup state the tests were checking for is from the plan file, which already exists separate from the state.

We are not going to handle lock expiration, at least at this time, so remove the Expires fields to avoid any confusion.

Depending on the implementation, local state locks may be reentrant within the same process. Use a separate process to test locked state files.

Verify that these operations fail when a state file is locked.

this way we can signal it directly to amke sure it exits cleanly.

add missing lock-state flag to untaint

Close and remove the file descriptor from LocalState if we Unlock the state. Also remove an empty state file if we created it and it was never written to. This is mostly to clean up after tests, but doesn't hurt to not leave empty files around.

pchaganti · 2017-02-04T14:42:49Z

👍

mitchellh

Some minor changes, overall looks amazing.

mitchellh · 2017-02-05T20:52:54Z

backend/backend.go

@@ -22,7 +22,8 @@ type Backend interface {

 	// State returns the current state for this environment. This state may
 	// not be loaded locally: the proper APIs should be called on state.State
-	// to load the state.
+	// to load the state. If the state.State is a state.Locker, it's up to the
+	// caller to call Lock and Unlock as needed.


Thanks for updating the comment, this is the correct behavior I wanted!

mitchellh · 2017-02-05T20:55:10Z

backend/local/backend_apply.go

+	defer func() {
+		if s, ok := opState.(state.Locker); op.LockState && ok {
+			if err := s.Unlock(); err != nil {
+				log.Printf("[ERROR]: %s", err)


We should multierror append the error to runningOp.Err so that the error shows up to the end user. I would make a long const (like other error messages in the package) for it so that the user knows what to do: verify everything is okay, manually call terraform unlock

Ah yes. I was thinking this would only apply to LocalState, where Unlock shouldn't error, and the lock is gone on exit anyway.

mitchellh · 2017-02-05T20:57:24Z

command/meta.go

 	statePath    string
 	stateOutPath string
 	backupPath   string
 	parallelism  int
 	shadow       bool
 	provider     string
+	lockState    bool


Nitpick: let's name this stateLock just to match the other state-related fields above (statePath, stateOutPath)

mitchellh · 2017-02-05T20:58:00Z

command/apply.go

@@ -272,6 +274,8 @@ Options:
                         modifying. Defaults to the "-state-out" path with
                         ".backup" extension. Set to "-" to disable backup.

+  -lock-state=true       Lock the state file when locking is supported.


Bike shed: Let's just use -lock. Maybe we'll lock more in the future maybe we won't but I think its clear regardless and I'd prefer the aesthetic of it.

mitchellh · 2017-02-05T20:59:05Z

backend/backend.go

@@ -99,6 +103,10 @@ type Operation struct {
 	// Input/output/control options.
 	UIIn  terraform.UIInput
 	UIOut terraform.UIOutput
+
+	// If LockState is true, the Operation must Lock any
+	// state.Lockers for its duration, and Unlock when complete.


Add to the comment: if using backend.Local, it is up to the caller to unlock the state.

I don't think backend.Local is an exception here. The state is acquired and used solely within Enhanced.Operation, which is done for backend.Local too. I did note that Backend.State expects the caller to lock the state as needed, and that's called from within an Operation.

Have the defer'ed State.Unlock call append any error to the RunningOperation.Err field. Local error would be rare and self-correcting, but when the backend.Local is using a remote state the error may require user intervention.

mitchellh

I think this is good. One thing I want to just leave as a note here but shouldn't block this merge: we should think through a UX if locking is taking awhile (for whatever reason, the network).

In Otto I had created a package that was basically "do this, but if it takes longer than N (time.Duration) then show this message". I think bringing that as a helper package here and using that for cases like this would be ideal. In the average case, state locking should be fast enough, if its taking longer than 100ms or something we should probably inform the user that we're trying to acquire a state lock. I could see some users terraform <op> hanging and being curious why.

ghost · 2020-04-17T02:02:32Z

I'm going to lock this issue because it has been closed for 30 days ⏳. This helps our maintainers find and focus on the active issues.

If you have found a problem that seems similar to this, please open a new issue and complete the issue template so we can capture all the details necessary to investigate further.

jbardin added 14 commits February 2, 2017 18:08

enable local state locking for apply

9cdba1f

Have the LocalBackend lock the state during operations, and enble this for the apply comand.

Change lock reason -> info

1078781

This makes it more apparent that the information passed in isn't required nor will it conform to any standard. There may be call sites that can't provide good contextual info, and we don't want to count on that value.

add locking to plan and refresh commands

dd19cb2

add -lock-state usage to plan/refresh/apply/destr

a157ebb

Add state locking in taint/untaint

9160884

Remove "expires" from lock info.

a2b5811

We are not going to handle lock expiration, at least at this time, so remove the Expires fields to avoid any confusion.

apply-test

6a20c35

Add separate program for locking state files

fb60b6f

Depending on the implementation, local state locks may be reentrant within the same process. Use a separate process to test locked state files.

Add test for apply/refresh on locked state files

bd65ddb

Verify that these operations fail when a state file is locked.

build the statelocker binary before running

f3e4c05

this way we can signal it directly to amke sure it exits cleanly.

Add test for locked state in plan

9fa436e

Add test for destroy with locked state

82e59cd

Add test/untaint tests with locked state

cd96bb5

add missing lock-state flag to untaint

jbardin added core enhancement labels Feb 3, 2017

jbardin requested a review from mitchellh February 3, 2017 21:33

Cleanup state file during Unlock

e92559f

Close and remove the file descriptor from LocalState if we Unlock the state. Also remove an empty state file if we created it and it was never written to. This is mostly to clean up after tests, but doesn't hurt to not leave empty files around.

mitchellh suggested changes Feb 5, 2017

View reviewed changes

jbardin added 3 commits February 6, 2017 09:54

Update runningOp.Err with State.Unlock error

0d7752b

Have the defer'ed State.Unlock call append any error to the RunningOperation.Err field. Local error would be rare and self-correcting, but when the backend.Local is using a remote state the error may require user intervention.

s/Meta.lockState/Meta.stateLock/g

0790318

Change CLI flag to '-lock'

eb8e5ac

mitchellh approved these changes Feb 6, 2017

View reviewed changes

jbardin merged commit 9fbc5b1 into master Feb 6, 2017

jbardin deleted the jbardin/state-locking branch February 6, 2017 18:42

apparentlymart mentioned this pull request Feb 7, 2017

remote: Introduce locking mechanism into remote backend interface #5036

Closed

hashicorp locked and limited conversation to collaborators Apr 17, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable state locking for plan/apply/destroy/refresh/taint/untaint #11686

Enable state locking for plan/apply/destroy/refresh/taint/untaint #11686

jbardin commented Feb 3, 2017

pchaganti commented Feb 4, 2017

mitchellh left a comment

mitchellh Feb 5, 2017

mitchellh Feb 5, 2017

jbardin Feb 6, 2017

mitchellh Feb 5, 2017

mitchellh Feb 5, 2017

mitchellh Feb 5, 2017

jbardin Feb 6, 2017

mitchellh left a comment

ghost commented Apr 17, 2020

Enable state locking for plan/apply/destroy/refresh/taint/untaint #11686

Enable state locking for plan/apply/destroy/refresh/taint/untaint #11686

Conversation

jbardin commented Feb 3, 2017

pchaganti commented Feb 4, 2017

mitchellh left a comment

Choose a reason for hiding this comment

mitchellh Feb 5, 2017

Choose a reason for hiding this comment

mitchellh Feb 5, 2017

Choose a reason for hiding this comment

jbardin Feb 6, 2017

Choose a reason for hiding this comment

mitchellh Feb 5, 2017

Choose a reason for hiding this comment

mitchellh Feb 5, 2017

Choose a reason for hiding this comment

mitchellh Feb 5, 2017

Choose a reason for hiding this comment

jbardin Feb 6, 2017

Choose a reason for hiding this comment

mitchellh left a comment

Choose a reason for hiding this comment

ghost commented Apr 17, 2020