Implement Persistent Volumes #3897

lmars · 2017-02-02T04:30:48Z

This pull request is a continuation of the work already done on the persistent-volumes branch to implement persistent volumes as per #2438.

Summary:

the scheduler syncs volumes from all hosts and persists them to the volumes and job_volumes tables in the controller database
when placing a job which requires volumes, the scheduler first tries to locate existing, unassigned volumes for the job's app / release / type and the volume's path, and if it finds one then places the job on the same host as the volume whilst also assigning the volume to the job (i.e. setting Volume.JobID)
volumes can be "decommissioned" via flynn volume decommission ID which causes the volume to not be attached to any new jobs
if a host which has unassigned volumes is down, jobs are still scheduled on that host but enter a new blocked state waiting for either the host to come back up or the volume to be decommissioned. This then means if a host is rebooted which has data for a process type, the job just stays down until the host comes back rather than being restarted on a different host with an empty volume, but if the host has really gone away then it is up to the operator to decommission the volume and unblock the job to be scheduled on a different host

Things which need to be considered but not included in this PR:

volumes won't be persisted through deployments, but the design allows for that to be added later (see this comment)
we should add flynn volume backup and flynn volume restore so that if a host is lost and volumes need to be decommissioned in order to move jobs to other hosts, operators can first restore volumes so that the unblocked job doesn't have to start with a completely empty volume

titanous · 2017-02-13T15:54:27Z

cli/volume.go

+	register("volume", runVolume, `
+usage: flynn volume
+       flynn volume show [--json] <id>
+       flynn volume decommission <id>


Should this command be scoped to the app by default?

titanous · 2017-02-13T16:03:59Z

controller/scheduler/host.go

+							Type:   VolumeEventTypeDestroy,
+						}
+					}
+					ch <- e


Should this select on h.stop as well?

titanous · 2017-02-13T16:40:52Z

controller/scheduler/scheduler.go

+		// and this volume doesn't exist on that host
+		if job.HostID != "" && vol.HostID != job.HostID {
+			continue
+		}


Is exclusivity enforced in flynn-host too? There can technically be two schedulers scheduling jobs at once under failure conditions.

titanous · 2017-02-13T16:47:22Z

controller/scheduler/scheduler.go

@@ -1035,6 +1274,10 @@ func (s *Scheduler) HandleInternalStateRequest(req *InternalStateRequest) {
 		req.State.Formations[key.String()] = &f
 	}

+	for id, vol := range s.volumes {
+		req.State.Volumes[id] = &(*vol)


What is the reason for &(*vol)?

We need to copy it to create a "snapshot" of the scheduler's state to pass back to the caller.

lmars · 2017-03-02T11:19:50Z

@titanous comments addressed.

Signed-off-by: Lewis Marshall <lewis@lmars.net>

Useful for the scheduler creating volumes which it needs to track. Signed-off-by: Lewis Marshall <lewis@lmars.net>

Signed-off-by: Lewis Marshall <lewis@lmars.net>

lmars force-pushed the persistent-volumes-1 branch 2 times, most recently from e24e66d to 0c3914e Compare February 10, 2017 16:34

titanous reviewed Feb 13, 2017

View reviewed changes

lmars force-pushed the persistent-volumes-1 branch from 5fc5d22 to e9fedf0 Compare March 2, 2017 11:18

titanous approved these changes Mar 7, 2017

View reviewed changes

lmars force-pushed the persistent-volumes-1 branch from e9fedf0 to db239f9 Compare March 8, 2017 12:54

lmars added 10 commits March 8, 2017 22:16

controller/utils: Fix ProvisionVolume

6b590fc

Signed-off-by: Lewis Marshall <lewis@lmars.net>

appliance/redis: Wait for Redis to start during provision

899c72f

Signed-off-by: Lewis Marshall <lewis@lmars.net>

host/volume: Add volume events

41dfb58

Signed-off-by: Lewis Marshall <lewis@lmars.net>

scheduler: Sync volumes from hosts

daafabd

Signed-off-by: Lewis Marshall <lewis@lmars.net>

host/volume: Allow clients to set volume IDs

234a7eb

Useful for the scheduler creating volumes which it needs to track. Signed-off-by: Lewis Marshall <lewis@lmars.net>

controller: Add volume API

7d50aeb

Signed-off-by: Lewis Marshall <lewis@lmars.net>

scheduler: Place jobs on hosts with existing volumes

4df23ba

Signed-off-by: Lewis Marshall <lewis@lmars.net>

cli: Add volume command

ce1a987

Signed-off-by: Lewis Marshall <lewis@lmars.net>

host/volume: Retry creating volumes

582aafd

Signed-off-by: Lewis Marshall <lewis@lmars.net>

host: Ensure volumes are only attached to at most one job

d714bca

Signed-off-by: Lewis Marshall <lewis@lmars.net>

lmars force-pushed the persistent-volumes-1 branch from db239f9 to d714bca Compare March 8, 2017 23:01

lmars merged commit 7669dff into master Mar 9, 2017

lmars deleted the persistent-volumes-1 branch March 9, 2017 14:36

titanous mentioned this pull request Dec 5, 2017

Persistent Volumes #2438

Closed

5 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement Persistent Volumes #3897

Implement Persistent Volumes #3897

lmars commented Feb 2, 2017

titanous Feb 13, 2017

titanous Feb 13, 2017

titanous Feb 13, 2017

titanous Feb 13, 2017

lmars Mar 2, 2017

lmars commented Mar 2, 2017

Implement Persistent Volumes #3897

Implement Persistent Volumes #3897

Conversation

lmars commented Feb 2, 2017

titanous Feb 13, 2017

Choose a reason for hiding this comment

titanous Feb 13, 2017

Choose a reason for hiding this comment

titanous Feb 13, 2017

Choose a reason for hiding this comment

titanous Feb 13, 2017

Choose a reason for hiding this comment

lmars Mar 2, 2017

Choose a reason for hiding this comment

lmars commented Mar 2, 2017