
allocwatcher: don't destroy local allocdir after migration #18108

Merged: 1 commit into main on Aug 2, 2023

Conversation

@tgross (Member) commented Jul 31, 2023

When ephemeral disks are migrated from an allocation on the same node, allocation logs for the previous allocation are lost.

There are two workflows for the best-effort attempt to migrate the allocation data between the old and new allocations. For previous allocations on other clients (the "remote" workflow), we create a local allocdir and download the data from the previous client into it. That data is then moved into the new allocdir and we delete the allocdir of the previous alloc.

For "local" previous allocations we don't need to create an extra directory for the previous allocation and instead move the files directly from one to the other. But we still delete the old allocdir entirely, which includes all the logs!

There doesn't seem to be any reason to destroy the local previous allocdir, as the usual client garbage collection should destroy it later on when needed. By not deleting it, the previous allocation's logs are still available for the user to read.
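For illustration only, here is a rough sketch of the two workflows and of the change. The names are hypothetical and plain os calls stand in for Nomad's actual allocdir helpers; the real change is in the client's allocwatcher.

package allocmigrate

import (
	"os"
	"path/filepath"
)

// dataDir returns the ephemeral disk path inside an allocdir.
func dataDir(allocDir string) string {
	return filepath.Join(allocDir, "alloc", "data")
}

// migrateRemote sketches the "remote" workflow: data from the previous
// client is downloaded into a locally created directory, moved into the
// new allocdir, and the downloaded copy is then deleted.
func migrateRemote(newAllocDir string, download func(dest string) error) error {
	staging, err := os.MkdirTemp("", "prev-alloc")
	if err != nil {
		return err
	}
	defer os.RemoveAll(staging) // deleting the downloaded copy is safe here
	if err := download(staging); err != nil {
		return err
	}
	if err := os.RemoveAll(dataDir(newAllocDir)); err != nil {
		return err
	}
	return os.Rename(filepath.Join(staging, "data"), dataDir(newAllocDir))
}

// migrateLocal sketches the "local" workflow: files are moved directly
// between the two allocdirs on the same node.
func migrateLocal(prevAllocDir, newAllocDir string) error {
	if err := os.RemoveAll(dataDir(newAllocDir)); err != nil {
		return err
	}
	if err := os.Rename(dataDir(prevAllocDir), dataDir(newAllocDir)); err != nil {
		return err
	}
	// Before this patch the previous allocdir was destroyed at this
	// point, roughly:
	//
	//	os.RemoveAll(prevAllocDir)
	//
	// which also deleted <prevAllocDir>/alloc/logs. Skipping that call
	// leaves the logs readable; the client's regular garbage collection
	// reaps the directory later.
	return nil
}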

Fixes: #18034

@tgross (Member, Author) commented Aug 1, 2023

After confirming the behavior in #18034, I tested the patch in this PR with the following job specification.

jobspec
job "example" {

  group "web" {

    network {
      mode = "bridge"
      port "www" {
        to = 8001
      }
    }

    ephemeral_disk {
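      # migrate: best-effort migration of the ephemeral disk data to
      # the replacement allocation
      # sticky: best-effort placement of the replacement on the same node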
      migrate = true
      sticky  = true
    }

    task "httpd" {

      driver = "docker"

      config {
        image   = "busybox:1"
        command = "httpd"
        args    = ["-vv", "-f", "-p", "8001", "-h", "/local"]
        ports   = ["www"]
      }

      resources {
        cpu    = 128
        memory = 100
      }

    }
  }
}

Run the job:

$ nomad job run ./example.nomad.hcl
...
    2023-08-01T11:32:50-04:00: Allocation "b1461797" created: node "d9f5ad8c", group "web"
...

Write to its logs (the httpd task logs each request, so a curl is enough to generate log output) and to its ephemeral disk, then verify both writes have been made:

$ curl -so /dev/null "http://10.37.105.80:24429/alloc-b1461797"
$ nomad alloc exec b1461797 /bin/sh -c 'echo $NOMAD_ALLOC_ID >> /alloc/data/test.txt'

$ nomad alloc fs b1461797 /alloc/data/test.txt
b1461797-a661-d640-f8b8-b6e0e4b91d2e

$ nomad alloc logs -stderr b1461797
[::ffff:10.37.105.1]:52584: url:/alloc-b1461797
[::ffff:10.37.105.1]:52584: response:404

Update the job and run again:

$ nomad job run ./example.nomad.hcl
...
    2023-08-01T11:34:47-04:00: Allocation "525dd5d2" created: node "d9f5ad8c", group "web"
...

Verify that the logs for the previous allocation still exist, that the ephemeral disk data has been migrated to the new allocation, and that the data has been moved out of (not copied from) the old allocation's ephemeral disk, as expected.

$ nomad alloc logs -stderr b1461797
[::ffff:10.37.105.1]:52584: url:/alloc-b1461797
[::ffff:10.37.105.1]:52584: response:404

$ nomad alloc fs b1461797 /alloc/data/test.txt
Unexpected response code: 404 (rpc error: stat /var/nomad/data/alloc/b1461797-a661-d640-f8b8-b6e0e4b91d2e/alloc/data/test.txt: no such file or directory)

$ nomad alloc fs 525dd5d2 /alloc/data/test.txt
b1461797-a661-d640-f8b8-b6e0e4b91d2e

@shoenig (Member) left a comment
LGTM!

tgross merged commit 8ad663d into main on Aug 2, 2023 (25 checks passed)
tgross deleted the migrate-preserve-logs branch on Aug 2, 2023
tgross added the backport/1.4.x, backport/1.5.x, and backport/1.6.x labels on Aug 2, 2023
tgross added a commit that referenced this pull request on Aug 2, 2023 (same commit message as the PR description above)
@tgross (Member, Author) commented Aug 2, 2023

The backport assistant failed at random for the 1.4.x release branch, so I've backported that one by hand.

Labels: backport/1.4.x, backport/1.5.x, backport/1.6.x, theme/client, theme/data migration, theme/logging, type/bug

Closes: Allocation Logs Immediately Removed on Allocation Replacement (#18034)