
cmd/subd: export a merkle hash of on-disk state #551

Open
masiulaniec opened this issue Jan 24, 2019 · 17 comments

masiulaniec commented Jan 24, 2019

It would be good to have a metric that holds, for example, a float32 obtained by taking a 4-byte prefix of a SHA-512 of the Merkle hash of the entire file system. Such values could be logged to time-series databases and used by monitoring systems to make sure that the managed hosts converge to the same bits. Subd already scans the file system and calculates hashes, so deriving a Merkle hash should not introduce much extra overhead.

rgooch commented Jan 24, 2019

I like the basic idea. Does it need to be a Merkle tree hash? That would require storing hashes in the directory inodes.
A more straightforward implementation might be to have a modified hasher which hashes each hash that is computed.
Note that, either way, this would only expose a hash of all the file data. Inode metadata would not be captured.

@masiulaniec

Agreed on all counts. I just wanted to put the basic idea in your head. I think metadata ought to be part of the hash.

rgooch commented Jan 24, 2019

Metadata will be more complicated, but I agree that it's the kind of thing you'd want. Perhaps mtime data should not be included, though?

@masiulaniec

I can see wanting to exclude mtime for computed files.

rgooch commented Jan 25, 2019

What about mtime for regular files?

@masiulaniec

For regular files, mtime is image-defined and enforced just like any other attribute. I would include it.

rgooch commented Jan 25, 2019

If mtimes for regular files are included in the hash, they will also be included for computed files, because as far as the sub is concerned, they are just regular files. It's only the Dominator that knows that they are computed files.

@masiulaniec

Ack. So I don't see a reason to exclude mtime from the hash. We plan to do horizontal checks (host vs. host) and vertical checks (host vs. image).

rgooch commented Jan 28, 2019

The mtime difference for computed files will make that difficult.

@masiulaniec

The computed files will all have equal mtime thanks to os.Chtimes, no?

rgooch commented Jan 30, 2019

The mtime for computed files is taken from the current time when the Dominator sees that the computed file contents need to be changed. So, in practice, every sub is going to have a different mtime for a particular computed file. There is no horizontal consistency.

@masiulaniec

I can see two options: a) set the mtime anyway (I realize this could confuse tools such as rsync), b) present the hasher with zero mtime for computed files.

@masiulaniec

I understand option b) would require the Dominator to start revealing to subd that certain files are computed, a classification detail that is currently beautifully hidden.

masiulaniec commented Jan 30, 2019

Your suggestion of excluding mtime from hash computation sounds pragmatic: it would allow the feature to be implemented without expanding interfaces but would not preclude including mtime later if a clean design is found.

rgooch commented Jan 31, 2019

Yes, excluding mtime from the hash seems the best for now. I'm reluctant to complicate subd unless it's essential.

@masiulaniec

Alternatively, the metric could be emitted at the level of the dominator server where the distinction between regular and computed files can still be made.

rgooch commented Apr 13, 2019

Hm. Maybe we should take a step back and look at the problem you're trying to solve? Do you want to ensure that all machines converge to the required state, and have alerting for machines which do not converge (after N attempts, say)? If that's what you're looking for, then the Dominator already knows this. It's currently presented in the dashboard, and it could be exposed via metrics too.
