ENH: add layout metadata compiler and examples #5

SantiagoTorres · 2017-10-24T16:03:52Z

Hello, this commit adds the metadata compiler and a couple of metadata samples (debian grep and seattle).
Don't review the link metadata files, just the readme and the compiler please.

lukpueh

Sorry for the extremely late review, this really slipped under my radar. As requested, I read over compile-examples.py and I like it. I wonder if it's worth building this into in-toto? AFAICS, the larger part of the script takes care of traversing the metadata, in order to sort fields and truncate long values, which largely aligns with the feature request in in-toto/in-toto#18. What do you think?

If we do want to merge this here, there are a couple of things that need to be fixed:

The script is not compatible with the current metadata specification (nor are the ~30K added lines of metadata)
It shouldn't be required that the layout is called root.layout
Links are loaded by globbing for *.link files and showed in the order glob returns them. Should we not rather load them as they are defined in layout, as e.g. in_toto.verifylib.load_links_for_layout does?
Link file globbing did not work for me (I didn't troubleshoot though)
Sublayout's are not handled
IMO it comes unexpected that the displayed materials and products are random samples if there are more than 9 of them
Adding ellipses as last item (to show that a dict is truncated), does not guarantee that it is indeed printed as last item. Python does not guarantee to keep the order of dict items as they were inserted.
A bare minimum of documentation would be nice

Let me know if I should help.

lukpueh · 2019-04-17T15:32:30Z

Btw. here's a list of required metadata schema changes:

metablock
- all top level fields are now under the signed field, and the signatures field is a sibling of the signed field
layout
- expected_command and run must be lists
- expires must not have milliseconds
- material_matchrules are now expected_materials
- product_matchrules are now expected_products
- MATCH rule syntax has change
- threshold is a mandatory field
link
- _type value is lowercase
- return_value is now part of byproducts
- environment is a mandatory field

SantiagoTorres · 2019-04-17T15:32:41Z

Hi!

I wonder if it's worth building this into in-toto? AFAICS, the larger part of the script takes care of traversing the metadata, in order to sort fields and truncate long values, which largely aligns with the feature request in in-toto/in-toto#18. What do you think?

I'm not entirely sure if this is what's to be addressed on this side. I intentionally avoided using in-toto as a dependency (or any templating library for that matter) so as to keep this script-y. I believe that that issue has been floating around and having different meanings every time we revisit it, I'm afraid.

As for the point issues:

The script is not compatible with the current metadata specification (nor are the ~30K added lines of metadata)

Yes, unfortunately time has gone by, we may want to update it to conform to the latest spec.

It shouldn't be required that the layout is called root.layout

Probably not, but considering we're the ones that will be running this to update our examples in the docs I don't see why we should care much about user interface right away (this could be also ticketized)

Links are loaded by globbing for *.link files and showed in the order glob returns them. Should we not rather load them as they are defined in layout, as e.g. in_toto.verifylib.load_links_for_layout does?

This requires a dependency on in-toto or smarter parsing of json objects, which I tried to avoid. Again, we could make the tool smarter if we'd like by adding more deps/code, but we may want to think about it for just a docs repo.

Link file globbing did not work for me (I didn't troubleshoot though)

I suspect it's because the keyid prefix,

Sublayout's are not handled

No, as we don't have any examples on the docs repo that use sublayouts. We can always add support for this as the need arises.

IMO it comes unexpected that the displayed materials and products are random samples if there are more than 9 of them

I can't personally think of any other way to keep things succint, but I'm open to suggestions. I do agree this is not a perfect solution to overly verbose metadata.

Adding ellipses as last item (to show that a dict is truncated), does not guarantee that it is indeed printed as last item. Python does not guarantee to keep the order of dict items as they were inserted.

This is true, and it's something I bailed on working on back then. We could use a frozendict or serialize and then append on the printout after-the-fact (which would be messy).

A bare minimum of documentation would be nice

Agreed. Let's decide on whether this goes here and work accordingly.

- metablock - all top level fields are now under the `signed` field, and the `signatures` field is a sibling of the `signed` field - layout - `expected_command` and `run` must be lists - `expires` must not have milliseconds - `material_matchrules` are now `expected_materials` - `product_matchrules` are now `expected_products` - `MATCH` rule syntax has change - `threshold` is a mandatory field - link - `_type` value is lowercase - `return_value` is now part of `byproducts` - `environment` is a mandatory field

Add function that recursively traverses a passed python object, e.g. in-toto metadata, allowing to truncate long strings, lists and dicts, and to reorder dict keys, using OrderedDict.

Update and merge template populating functions, make them use the newly added metadata beaufifier (truncate and order) and rename to create_markdown_summary.

Update markdown summaries for debian, polypasswordhasher and seattle sample in-tot metadata.

Update sample metadata summaries for pph, seattle and debian supply chain metadata, using latest version of Santiago's metadata compiler script (see in-toto/specification#5). This commit also adds jekyll frontmatter to auto convert markdown to html and make the metadata sample pages part of the website layout.

lukpueh · 2019-05-15T11:38:42Z

@SantiagoTorres, I updated the sample layout and link metadata to meet the latest version of the spec, cleaned-up the compiler script, and used it to create new versions of the markdown-formatted summaries, which I have already published on our website (see pph, seattle and debian).

Let me know what you think.

lukpueh · 2019-05-15T12:30:55Z

Btw. 088277b is my stab at in-toto/in-toto#18. IMO especially the dict field ordering for pretty printing is very useful.

adityasaky · 2019-09-23T17:04:51Z

examples/compile-examples.py

+    if len(obj) > kw["max_str_len"]:
+      obj = obj[:kw["max_str_len"] - 3] + "..."
+
+  # Truncate list and recurse into _beautify for each  item


Extra space after each

Good eyes! Thanks for the review. :)

SantiagoTorres added the enhancement label Oct 24, 2017

SantiagoTorres assigned lukpueh and reza-curtmola Oct 24, 2017

lukpueh reviewed Apr 17, 2019

View reviewed changes

SantiagoTorres and others added 3 commits May 14, 2019 13:12

ENH: add layout metadata compiler and examples

397d2aa

ENH: Address Trishank's review comments

72d8ad2

lukpueh force-pushed the metadata-samples branch from cea8f0d to 10c0ea1 Compare May 14, 2019 14:48

lukpueh added 4 commits May 15, 2019 11:16

Add document header to examplile compiler script

120bf6a

Add beautify function to example compiler

088277b

Add function that recursively traverses a passed python object, e.g. in-toto metadata, allowing to truncate long strings, lists and dicts, and to reorder dict keys, using OrderedDict.

Refactor exampile compiler script

8fc42ff

Update and merge template populating functions, make them use the newly added metadata beaufifier (truncate and order) and rename to create_markdown_summary.

Update sample metadata summaries with new compiler

69c5a17

Update markdown summaries for debian, polypasswordhasher and seattle sample in-tot metadata.

lukpueh force-pushed the metadata-samples branch from 1ccdb89 to 69c5a17 Compare May 15, 2019 10:55

lukpueh mentioned this pull request May 15, 2019

Update sample metadata summaries in-toto/in-toto.github.io#10

Merged

lukpueh closed this May 15, 2019

lukpueh reopened this May 15, 2019

adityasaky approved these changes Sep 23, 2019

View reviewed changes

Remove stray space in example compiler script

d5a9464

lukpueh merged commit ef76b20 into master Sep 24, 2019

lukpueh mentioned this pull request Jan 28, 2020

Inconsitencies between the in-toto specification and reference implementations: to _name or to name, that is the question #24

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: add layout metadata compiler and examples #5

ENH: add layout metadata compiler and examples #5

SantiagoTorres commented Oct 24, 2017

lukpueh left a comment

lukpueh commented Apr 17, 2019 •

edited

Loading

SantiagoTorres commented Apr 17, 2019 •

edited

Loading

lukpueh commented May 15, 2019

lukpueh commented May 15, 2019

adityasaky Sep 23, 2019

lukpueh Sep 24, 2019

ENH: add layout metadata compiler and examples #5

ENH: add layout metadata compiler and examples #5

Conversation

SantiagoTorres commented Oct 24, 2017

lukpueh left a comment

Choose a reason for hiding this comment

lukpueh commented Apr 17, 2019 • edited Loading

SantiagoTorres commented Apr 17, 2019 • edited Loading

lukpueh commented May 15, 2019

lukpueh commented May 15, 2019

adityasaky Sep 23, 2019

Choose a reason for hiding this comment

lukpueh Sep 24, 2019

Choose a reason for hiding this comment

lukpueh commented Apr 17, 2019 •

edited

Loading

SantiagoTorres commented Apr 17, 2019 •

edited

Loading