metaformats: should we distinguish the parsed output item more explicitly? #2

snarfed · 2023-11-29T20:16:23Z

Right now, if https://microformats.org/wiki/metaformats finds eligible metaformats, it generates an h-entry and appends it to the returned items. There's no way to distinguish this item from real mf2 items, though, which is unfortunate. As an implementor, I could use one! Especially for interpreting home page metaformats as an h-card, eg #3, but also for non-homepage pages. Should we include a new property? New type? (I assume not.) Something else? cc @tantek

The text was updated successfully, but these errors were encountered:

sknebel · 2023-11-30T12:46:02Z

given that they as far as I see don't really participate in the nesting of objects (i.e. a metaformats-parsed object is not going to be a child or property-value of an mf2-parsed object, nor vice-versa) they could be sorted in a separate list, e.g. metaformats-items. Alternatively, they could have an extra flag on the same level as type

aciccarello · 2023-12-02T00:14:07Z

I wondered about this too in microformats/microformats-parser#229.

Should there be a property identifying the mf as being parsed from metaformats in case someone wants to cleanup messy meta tag content

I'd prefer to not put them in a separate list so a consumer of the parsed output doesn't need to do anything extra. So far I haven't personally needed to know if if an output if from metaformats, but I could see a property identifying it being useful.

angelogladding · 2023-12-04T01:30:09Z

I think adding a new property meta-item keeps things clean and explicit. In Python:

if parsed["meta-item"]:

vs. eg.

if parsed["items"] and parsed["items"][-1].get("source") == "metaformats":

I believe mf2py can toggle metaformats parsing on by default immediately if we can keep items as is and use meta-item experimentally -- see microformats/mf2py#213 (comment)

snarfed · 2023-12-04T02:42:12Z

As @aciccarello mentioned, the problem is that a separate list forces all consumers to have to be explicit. One of the benefits of the current metaformats spec is that it lets current mf2 consuming code (choose to) benefit from metaformats automatically, without any changes. New top-level field preserves that, separate list doesn't.

angelogladding · 2023-12-05T00:03:29Z

I do like automatic fallback for entries. Now I better see what you guys are talking about.

mf2util will need to be updated to look for the new top-level field and ignore it when interpreting a feed but everything else in that library should just work (again by simply operating on the first item).

>>> mf2json = mf2py.parse(url="https://zeldman.com", metaformats=True)
>>> homepage_feed = mf2util.interpret_feed(mf2json, "https://zeldman.com")
>>> homepage_feed["entries"][-1]["name"]
'Zeldman on Web and Interaction Design'

The fix will look something like this which is perfectly fine:

if feed["entries"][-1].get("source") == "metaformats":
    feed["entries"].pop()

And you'll never actually need to look up the meta item so I was optimizing for a non-existent case with:

if parsed["meta-item"]:

So keeping it in items and adding a top-level field does make good sense.

tantek transferred this issue from microformats/microformats2-parsing Dec 2, 2023

angelogladding mentioned this issue Dec 4, 2023

Add extension to support metaformats microformats/mf2py#213

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

metaformats: should we distinguish the parsed output item more explicitly? #2

metaformats: should we distinguish the parsed output item more explicitly? #2

snarfed commented Nov 29, 2023 •

edited

sknebel commented Nov 30, 2023

aciccarello commented Dec 2, 2023 •

edited

angelogladding commented Dec 4, 2023 •

edited

snarfed commented Dec 4, 2023 •

edited

angelogladding commented Dec 5, 2023

metaformats: should we distinguish the parsed output item more explicitly? #2

metaformats: should we distinguish the parsed output item more explicitly? #2

Comments

snarfed commented Nov 29, 2023 • edited

sknebel commented Nov 30, 2023

aciccarello commented Dec 2, 2023 • edited

angelogladding commented Dec 4, 2023 • edited

snarfed commented Dec 4, 2023 • edited

angelogladding commented Dec 5, 2023

snarfed commented Nov 29, 2023 •

edited

aciccarello commented Dec 2, 2023 •

edited

angelogladding commented Dec 4, 2023 •

edited

snarfed commented Dec 4, 2023 •

edited