WP_HTML_Tag_Processor: Implement `get_attribute_names()` method #46685

ockham · 2022-12-20T16:56:29Z

What?

Add a get_attribute_names() method to WP_HTML_Tag_Processor, which returns the names of all attributes in a given tag.

$p = new WP_HTML_Tag_Processor( '<div enabled class="test" data-test-id="14">Test</div>' );
$p->next_tag();
$p->get_attribute_names() === array( 'enabled', 'class', 'data-test-id' );

Alternative to adding a get_attribute_by_prefix method (#46672).

Why?

Getting all attributes comes in handy for larger classes of attributes without knowing exactly which ones to expect. Examples include prefixed attributes for datasets (data-) and custom namespaces (e.g. ng- or x-).

Furthermore, it helps with syntax like <img wp-bind:src="myblock.imageSource" />, which is a requirement for the Block Interactivity API (see).

More background: WordPress/block-interactivity-experiments#118 (comment).

How?

Basically returns array_keys( $this->attributes ) 🤷‍♂️
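As a rough sketch of that idea (not the actual patch), assume the parsed attributes are held in an array keyed by attribute name. The class `Sketch_Tag` and its constructor are hypothetical stand-ins for illustration and are not part of WP_HTML_Tag_Processor:

```php
<?php
// Hypothetical stand-in: holds already-parsed attributes keyed by name.
class Sketch_Tag {
	private $attributes;

	public function __construct( array $attributes ) {
		$this->attributes = $attributes;
	}

	// Returns only the names, in source order; no attribute value is read.
	public function get_attribute_names() {
		return array_keys( $this->attributes );
	}
}

$tag = new Sketch_Tag(
	array(
		'enabled'      => true,
		'class'        => 'test',
		'data-test-id' => '14',
	)
);

$names = $tag->get_attribute_names();
```

Because the names are already the array keys, the call does no extra parsing, which is the point of the approach.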

Testing Instructions

See included unit test.

dmsnell · 2022-12-20T23:29:08Z

At first sight I like this approach better than get_all_attributes(). It's relying on data we've already parsed and allocated, and it doesn't have the same worst-case scenario.

Still, I wonder if get_attribute_names_with_prefix() might be worth entertaining. I would guess that the less we expose by default, the better the calling code will be. I'm far less concerned here about allowing zero-length prefixes, as it won't be the same foot-gun that get_attributes() is. Plus, it may be just annoying enough that it pushes people to ask only for the attributes they care about, leading to less noise in the output.

foreach ( $p->get_attribute_names_prefixed_by( 'wp-' ) as $attribute ) {
	// we won't have to filter out the attributes we don't care about here
}
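To make the prefix idea concrete, here is a minimal sketch that filters a plain list of attribute names. The helper `sketch_names_prefixed_by()` is hypothetical and only illustrates the filtering step; it assumes the names have already been collected elsewhere:

```php
<?php
// Hypothetical helper: keeps only the names that begin with $prefix.
function sketch_names_prefixed_by( array $names, $prefix ) {
	$length = strlen( $prefix );

	return array_values(
		array_filter(
			$names,
			function ( $name ) use ( $prefix, $length ) {
				// Compare only the first $length bytes of each name.
				return 0 === strncmp( $name, $prefix, $length );
			}
		)
	);
}

$names = array( 'class', 'wp-bind:src', 'data-test-id', 'wp-show' );

$wp_only = sketch_names_prefixed_by( $names, 'wp-' );
$all     = sketch_names_prefixed_by( $names, '' );
```

With a zero-length prefix every name matches, so asking for everything stays possible but has to be spelled out explicitly.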

adamziel · 2022-12-21T10:25:51Z

Oh hm, I guess $p->get_attribute_names_prefixed_by( '' ) would work. It is weird, though, isn't it? And being able to inquire about what was just parsed, as in the tag name, the attribute names, and the values, seems like a natural choice. Maybe it would make sense to have both get_attribute_names_prefixed_by and get_attribute_names...?

dmsnell · 2022-12-21T20:54:05Z

> It is weird, though, isn't it?

yes, and I like that, because the first impression we get is that we should be asking specifically for what we want rather than asking for everything.

the use of the tag processor with wp directives is also pushing this system into a range of operation that we considered worst-case scenario when developing it. that is, on render, it visits every single HTML tag in a document and inspects every single attribute (or in this case, every attribute name and a possibly-zero subset of those attributes).

if the system is a bit awkward to use in the more dire performance scenarios I think that's better than making the dangerous behaviors the easier approach.

it's more philosophical with this than with get_attributes_prefix_by() though because we've already allocated these attribute name values and can return them quickly with array_keys(). still, the prefix argument not only provides that subtle philosophical bump, it also seems like a convenience for calling code like the wp directives, because it eliminates incidental filtering by returning only what we want.

ockham · 2022-12-22T10:51:27Z

I agree with your reasoning @dmsnell. One thing to consider: I'm not sure if we'll have forever control over what methods people add to the class, and if something is missing that's all-too "obvious" or frequently requested, there's a chance that people might just go ahead and add it (a get_attributes() method that returns key/value pairs, in this case -- with all its performance implications). Now the question is if either get_attributes_by_prefix() or get_attribute_names() could be "good enough" so they wouldn't feel compelled to add get_attributes() 😬

I think there's a slightly higher chance that people will use get_attribute_names() to filter whatever attributes they're really interested in (especially if their filter criteria can't be expressed as a prefix) and then use get_attribute() on each individual attribute that they care about -- which would avoid the need for get_attributes().
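A minimal sketch of that pattern, using a hypothetical stand-in class (`Sketch_Processor`) rather than the real tag processor, and a deliberately contrived non-prefix criterion (names containing a colon) chosen only for illustration:

```php
<?php
// Hypothetical stand-in exposing the two calls discussed above.
class Sketch_Processor {
	private $attributes;

	public function __construct( array $attributes ) {
		$this->attributes = $attributes;
	}

	public function get_attribute_names() {
		return array_keys( $this->attributes );
	}

	// Returns null when the attribute is absent.
	public function get_attribute( $name ) {
		return isset( $this->attributes[ $name ] ) ? $this->attributes[ $name ] : null;
	}
}

$p = new Sketch_Processor(
	array(
		'class'        => 'test',
		'wp-bind:src'  => 'myblock.imageSource',
		'data-test-id' => '14',
	)
);

$values = array();
foreach ( $p->get_attribute_names() as $name ) {
	// Contrived criterion: skip names without a colon.
	if ( false === strpos( $name, ':' ) ) {
		continue;
	}
	// Only the attributes that pass the filter have their value read.
	$values[ $name ] = $p->get_attribute( $name );
}
```

Values are read one at a time and only for the names that survive the filter, so no bulk key/value getter is needed.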

(Then again, prefixes really seem to represent the bulk of filtering scenarios. I have yet to come up with a realistic non-prefix criterion. Anything that y'all can think of?)

> it's more philosophical with this than with get_attributes_prefix_by() though because we've already allocated these attribute name values and can return them quickly with array_keys()

Yeah. IMO, the fact that the names are so readily available could prompt people to want to expose them via a get_attribute_names() method (see my initial argument at the top of this comment).

I don't really feel strongly one way or another. Based on the above, I might be slightly inclined towards get_attribute_names. In addition to those arguments, it's also simpler to implement and requires fewer computations by the class method. So I guess it's both "more obvious" and "cheaper", which might be strong enough arguments in its favor.

adamziel · 2022-12-22T14:18:57Z

@dmsnell I don't mind asking for all the attributes – it's what I'm used to in many other APIs. As for the awkward method call – I like it because of the reasons you mentioned, and at the same time I don't like how it forces me, the API consumer, into using a specific naming pattern for my attributes.

All in all, I don't have a strong opinion here. I'd slightly prefer get_attribute_names(), but I'm happy to move forward with either.

adamziel · 2022-12-22T14:20:59Z

phpunit/html/wp-html-tag-processor-test.php

+	 * @covers WP_HTML_Tag_Processor::get_attribute_names
+	 */
+	public function test_get_attribute_names() {
+		$p = new WP_HTML_Tag_Processor( '<div enabled class="test" data-test-id="14">Test</div>' );


Let's also test what happens when:

There are no attributes on a given tag

An attribute was added but not flushed

An attribute was added and flushed

We're on a tag closer

We're not on any tag (before the first next_tag)

Any other corner cases you can think of @dmsnell ?

Thank you, these are all great suggestions -- I'll add them 👍

(Even if we proceed with get_attribute_names_prefixed_by(), we should be able to carry them over 😊 )

I've added tests for the following in ca06c64:

There are no attributes on a given tag

We're on a tag closer

We're not on any tag (before the first next_tag)

(Mostly by copying and modifying the corresponding get_attribute() tests. This also included a test that navigates to a non-existing tag and verifies that get_attribute_names() returns null.)

Since there wasn't a test to check that get_attribute() returned null when in a closing tag, I've added one in 8c51859.

As for the remaining test cases

An attribute was added but not flushed
An attribute was added and flushed

...I'd like to wait for the outcome of #46680 before adding coverage for "added but not flushed", since we might need it.

"Added and flushed" I'll add now 👍

"Added and flushed" I'll add now 👍

5b248c0

dmsnell · 2022-12-22T15:59:34Z

> I have yet to come up with a realistic non-prefix criterion

this is probably a good enough reason for now to stick with get_attribute_names_prefixed_by() IMO since we're contrasting something that maintains relative performance characteristics and meets a known need vs. creating something that is arguably more familiar but doesn't have any known need at this time.

to be clear, I was not arguing for get_attributes_prefixed_by() but rather get_attribute_names_prefixed_by()

> I'm not sure if we'll have forever control over what methods people add to the class, and if something is missing that's all-too "obvious" or frequently requested, there's a chance that people might just go ahead and add it

Yeah I get that, but while we have the privilege and responsibility I want to let the known needs drive the interface and how it evolves. The tag processor is defined so far by its constraints, and arguably it's already an inconvenient interface because of it (e.g. no "get parent node"), but those constraints are also what makes it possible to exist.

> it's what I'm used to

that's no surprise! DOM APIs expose all sorts of conveniences and expectations that we don't have, but the tag processor is fundamentally different than those APIs. the fact that it looks different might actually help us out here by making it more obvious that our other familiarities and assumptions about how it works may not carry over; it might help keep people from feeling burnt by this and giving up on it.

given that we're not preventing people from reading all the attribute names with this it may even serve as a bridge to help educate people how the API works as they use it.

> at the same time I don't like how it forces me, the API consumer, into using a specific naming pattern for my attributes.

I'm probably missing something here because I don't follow how this forces you to make any specific naming choices. the argument, as I understand it, is you probably already have prefixed attributes you care about because that's almost universally how all of these embedded DSLs work, and getting only those prefixed-attributes can actually remove a few steps from calling code because of it.

in the case you want them all you do the dirty thing and request a zero-length prefix. it's good that it stands out, because it provides a moment of pause to reflect on whether that's actually needed (usually isn't). so I'm looking at it like this - how can we preserve the freedom for someone to do this while making the likely-more-appropriate thing the clearer first choice?

ockham · 2023-01-02T11:03:28Z

Rebased (to include #46748).

ockham · 2023-01-02T13:38:55Z

> to be clear, I was not arguing for get_attributes_prefixed_by() but rather get_attribute_names_prefixed_by()

My bad, I had indeed read that as get_attributes_by_prefix()!

Your reasoning makes sense to me -- I'll file a PR to implement get_attribute_names_prefixed_by() 😄

ockham · 2023-01-02T15:48:00Z

> I'll file a PR to implement get_attribute_names_prefixed_by() 😄

#46840

ockham · 2023-01-10T15:11:28Z

Closing in favor of #46840.

ockham requested review from adamziel and dmsnell December 20, 2022 16:56

ockham self-assigned this Dec 20, 2022

ockham requested a review from spacedmonkey as a code owner December 20, 2022 16:56

This was referenced Dec 20, 2022

WP_HTML_Tag_Processor: Add get_attributes_by_prefix() method #46672

Closed

Experiment with WP_HTML_Tag_Processor WordPress/block-interactivity-experiments#118

Closed

ockham force-pushed the add/get-attribute-names branch from 0e8cc1a to 8f6e866 Compare December 21, 2022 09:32

adamziel reviewed Dec 22, 2022

ockham mentioned this pull request Dec 22, 2022

SSR: Experiment with WP_HTML_Tag_Processor WordPress/block-interactivity-experiments#125

Closed

ockham added 2 commits January 2, 2023 12:03

WP_HTML_Tag_Processor: Implement get_attribute_names()

4e4b957

Remove superfluous parameter comment

1573010

ockham force-pushed the add/get-attribute-names branch from a9fd045 to 1573010 Compare January 2, 2023 11:03

ockham added 5 commits January 2, 2023 14:45

Move test down

8be4ac9

Add test for get_attribute() when in closing tag

8c51859

Add more get_attribute_names tests()

ca06c64

Fix implementation

2dbf838

Verify that get_attribute_names finds attribute added by set_attribute

5b248c0

ockham mentioned this pull request Jan 2, 2023

WP_HTML_Tag_Processor: Add get_attribute_names_with_prefix() method #46840

Merged

ockham commented Jan 2, 2023

Fix PHPDoc string

9c3ab7e

ockham mentioned this pull request Jan 3, 2023

Tracking Issue: Server-side rendering of directives WordPress/block-interactivity-experiments#120

Closed

2 tasks

ockham closed this Jan 10, 2023

ockham deleted the add/get-attribute-names branch January 10, 2023 15:11
