Skip to content

Commit

Permalink
Merge pull request #346 from oddbird/wpt-gh-actions
Browse files Browse the repository at this point in the history
WPT + GH Actions
  • Loading branch information
stacyk committed May 1, 2023
2 parents 5db3034 + 473c0d2 commit 384cd7c
Show file tree
Hide file tree
Showing 5 changed files with 206 additions and 0 deletions.
206 changes: 206 additions & 0 deletions content/blog/2023/web-platform-tests-polyfills.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,206 @@
---
title: Using Web Platform Tests to Write and Test Polyfills
author: ed
date: 2023-05-01
tags:
- Article
- Build Tools
- DevOps
- GitHub
- JavaScript
- Testing
image:
src: blog/2023/graph.jpg
summary: |
Learn how to leverage Web Platform Tests to ensure your polyfills are
implementing upcoming browser features correctly, including how to generate a
comprehensive report of failing/passing tests on each change.
---

{% import 'embed.macros.njk' as embed %}

## Web Platform Tests (WPT): What Are They?

The [Web Platform Tests](https://web-platform-tests.org/) (WPT) are a
cross-browser test suite for the web platform stack, including WHATWG, W3C, and
other specifications. They help browser vendors gauge their compatibility with
the web platform by writing tests that can be run in all browsers. This gives
confidence to both browser projects and web developers that they are shipping
software that is compatible with other implementations, and that they can rely
on the web platform to work across browsers and devices without needing extra
layers of abstraction. [The WPT
repository](https://github.com/web-platform-tests/wpt) also provides
documentation, tools, and resources for test writing and reviewing.

You can run all tests in your current browser by visiting
[wpt.live](https://wpt.live) or by cloning the repository and running the `./wpt
serve` command. The tests are organized in folders, and visiting any of the HTML
files will run all subtests and display the results. For example, visiting the
test
[`anchor-name-0002.html`](https://wpt.live/css/css-anchor-position/anchor-name-002.html)
with Chrome 112 results in two passing and two failing tests:

{{ embed.img(
src='blog/2023/wpt-fail-large.png',
alt='Test results for anchor-name-0002.html, showing two of four passing tests'
) }}

To get a bird's eye view of the test results for upcoming versions of major
browsers, visit [wpt.fyi](https://wpt.fyi).

## WPT and Polyfills

The primary users of WPT are browser vendors who want to evaluate development
versions of their browser against the test suite. However, polyfill authors can
also leverage WPT to run the test suite on stable browsers with their polyfill
loaded. This is useful to measure the impact of the polyfill in a quantitative
way that doesn't rely on manual testing.

WPT includes built-in support to run with an arbitrary JavaScript file inserted
as part of the server response. The process is explained in the [official
documentation](https://web-platform-tests.org/running-tests/testing-polyfills.html),
but it boils down to these steps:

1. Package your polyfill as a single-file, immediately-invoked script
2. Start WPT with `./wpt serve --inject-script=/path/to/polyfill.js`
3. Visit any test and ensure the script is inlined as the first element of the
`<head>`. This should be obvious when inspecting the page source.

At this point you should hopefully notice test scores have improved with the
help of the injected polyfill. If something goes wrong, WPT should also report
tests that are not completing or are failing for some other reason.

When we inject our [CSS anchor positioning
polyfill](https://github.com/oddbird/css-anchor-positioning/) and visit the
`anchor-name-0002.html` test again, we see that all tests are now passing:

{{ embed.img(
src='blog/2023/wpt-pass-large.png',
alt='Test results for anchor-name-0002.html, showing four of four passing tests'
) }}

## Monitoring WPT "Coverage"

An important part of polyfill development is ensuring the polyfill covers all or
most tests for a given feature to maintain compliance with the spec.
Furthermore, polyfill authors want to monitor how coverage changes as features
are added to the polyfill. Lastly, it's important to detect changes in test
results as the specs evolve and tests are added or modified.

You can do these things manually if you thoroughly visit all tests that apply to
your polyfill, but we wanted an automated way to inject the polyfill when
running WPT against all browsers, monitor the results for each run, and
understand how they change over time.

We have implemented an automated approach that accomplishes just that for our
[CSS anchor positioning
polyfill](https://github.com/oddbird/css-anchor-positioning/). The WPT results
are gathered and included in an HTML report that visually represents the number
of subtests that are passing each time a commit is pushed to `main`. This makes
it easy to get the general idea of which tests are passing and which ones
require more work. It also provides links to the test source and the test page
on [wpt.live](https://wpt.live) and the local server (assuming you have it
running). The HTML report can be viewed and shared as a standalone file, or
hosted on any service that supports static sites. We use Netlify to host the
[report page for the anchor position
polyfill](https://anchor-position-wpt.netlify.app/).

{{ embed.img(
src='blog/2023/wpt-report.png',
alt='Table showing multiple passing and failing tests for the main branch of the anchor position polyfill'
) }}

*Note: the report includes many failing tests because the polyfill is a work in
progress and is actively being worked on.*

Another important feature is the automatic generation of reports for pull
requests. This allows the team to quickly compare the report of a branch that
implements a new polyfill feature against the base report from the `main`
branch. GitHub Actions takes care of creating this PR report and [including a
comment](https://github.com/oddbird/css-anchor-positioning/pull/90#issuecomment-1458115810)
linking to it on all pull requests.

Let's dive into the steps required to automate the report generation.

### Step 1: Start the WPT Server With an Injected Polyfill

Before we start testing, we need to make sure the WPT server is running with the
`--inject-script` flag. This process needs to be started separately, and kept
running for the duration of the test runs. It's important to note that WPT
requires Python to run this server, so you will need to have it installed in
your system or CI environment.

### Step 2: Compile a List of Relevant Tests

Polyfills target specific browser features and are only concerned with specific
WPT results. To avoid running the entire WPT suite, we first need a list of
programmatically filtered-down tests that are relevant to our polyfill. The
final list of tests makes up the rows of the report.

Fortunately, WPT can [generate a manifest
file](https://web-platform-tests.org/running-tests/command-line-arguments.html?highlight=manifest#manifest)
that we can easily parse as JSON to apply our custom filter logic.

### Step 3: Compile a List of Relevant Browsers

When testing the polyfill, we want to make sure we cover a wide array of browser
versions and engines to reach as many users as possible. For anchor positioning,
we are leveraging [Browserslist](https://browsersl.ist/) to generate a list of
the two most recent versions of Chrome, Firefox, Edge, Safari, and Samsung
Internet. These make up the columns of the report, and each test will run
against each browser version. By using Browserslist we ensure the test suite is
running against new browser versions as they are released.

### Step 4: Programmatically Launch Instances of the Browsers

By using [BrowserStack](https://www.browserstack.com/) and
[Selenium](https://selenium.dev), we can spawn instances of all browsers and
point them to the relevant test. We recommend using an asynchronous approach to
run the browsers because the total number of test runs is the product of all
relevant tests and browser versions (rows times columns). For our test suite of
71 tests and 12 browser versions, that results in 852 runs, so the speed boost
from running multiple browsers at once is a welcome addition.

### Step 5: Gather Test Results

Visiting any WPT page in your browser will produce a human-readable report of
the subtest results, but WPT also supports programmatic access to these results.
Once the browser processes are started we can use this access to gather the
results and store them for later review and to generate the report.

In our case we are storing the raw test result data as a JSON file in case we
ever want to go back to a previous test run and analyze it further.

### Step 6: Generate the Report

Finally, with the test results available we use a
[LiquidJS](https://liquidjs.com/) template to generate an HTML file with a table
cell for each test and browser version combination. For ease of use we also
include links to the specific commit that generated the report, for both the
polyfill and WPT repositories (in case we are using a forked WPT repo).

We also save timestamped versions of the report to create an [historic
archive](https://anchor-position-wpt.netlify.app/history.html), which is very
useful to detect regressions or changes in tests for a given feature or spec.

## Putting It All Together

You can complete all the previous steps manually, but to get the most use out of
WPT we recommend including it as part of your CI. Based on the work of the
[Container Query
Polyfill](https://github.com/GoogleChromeLabs/container-query-polyfill) team,
the anchor position polyfill uses these files to automate the workflow:

- [`.github/workflows/test-wpt.yml`](https://github.com/oddbird/css-anchor-positioning/blob/63486097aeb4d2cda5bff03a4f7bef26bef8d1ba/.github/workflows/test-wpt.yml):
GitHub Actions workflow definition that holds environment variables and glues
all the pieces together, including building the polyfill and committing the
report to a separate branch that deploys to Netlify.
- [`tests/wpt.ts`](https://github.com/oddbird/css-anchor-positioning/blob/63486097aeb4d2cda5bff03a4f7bef26bef8d1ba/tests/wpt.ts):
The main file called by the GH Actions workflow. Takes care of most things
from gathering the WPT list to generating the report.
- [`tests/runner.html`](https://github.com/oddbird/css-anchor-positioning/blob/63486097aeb4d2cda5bff03a4f7bef26bef8d1ba/tests/runner.html):
Acts as the entrypoint for browsers to load tests and packages the results for
storage and reporting.
- [`tests/report.liquid`](https://github.com/oddbird/css-anchor-positioning/blob/63486097aeb4d2cda5bff03a4f7bef26bef8d1ba/tests/report.liquid):
HTML template for the report. The LiquidJS template language makes it easy to
iterate over and render the result data.
Binary file added src/images/blog/2023/graph.jpg
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added src/images/blog/2023/wpt-fail-large.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added src/images/blog/2023/wpt-pass-large.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added src/images/blog/2023/wpt-report.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.

0 comments on commit 384cd7c

Please sign in to comment.