[WIP] Unit tests #126

JKingweb · 2023-07-09T02:03:43Z

This patch constitutes a series of generic unit tests for microformat parsers, using the existing test format. While it is not possible to segregate certain parsing features entirely (notably implied property parsing), I've attempted to keep to the following design philosophy:

each file tests a single feature or facet of parsing, as much as possible
types of microformats and names of properties avoid known vocabularies to emphasize parsing is generic
no two microformats in a given file produce the same output, to make output comparsison easier
example.test is used instead of example.com, as the former is guaranteed never to be a real domain
tests try to be as thorough as practical
standard features which have experimental alternatives are tested in isolated files so that they may be skipped easily
under- or un-specified behaviour is tested in isolated files prefixed with tentative- so that they may be easily skipped; these files reflect common behaviour among established implementations where there is majority agreement
leave existing tests alone to validate that these tests do not contradict existing tests

At present the test suite offers only partial coverage. The to-do list is thus:

I am posting this while still far from finished to gather feedback early. I expect it will take me quite a while to do everything, but what's here can already be useful to implementers.

There is much divergent and poorly-documented on display here

As suggested by gRegor and Tantek Çelik, this avoids any implication we are testing real vocabularies, past, present, or future

tantek

Spot checked all the files and what I checked looked good. Lots here and at this point I think we should land this and see how existing parser implementations do, and then investigate any failures in detail to verify that any failing test is itself valid.

JKingweb · 2023-07-18T22:11:41Z

I have no objections to merging the work thus far as a partial test suite, though I should probably write up some draft documentation (more or less what's detailed in the cover of this request) first. Is there a preferred format? Plain text, markdown, HTML, something else?

Zegnat · 2023-11-05T10:05:00Z

Personally I think a markdown file containing basically what is in the PR description would be nice. Linked to from README, I would say, and maybe even at the root level of the repository. But there does not seem to be any precedent.

There are some changelog files, but honestly I have never read them, in part because they are HTML files. That makes it basically a requirement to clone the repo and open it with a browser to read comfortably. That is why I would prefer markdown, GitHub has native support for it, and it will be easier to refer to it inside the repo.

gRegorLove · 2023-11-22T21:01:07Z

I've been using https://keepachangelog.com/ format recently on some projects and liking it.

JKingweb added 30 commits July 6, 2023 20:51

Unit tests for microformat and property names

b903c82

Whitespace-related name tests

246d842

Tests for multiple root types on one element

971c603

Add negative tests for property names based on pefix

a87f810

Document common behaviour of duplicate properties

384b73a

Split underspecified behaviour from certain stuff

0a3e9d4

Tests for p-properties

8b5b80a

Pre-empt all possible implied properties

9b0a510

Tests for u-properties

9e01ff0

Add missing "else" u- test

dcc26ce

Tests for dt-properties and e-properties

fd28ab7

More negative tests for property parsing

4e09cf6

Implied name tests

1de63e0

Make explicit when properties are not part of test

904d24c

Add no-trim tests for implied name

3c4fcab

Compress whitespace

bc48625

Tests for implied photo

7c42894

Fix various test errors

e310fa4

Tests for implied url

de96b69

VCP tests for p-properties

204a0c5

VCP tests for u-properties

5639c6a

More p- and u- VCP tests

f1d4be6

Restore accidentally deleted test

461803e

VCP tests for dt-properties

15a2179

Use less problematic date format

c97a416

Add special-case elements for VCP date parsing

244dbe9

Whitespace cleanup

7b7b252

VCP test for e-properties

69c7dcf

Tests for microformat nesting in its sudry forms

96cc6f6

There is much divergent and poorly-documented on display here

Use "test" prefix for roots to avoid confusion

a69ed8d

As suggested by gRegor and Tantek Çelik, this avoids any implication we are testing real vocabularies, past, present, or future

tantek approved these changes Jul 18, 2023

View reviewed changes

JKingweb added 2 commits July 22, 2023 16:37

Fix some test errors

b7f9da2

Ensure output is the same with either textContent

2c2840c

Add a README file for unit tests

cd4eaee

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Unit tests #126

[WIP] Unit tests #126

JKingweb commented Jul 9, 2023 •

edited

tantek left a comment

JKingweb commented Jul 18, 2023

Zegnat commented Nov 5, 2023

gRegorLove commented Nov 22, 2023

[WIP] Unit tests #126

Are you sure you want to change the base?

[WIP] Unit tests #126

Conversation

JKingweb commented Jul 9, 2023 • edited

tantek left a comment

Choose a reason for hiding this comment

JKingweb commented Jul 18, 2023

Zegnat commented Nov 5, 2023

gRegorLove commented Nov 22, 2023

JKingweb commented Jul 9, 2023 •

edited