harden datetime verification #1702

bnewbold · 2023-10-03T08:25:30Z

This adds stronger datetime verification in the syntax package, including a bunch of interop test cases, and swaps that in during Lexicon verification.

The prior verification was pretty lax, so I suspect this might cause some problems for other devs, or with existing records, when we roll it out.

bnewbold · 2023-10-12T02:21:22Z

Some updates after discussing with @devinivy earlier in the week.

This keeps a proper ensureValid, and that is what we should use for, eg, new record creation.

It also has normalizeDatetime, which has a dual role: it normalizes even valid datetimes so they sort correctly in a database column, and it is somewhat more lenient for existing createdAt records, though it will still throw on totally bogus stuff (eg, empty string or "asfd"). And then there is normalizeDatetimeAlways if we need a drop-in for toSimplifiedISOSafe; it returns UNIX epoch time if normalizeDatetime would have thrown.
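To make the three tiers concrete, here is a minimal sketch of the contract described above. It is not the code in the syntax package: the `*Sketch` names, the regex, and the use of the built-in `Date` parser are all assumptions made for illustration.

```typescript
// Hypothetical sketch of the three tiers; NOT the actual syntax package source.

// Assumed strict shape: full date, 'T', seconds, optional fraction, explicit timezone.
const STRICT_DATETIME =
  /^\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(\.\d+)?(Z|[+-]\d{2}:\d{2})$/

// Tier 1: strict check, intended for new record creation. Throws on anything off-spec.
function ensureValidDatetimeSketch(value: string): void {
  if (!STRICT_DATETIME.test(value)) {
    throw new Error('datetime does not match the strict syntax')
  }
  if (Number.isNaN(new Date(value).getTime())) {
    throw new Error('datetime was rejected by the Date parser')
  }
}

// Tier 2: lenient normalization. Accepts whatever Date can parse and emits
// a UTC ISO string, but still throws on input that is not a datetime at all.
function normalizeDatetimeSketch(value: string): string {
  const date = new Date(value)
  if (Number.isNaN(date.getTime())) {
    throw new Error('datetime could not be parsed at all')
  }
  return date.toISOString()
}

// Tier 3: never throws. Falls back to the UNIX epoch when tier 2 would throw.
function normalizeDatetimeAlwaysSketch(value: string): string {
  try {
    return normalizeDatetimeSketch(value)
  } catch (err) {
    return new Date(0).toISOString()
  }
}
```

In this sketch, a string such as `2023-10-12T02:21:22` (no timezone) fails the strict tier but still normalizes, while `asfd` fails both and only the "always" variant returns a value.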

Also rebased on top of main.

The thing I'm still not sure about, and would like review or feedback on, is whether the full strict validation will only happen on record creation. If that is true, great! Otherwise, if it is also validated when deserializing existing records, maybe we need special logic to distinguish between "new record creation" and "existing record parsing" for this corner-case. A test for this would probably be good.

dholms · 2023-10-12T18:40:18Z

packages/lexicon/src/validators/formats.ts

-    if (!isValidISODateString(value)) {
-      throw new Error()
-    }
+    ensureValidDatetime(value)


yeah this will happen everywhere - on new record creations & when validating existing records

I think we leave this code path as-is for now, and I can add in the more rigorous validation into the pds indexing path

Sure, makes sense. The normalizeDatetime function here would have pretty similar behavior to isValidISODateString, throwing a more specific error when a totally-not-a-datetime is encountered. But just sticking with exactly the current code would be less risky.

dholms

Looks great!

Left a note about where the validation happens & what I think we should do there. I'm happy to tackle that

bnewbold · 2023-10-12T20:59:12Z

If you could tackle that, would be much appreciated!

Noticed tests failed on this branch; I think they are just flaky, so I'm trying a re-run of the failed ones.

dholms · 2023-11-21T22:04:58Z

packages/pds/src/repo/prepare.ts

+    return
+  }
+  try {
+    ensureValidDatetime(createdAt)


added the extra check here

This means records with bad timestamps will still pass "lexicon validation" (which we want so that old records don't break application views), but they will no longer be able to be created on the pds

this specific approach only works as is because the only datetimes on our records are createdAt times & every createdAt property is a datetime. if we get other datetimes, we'll need to tweak this logic
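For readers skimming the thread, the shape of that creation-time check is roughly the following. This is a sketch under assumptions, not the contents of prepare.ts: the function names, the early-return condition, and the error message are hypothetical, and the strict validator is a simplified stand-in.

```typescript
// Hypothetical stand-in for the strict validator from the syntax package.
function strictDatetimeCheckSketch(value: string): void {
  const strict =
    /^\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}(\.\d+)?(Z|[+-]\d{2}:\d{2})$/
  if (!strict.test(value)) {
    throw new Error('invalid datetime')
  }
}

// Runs only on the record creation path, after ordinary lexicon validation.
// Assumption: createdAt is the only datetime field we need to look at.
function assertValidCreatedAtSketch(record: Record<string, unknown>): void {
  const createdAt = record['createdAt']
  // Records without a string createdAt are left to schema validation.
  if (typeof createdAt !== 'string') {
    return
  }
  try {
    strictDatetimeCheckSketch(createdAt)
  } catch (err) {
    // Re-throw with a message that names the offending field.
    throw new Error('createdAt must be a valid datetime')
  }
}
```

Because the check keys on the property name rather than on the schema, it would need to be generalized if a record type ever carries a datetime under another name.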

devinivy · 2023-11-21T22:18:06Z

packages/syntax/tests/datetime.test.ts

+describe('normalization', () => {
+  it('normalizes datetimes', () => {


I believe this fun vector originally seen from retr0.id still makes it through, where the normalized date ends up before year zero, which is invalid:

> normalizeDatetime('0000-01-01T00:00:00+01:00')
'-000001-12-31T23:00:00.000Z'

In toSimplifiedISOSafe() you'll find an extra validation check on the normalized output, which is what currently catches this.

just added a test for that, lmk how it looks 👍
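The vector can be reproduced with the built-in `Date` alone, which is presumably what the normalization relies on (an assumption; the thread does not show the implementation). The snippet below only demonstrates the timezone shift and a simple output check; the regex is illustrative.

```typescript
// Year 0000 at UTC+01:00 is one hour before year 0000 begins in UTC.
const input = '0000-01-01T00:00:00+01:00'
const normalized = new Date(input).toISOString()
// normalized === '-000001-12-31T23:00:00.000Z' (matches the output quoted above)

// Illustrative output check: a valid normalized datetime starts with a
// plain four-digit year, so the signed six-digit form is rejected.
const FOUR_DIGIT_YEAR = /^\d{4}-/
const outputLooksValid = FOUR_DIGIT_YEAR.test(normalized)
```

Here `outputLooksValid` is `false`, which is the kind of post-normalization check that catches this case.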

devinivy

Feels nice to have this nailed down.

bnewbold · 2023-11-21T23:40:38Z

I didn't see @dholms's small patch, and added a bigger patch which verifies that the output of the normalization function is itself valid. It also just disallows dates starting with 000 (aka, before year 0010) to ward off more trickery. The specs already somewhat disclaim the ability to handle very old dates.
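A rough sketch of those two guards together, assuming normalization is built on `Date`. The function name, regex, and error messages are hypothetical and do not come from the patch.

```typescript
// Hypothetical sketch: normalization that guards both its input and its output.
function normalizeWithGuardsSketch(value: string): string {
  // Guard 1: refuse years 0000 through 0009 up front.
  if (value.startsWith('000')) {
    throw new Error('datetime year is too far in the past')
  }
  const date = new Date(value)
  if (Number.isNaN(date.getTime())) {
    throw new Error('datetime could not be parsed')
  }
  const iso = date.toISOString()
  // Guard 2: the normalized output must itself look valid, so signed
  // or expanded years produced by a timezone shift are rejected.
  if (!/^\d{4}-\d{2}-\d{2}T\d{2}:\d{2}:\d{2}\.\d{3}Z$/.test(iso)) {
    throw new Error('normalized datetime is not valid')
  }
  return iso
}
```

With both guards in place, the year-zero vector from the earlier comment is rejected by the first check, and the second check remains as a backstop for anything else that normalizes to an out-of-range year.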

If Dan's fix is simpler/better we can just revert my patches. I resolved a merge conflict.

dholms · 2023-12-01T00:19:00Z

Yup yup I think that looks fine! finally gonna get this in 💪

bnewbold requested review from devinivy and dholms October 3, 2023 08:25

bnewbold added 6 commits October 11, 2023 19:04

syntax: add datetime validator (and interop tests)

cf90833

syntax: improve datetime normalization

443ef1f

lexicon: stronger datetime validation (from syntax package)

1d6c337

syntax: make datetime syntax norm test more flexible

bb9a0ed

make fmt

756aa57

datetime: docs, normalize and always variant

a394556

bnewbold force-pushed the bnewbold/harden-datetime branch from 6dac5d6 to a394556 Compare October 12, 2023 02:04

bsky replace toSimplifiedISOSafe with normalizeDatetimeAlways

d5e27d6

dholms reviewed Oct 12, 2023

dholms added 2 commits November 21, 2023 15:49

Merge branch 'main' into bnewbold/harden-datetime

a52bdcc

more rigorous datetime parsing on record creation

9220573

dholms reviewed Nov 21, 2023

devinivy reviewed Nov 21, 2023

handle negative dates

eac065a

devinivy approved these changes Nov 21, 2023

bnewbold added 2 commits November 21, 2023 15:37

syntax: disallow datetimes before year 0010

91f5d93

syntax: datetime normalization functions validate output

9f1c382

dholms merged commit c17971a into main Dec 1, 2023
10 checks passed

dholms deleted the bnewbold/harden-datetime branch December 1, 2023 00:19
