Add Text/replace builtin #1065

alexhumphreys · 2020-09-06T06:10:23Z

This attempts to standardise the Text/replace builtin.

There's a few parts I wasn't sure on (eg. the B case for the test tests/parser/success/builtinsA.dhall), and there's probably plenty other things I missed so best to look over this carefully.

Also, wasn't sure how best to describe this feature in the prose sections (doc strings in prelude, Language-tour.md, etc.). I feel the need to use the phrase "substring" but Dhall has Text not String so maybe that's misleading. So if anyone has suggestions to make those descriptions more clear I'd appreciate it.

Fixes #1051.

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Gabriella439

Looks great so far!

I think one thing we need to decide is what to do if the substring to replace is "", in order to avoid an infinite loop. Some possibilities I can think of are:

Silently perform no replacement if the substring to replace is empty
Return an Optional with a None result
Require the substring to replace to be statically known at type-checking time and return a type error if empty

Prelude/Text/replace.dhall

standard/beta-normalization.md

alexhumphreys · 2020-09-07T07:33:09Z

As regards "", I'd vote for the first choice of do nothing. The second choice of returning an optional seems unergonomic. The third choice seems like an odd type error, given that "non-empty string" (or non-empty list for that matter) isn't a type that exists anywhere else in the language.

Just tried ruby to see what it does in this case and it was not what I was expecting:

"foo".gsub "" "bar" == "barfbarobarobar"

... not sure we want to follow that example 😅 Though I can see a case to be made for Text/replace "" "foo" "" === "foo".

I don't feel strongly about this though, whatever ye all prefer is cool.

Co-authored-by: Gabriel Gonzalez <Gabriel439@gmail.com>

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Co-authored-by: Gabriel Gonzalez <Gabriel439@gmail.com>

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

alexhumphreys · 2020-09-16T12:55:27Z

@Gabriel439 I've committed your suggested change defining normalisation in terms of recursion. I've also added a note to do nothing in the case of "" as the substring to replace, and if we decide on some other behaviour I can update this later.

Gabriella439 · 2020-09-18T04:59:56Z

@alexhumphreys: I'm taking vacation this week, but I'll review again afterwards

alexhumphreys · 2020-09-18T05:30:31Z

Ah nice, enjoy the vacation! 🌴

Gabriella439

Could you also add a test for replacing an empty string? That way, implementations won't forget to handle that case

Other than that, this looks to me like this proposal is basically ready to move out of Draft status

docs/references/Built-in-types.md

docs/tutorials/Language-Tour.md

Gabriella439 · 2020-09-19T22:10:42Z

standard/beta-normalization.md

+If the substring to replace is empty (`""`), then no replacement is performed:
+
+
+    f ⇥ Text/replace "" replacement   a ⇥ "foo"
+    ────────────────────────────────────────────
+    f a ⇥ "foo"


Could you move this judgement to be the first or second one? The standard has an implicit "first judgement wins" convention that we've been using when more than one judgement applies, and we want this judgment to take precedence

Co-authored-by: Gabriel Gonzalez <Gabriel439@gmail.com>

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

alexhumphreys · 2020-09-20T08:09:07Z

@Gabriel439 I've added that empty string test. I still don't have the B case for tests/parser/success/builtinsA.dhall as I'm not sure what Text/replace gets parsed as, but aside from that it's good for me to move this out of draft status.

sjakobi · 2020-09-20T13:04:07Z

Isn't the normalization still way under-specified? How are composite characters supposed to be handled, for example? (I believe @Profpatsch had some opinions about that in some discussion about Text operations)

Gabriella439 · 2020-09-21T02:58:32Z

@sjakobi: I'd propose doing replacement at the grapheme level, as suggested by this test:

dhall-lang/Prelude/Text/replace.dhall

Line 12 in 595c8ca

    
           let example2 = assert : replace "👨" "👩" "👨‍👩‍👧‍👦" ≡ "👩‍👩‍👧‍👦"

… and for correctly handling composite characters I'd suggest performing the replace if the "needle" and the prefix are both canonically equivalent, meaning that they have the same normalization form, specifically Normalization Form C.

sjakobi · 2020-09-21T12:50:26Z

@Gabriel439 Sounds good to me!

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

alexhumphreys · 2020-09-26T06:32:53Z

@Gabriel439 @sjakobi I updated this with a note on using NFC. Well, I pretty much just took your comment from above. Does this need more detail or would that be clear enough?

Gabriella439 · 2020-09-27T18:25:38Z

Prelude/Text/replace.dhall

+
+let example1 = assert : replace "💣" "💥" "💣💣💣" ≡ "💥💥💥"
+
+let example2 = assert : replace "👨" "👩" "👨‍👩‍👧‍👦" ≡ "👩‍👩‍👧‍👦"


Could you also add this as one of the test cases in the standard test suite?

Yep, updated. Should I leave this in the prelude? Just wondering how easy this will be for all implementations, since most users importing the prelude won't need this particular replacement.

I think it is also worth keeping in the Prelude since this is useful documentation for the end user. These examples will also show up in the generated docs

Gabriella439 · 2020-10-03T21:44:09Z

@alexhumphreys: I have a PR open with all of my suggested changes in: alexhumphreys#1

I decided to skip handling normalization for now (and adding a test making that explicit). I can standardize support for Unicode normalization in a separate proposal, to keep this pull request simpler.

… as standardized in dhall-lang/dhall-lang#1065

Gabriella439 · 2020-10-04T01:29:30Z

I also have a matching change to the Haskell implementation up here: dhall-lang/dhall-haskell#2063

Suggested changes for dhall-lang#1065

alexhumphreys · 2020-10-04T19:40:48Z

@Gabriel439 thanks for the PR, those beta normalisation rules are a lot clearer. It also thought me a bit about string interpolation: didn't know it would get normalised under a lambda binding before the lambda is applied.

Splitting the Unicode handling to a different PR is cool too. Would let example2 = assert : replace "👨" "👩" "👨‍👩‍👧‍👦" ≡ "👩‍👩‍👧‍👦" still remain in the prelude for this PR? I think the output of that wouldn't change between NFC and NFD normalisation, so it should be ok.

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Gabriella439 · 2020-10-04T22:49:19Z

@alexhumphreys: I think that example would be the same regardless of how we handle normalization

sjakobi

LGTM apart from one more wibble. Thanks! :)

docs/references/Built-in-types.md

Co-authored-by: Simon Jakobi <simon.jakobi@gmail.com>

Gabriella439 · 2020-10-09T16:08:44Z

@alexhumphreys: You should be clear to merge this

alexhumphreys · 2020-10-09T16:19:24Z

@Gabriel439 sweet! Any 3 day waiting period or can I just hit the button?

Gabriella439 · 2020-10-09T18:26:49Z

@alexhumphreys: Since you don't have a majority of approvals but you also don't have a majority of disapprovals then the policy is 7 days since the PR was ready (i.e. not a draft PR) and 1 day since last code change. For more details, see: https://github.com/dhall-lang/dhall-lang/blob/master/.github/CONTRIBUTING.md#how-do-changes-get-approved

alexhumphreys · 2020-10-10T07:45:50Z

Ah cool, so it should be good to be merged. Looks like I don't have write access so maybe one of ye wants to hit the button? Also there's 25 commits in this PR now, do ye have a policy of squashing commits or should I go in and tidy up the history a bit?

philandstuff · 2020-10-10T08:17:11Z

This repo enforces squash-on-merge. We should get you the commit bit so you can merge yourself 👍🏻

philandstuff · 2020-10-10T09:11:38Z

@alexhumphreys I’ve granted you write access now, check your email

alexhumphreys · 2020-10-10T09:46:29Z

Sweet! Thanks @philandstuff, and everyone else too for helping on this 🙂

Gabriella439 · 2020-10-12T01:58:12Z

@alexhumphreys: Thank you, too! Also, now that this is merged you can submit an invoice here for $200 and one of us will approve it: https://opencollective.com/dhall/expenses/new

alexhumphreys · 2020-10-12T09:08:45Z

Thanks, but it's ok! I've been meaning to donate to that Dhall fund so consider this a contribution 🙂

… as standardized in dhall-lang/dhall-lang#1065

basile-henry

Sorry, I'm a bit late for a review. I just have a question as I am attempting to implement this in dhall-rust. 😅

basile-henry · 2020-10-16T19:00:50Z

tests/normalization/success/unit/TextReplaceAbstractA.dhall

+{- This test verifies that an implementation correctly permits both the
+   "replacement" and the "haystack" to be abstract.
+-}
+λ(x : Text) → λ(y : Text) → Text/replace "a" "-${x}-" "_a_${y}_a_"


I don't really understand how the haystack can be abstract. Don't we want to replace occurrences of the needle in the abstract section too? i.e. What happens if y is the string "a"?

@basile-henry: Oh, I see what you mean. Allowing the haystack to be abstract means that β-reduction is no longer confluent. For right now, just ignore that test and I can put up a change to require the haystack to be non-abstract before further reduction can occur.

alexhumphreys marked this pull request as draft September 6, 2020 06:11

alexhumphreys force-pushed the feature/text/replace branch from 50e1ea6 to e083884 Compare September 6, 2020 06:21

Add Text/replace builtin

e083884

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Gabriella439 reviewed Sep 7, 2020

View reviewed changes

Prelude/Text/replace.dhall Show resolved Hide resolved

standard/beta-normalization.md Outdated Show resolved Hide resolved

standard/beta-normalization.md Outdated Show resolved Hide resolved

alexhumphreys and others added 5 commits September 7, 2020 11:53

Update standard/beta-normalization.md

42fb6ff

Co-authored-by: Gabriel Gonzalez <Gabriel439@gmail.com>

Add extra unicode test

db056f0

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

fix line formatting

df2405b

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Define behaviour by induction in standard/beta-normalization.md

bb814eb

Co-authored-by: Gabriel Gonzalez <Gabriel439@gmail.com>

Merge branch 'master' into feature/text/replace

ae955b9

alexhumphreys changed the title ~~[WIP] Add Text/replace builtin~~ Add Text/replace builtin Sep 16, 2020

Add a note about empty 'needle' argument

9182692

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

alexhumphreys force-pushed the feature/text/replace branch from 7ac0709 to 9182692 Compare September 16, 2020 12:54

Nadrieril approved these changes Sep 19, 2020

View reviewed changes

Gabriella439 reviewed Sep 19, 2020

View reviewed changes

alexhumphreys and others added 4 commits September 20, 2020 09:53

Add named arguments in docs/references/Built-in-types.md

2577b6e

Co-authored-by: Gabriel Gonzalez <Gabriel439@gmail.com>

Add replace newlines example

cae8ec7

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Reorder Text/replace beta-normalization rules

a61ff5f

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Add test for replacing an empty string

595c8ca

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

alexhumphreys marked this pull request as ready for review September 20, 2020 08:09

Specify normalisation form for comparison

e67a627

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Gabriella439 approved these changes Sep 27, 2020

View reviewed changes

Add missing decoding judgment

8affa98

Gabriella439 added a commit to dhall-lang/dhall-haskell that referenced this pull request Oct 4, 2020

Implement Text/replace

3db42ee

… as standardized in dhall-lang/dhall-lang#1065

Gabriella439 mentioned this pull request Oct 4, 2020

Implement Text/replace dhall-lang/dhall-haskell#2063

Merged

Merge pull request #1 from dhall-lang/gabriel/Text/replace

2e9c134

Suggested changes for dhall-lang#1065

Alex Humphreys added 2 commits October 4, 2020 21:46

Swap subset for substring

e217c2a

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Remove specific Unicode normalisation form

3efd28f

Signed-off-by: Alex Humphreys <alex.humphreys@here.com>

Gabriella439 approved these changes Oct 4, 2020

View reviewed changes

philandstuff mentioned this pull request Oct 6, 2020

Version 18.0.0 -> 19.0.0 #1080

Merged

sjakobi approved these changes Oct 8, 2020

View reviewed changes

docs/references/Built-in-types.md Outdated Show resolved Hide resolved

Update docs/references/Built-in-types.md

c03ffcf

Co-authored-by: Simon Jakobi <simon.jakobi@gmail.com>

Merge branch 'master' into feature/text/replace

4d9d42f

alexhumphreys merged commit 16160eb into dhall-lang:master Oct 10, 2020

Gabriella439 added a commit to dhall-lang/dhall-haskell that referenced this pull request Oct 13, 2020

Implement Text/replace (#2063)

56bf116

… as standardized in dhall-lang/dhall-lang#1065

alexhumphreys deleted the feature/text/replace branch October 15, 2020 06:40

basile-henry reviewed Oct 16, 2020

View reviewed changes

This was referenced Oct 16, 2020

Implement Text/replace Nadrieril/dhall-rust#181

Merged

Text/replace on interpolated text #1084

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add Text/replace builtin #1065

Add Text/replace builtin #1065

alexhumphreys commented Sep 6, 2020

Gabriella439 left a comment

alexhumphreys commented Sep 7, 2020 •

edited

Loading

alexhumphreys commented Sep 16, 2020

Gabriella439 commented Sep 18, 2020

alexhumphreys commented Sep 18, 2020

Gabriella439 left a comment

Gabriella439 Sep 19, 2020

alexhumphreys Sep 20, 2020

alexhumphreys commented Sep 20, 2020 •

edited

Loading

sjakobi commented Sep 20, 2020

Gabriella439 commented Sep 21, 2020

sjakobi commented Sep 21, 2020

alexhumphreys commented Sep 26, 2020

Gabriella439 Sep 27, 2020

alexhumphreys Sep 28, 2020

Gabriella439 Sep 28, 2020

Gabriella439 commented Oct 3, 2020

Gabriella439 commented Oct 4, 2020

alexhumphreys commented Oct 4, 2020

Gabriella439 commented Oct 4, 2020

sjakobi left a comment

Gabriella439 commented Oct 9, 2020

alexhumphreys commented Oct 9, 2020

Gabriella439 commented Oct 9, 2020

alexhumphreys commented Oct 10, 2020

philandstuff commented Oct 10, 2020

philandstuff commented Oct 10, 2020 •

edited

Loading

alexhumphreys commented Oct 10, 2020

Gabriella439 commented Oct 12, 2020

alexhumphreys commented Oct 12, 2020

basile-henry left a comment

basile-henry Oct 16, 2020

Gabriella439 Oct 16, 2020 •

edited

Loading


		let example1 = assert : replace "💣" "💥" "💣💣💣" ≡ "💥💥💥"

		let example2 = assert : replace "👨" "👩" "👨‍👩‍👧‍👦" ≡ "👩‍👩‍👧‍👦"

Add Text/replace builtin #1065

Add Text/replace builtin #1065

Conversation

alexhumphreys commented Sep 6, 2020

Gabriella439 left a comment

Choose a reason for hiding this comment

alexhumphreys commented Sep 7, 2020 • edited Loading

alexhumphreys commented Sep 16, 2020

Gabriella439 commented Sep 18, 2020

alexhumphreys commented Sep 18, 2020

Gabriella439 left a comment

Choose a reason for hiding this comment

Gabriella439 Sep 19, 2020

Choose a reason for hiding this comment

alexhumphreys Sep 20, 2020

Choose a reason for hiding this comment

alexhumphreys commented Sep 20, 2020 • edited Loading

sjakobi commented Sep 20, 2020

Gabriella439 commented Sep 21, 2020

sjakobi commented Sep 21, 2020

alexhumphreys commented Sep 26, 2020

Gabriella439 Sep 27, 2020

Choose a reason for hiding this comment

alexhumphreys Sep 28, 2020

Choose a reason for hiding this comment

Gabriella439 Sep 28, 2020

Choose a reason for hiding this comment

Gabriella439 commented Oct 3, 2020

Gabriella439 commented Oct 4, 2020

alexhumphreys commented Oct 4, 2020

Gabriella439 commented Oct 4, 2020

sjakobi left a comment

Choose a reason for hiding this comment

Gabriella439 commented Oct 9, 2020

alexhumphreys commented Oct 9, 2020

Gabriella439 commented Oct 9, 2020

alexhumphreys commented Oct 10, 2020

philandstuff commented Oct 10, 2020

philandstuff commented Oct 10, 2020 • edited Loading

alexhumphreys commented Oct 10, 2020

Gabriella439 commented Oct 12, 2020

alexhumphreys commented Oct 12, 2020

basile-henry left a comment

Choose a reason for hiding this comment

basile-henry Oct 16, 2020

Choose a reason for hiding this comment

Gabriella439 Oct 16, 2020 • edited Loading

Choose a reason for hiding this comment

alexhumphreys commented Sep 7, 2020 •

edited

Loading

alexhumphreys commented Sep 20, 2020 •

edited

Loading

philandstuff commented Oct 10, 2020 •

edited

Loading

Gabriella439 Oct 16, 2020 •

edited

Loading