SIP-56 - Proper Specification for Match Types. #65

sjrd · 2023-08-11T09:30:13Z

Currently, match type reduction is not specified, and its implementation is by nature not specifiable.
This is an issue because match type reduction spans across TASTy files (unlike, for example, type inference or GADTs), which can and will cause old TASTy files to fail to link in future versions of the compiler.

This SIP proposes a proper specification for match types, which does not involve type inference.
It is based on baseType computations and subtype tests involving only fully-defined types.
That is future-proof, because baseType and subtyping are defined in the specification of the language.

The proposed specification defines a subset of current match types that are considered legal.
Legal match types use the new, specified reduction rules.
Illegal match types are rejected, which is a breaking change; the previous behavior can be recovered under -source:3.3.

SethTisue · 2023-08-11T20:58:15Z

content/match-types-spec.md

+Their specification is complicated, and the implementation as well.
+Our quantitative study showed that they were however "often" used (10 occurrences spread over 4 libraries).
+In each case, they seem to be a way to express what Scala 2 type projections (`A#T`) could express.
+While not quite as powerful as type projections (which were shown to be unsound), match types with type member extractors delay things enough for actual use cases to be meaningful.


@dwijnand you might have something to say about this? I know it's come up repeatedly in our discussions about match types that it's possible to use them to smuggle in something akin to abstract type projections.

something akin to abstract type projections

With the proposed spec, only concrete type projections actually make it. As long as it's abstract, ExtractTOf[A] cannot reduce to T like A#T does. The difference is that A#T has the "best" bounds of T in A, whereas ExtractTOf[A] has only the match type's bound as long as T is not concrete (through subsequent substitution of A by something more precise in which T is a type alias whose rhs does not depend on its prefix).

For example,

    class Base {
      type T
      def someT: T = ???
    }

    type ExtractTOfAux[A] = Base { type T = A }
    type ExtractTOf[X <: Base] = X match
      case ExtractTOfAux[t] => t

    // Scala 2 abstract type projection, works
    def fooOfBaseProj[B <: Base](b: B): B#T = b.someT

    // Scala 3 match type extractor: does not work because the match type does not reduce
    // as T is not concrete in B.
    def fooOfBaseMT[B <: Base](b: B): ExtractTOf[B] = b.someT

The match-type-based type member extractor still features enough power because you can "carry around" values of that type, and at the edge your B becomes concrete enough that you can reduce. This is what happens with scala.Enumerations.
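
To illustrate the "concrete enough at the edge" case, here is a minimal sketch in Scala 3 syntax. It reuses the definitions from the example above; `IntBase` and `extracted` are hypothetical names introduced only for this illustration, and the sketch assumes a compiler that supports type member extractors as described in this thread.

```scala
class Base:
  type T
  def someT: T = ???

type ExtractTOfAux[A] = Base { type T = A }
type ExtractTOf[X <: Base] = X match
  case ExtractTOfAux[t] => t

// Hypothetical subclass: here T is a type alias whose right-hand side
// does not depend on its prefix, so T is concrete in IntBase.
class IntBase extends Base:
  type T = Int
  override def someT: Int = 42

// With a concrete scrutinee, the match type is expected to reduce to Int.
val evidence = summon[ExtractTOf[IntBase] =:= Int]
val extracted: ExtractTOf[IntBase] = (new IntBase).someT
```

The contrast with `fooOfBaseMT` is that the scrutinee here is `IntBase` rather than an abstract `B <: Base`, so the extractor has a concrete alias to bind `t` to.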

IMO, people should not use that. It's all a hack. Instead, we should use path-dependent types with actual values of the enclosing class. For example the proper definition of fooOfBase is:

def fooOfBasePathDep(b: Base): b.T = b.someT
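
As a rough sketch of how the path-dependent version behaves at a call site (Scala 3 syntax; `StringBase`, `sb`, and `result` are hypothetical names used only for illustration): because the argument is a stable value whose type fixes `T`, the result type `sb.T` is known to be `String` without any match type or type projection.

```scala
class Base:
  type T
  def someT: T = ???

def fooOfBasePathDep(b: Base): b.T = b.someT

// Hypothetical subclass that fixes T to String.
class StringBase extends Base:
  type T = String
  override def someT: String = "hello"

// sb is a stable value, so the result type is sb.T, which is String.
val sb = new StringBase
val result: String = fooOfBasePathDep(sb)
```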

Unfortunately, people have been using type projections/type member extractors instead. Yes, sometimes it avoids one extra layer of nesting: if you need a class that extends Foo[b.type], you would need b to be declared one level up, and so that class needs to be nested in another class that has a b field. This is why the json4s workaround is what it is, instead of turning it into path-dependent types. The correct solution would be to have

class EnumSerializers(val enumeration: Enumeration) { class EnumSerializer extends Serializer(enumeration.Value) { ... } }

but that would have changed the public API, so it was not an acceptable PR to make.

soronpo · 2023-08-12T08:43:25Z

I gave only the specification part to GPT-4 for review. Here is what it mentioned:
The provided specification for the proposed SIP regarding Match Types in Scala 3 is thorough and quite detailed. However, based on the text you've provided, here are a few observations and potential issues:

Clarity on Syntactical Components:
- When you mention various constructs like TypeCapture, TypeWithoutCapture, etc., it would be helpful to provide some simple examples or even a brief definition for each. This ensures that the reader is clear on what constitutes each of these syntactical elements.
Nested Captures in Legal Patterns:
- The specification mentions that the following is not a legal pattern: case Inv[Cov[t]] => because a type capture is nested two levels below a non-covariant type constructor. However, it's not immediately obvious why this is illegal based solely on the legal pattern criteria provided. An elaboration on this could be beneficial.
Concrete Vs. Abstract:
- There's mention of checking whether a type is 'concrete' versus 'abstract', especially in the matching section. It might be helpful to clarify how the proposal differentiates between these two.
The Matching Process:
- The matching procedure is explained well in terms of steps and conditions. Still, due to its complexity, adding a visual representation, like a flowchart, might help in better understanding the flow of the process.
Refined Type Constructs:
- There's a mention of refined types, like Base { type Y = t }. However, an explanation of the intricacies of these refined types, especially within the context of match types, would be beneficial.
Caveats or Limitations:
- If there are any caveats or limitations associated with this proposal, it would be beneficial to outline them explicitly. This helps in setting clear expectations for developers using this feature.
Examples:
- While there are ample examples given, providing more complex scenarios and edge cases might be useful. This can help in highlighting the intricacies and power of the new match type system. Moreover, a practical use-case of where such a feature would be beneficial in real-world code can be a useful addition.
Performance Implications:
- Given the complexity of the match types and the specified algorithm, what are the expected performance implications? Would matching against these types result in a considerable slowdown in compile time?
Error Handling & Messages:
- The proposal doesn't talk about the quality or detail of error messages that would be provided when developers write illegal patterns or mismatched types. For a feature of this complexity, clear error messages are crucial.
Disjointness:
- The section on disjointness is quite brief. Elaborating more on how provably disjoint types are handled might provide clarity, especially in edge cases.

In summary, while the proposal is detailed and gives a good overview of the changes to Match Types, addressing the above points could make it even clearer and more actionable. It's essential that when proposing such changes, developers at all levels of expertise can understand and work with the feature effectively.

anatoliykmetyuk · 2023-08-14T10:09:50Z

As per the recent changes to the SIP process, each SIP now needs to have a SIP Manager assigned. I propose @bjornregnell to be a manager of this SIP - is that OK with you?

As per the process spec, this role entails responsibility for all the communications around the SIP during its lifecycle, including requesting a vote on the SIP from the whole Committee, presenting the SIP to the Committee at the plenary meetings, merging or closing the corresponding PR, reporting to the community on the vote outcome, and announcing when it is available for testing.

bjornregnell · 2023-08-14T10:41:46Z

@anatoliykmetyuk I'm not sure I'm the best pick for this, as I have never seriously tried match types or any other Scala meta-programming feature, so I have some catching up and learning to do in order to write intelligibly about this in a community post etc.

But if there is no one more knowledgeable in Match types available, I can do it iff @sjrd is willing to give me a beginner-level personal tutorial in Zoom when possible ;) (It will have to be after the start of the fall semester as I'm currently swamped with teaching preparations...)

sjrd · 2023-08-14T14:27:43Z

I have addressed Seth's comments.

As well as some comments of ChatGPT 🙄.

bjornregnell · 2023-08-14T10:55:49Z

content/match-types-spec.md

+In each case, they seem to be a way to express what Scala 2 type projections (`A#T`) could express.
+While not quite as powerful as type projections (which were shown to be unsound), match types with type member extractors delay things enough for actual use cases to be meaningful.
+
+As far as we know, those use cases have no workaround if we make type member extractors illegal.


So if I understand correctly, the currently unspecified match types bring some expressive power from Scala 2's type projections `A#T` with respect to type member extraction, as in the use case of computing the type of a member in an enum. This seems useful. Currently the proposal seems to state that this expressive power should be removed with no workaround. I think that the proposal needs a discussion on what can be done about this, or why this particular case is not included among the legal cases as a special case or something similar. Or is it unsound to ask for the type of enum members? (sorry if this is a nonsensical question)

Hum no, the sentence says: if we remove type member extractors from the proposed spec, then we will have no workaround anymore. With the proposed spec, we can extract type members.

The originally submitted SIP/spec did not allow extracting the type member of SomeEnumeration#Value because Value is a class member (not a type alias), but the commit "Allow class members to be matched by type member extractors." made that actually possible.

OK thanks for the clarification!

anatoliykmetyuk · 2023-08-15T07:38:16Z

@bjornregnell got it, thanks for the heads-up!

@soronpo has agreed to be the SIP Manager, so, in that case, he'll manage this SIP.

sjrd · 2023-08-23T13:10:43Z

Well, it took me quite a while to get to the bottom of it, but I added a proposed specification for the provably-disjoint test, as requested. Note that in the end, I actually do recommend that we change it compared to the current implementation. I have a PR for those additional changes at scala/scala3#18416

sjrd · 2023-08-30T14:55:54Z

I have polished the proposal a bit further with type lambda support in provablyDisjoint, and with a spec for the full reduction algorithm as well.

dwijnand · 2023-09-04T10:13:38Z

content/match-types-spec.md

+    * `Ai ⋔ Bi` and it is in invariant position, or
+    * `Ai ⋔ Bi` and it is in covariant position and there exists a field of that type parameter in `E`
+
+It is worth noting that this definition disregards prefixes entirely.


Disregards differing prefixes? Just to be clear that not all prefixes are disregarded.

sjrd · 2023-09-04T11:10:26Z

@sjrd or @bishabosha could you add AnyKind to this, please? I.e. lampepfl/dotty#18510

What do you think is missing? It's already in the spec and in the implementation for this SIP. (Note that in the spec, Nothing and AnyKind are not classes; they are special types.)

dwijnand · 2023-09-04T17:38:11Z

@sjrd or @bishabosha could you add AnyKind to this, please? I.e. lampepfl/dotty#18510

What do you think is missing? It's already in the spec and in the implementation for this SIP. (Note that in the spec, Nothing and AnyKind are not classes; they are special types.)

That any type and AnyKind are never disjoint, whereas the current implementation treats them as disjoint, which I'm fixing in my PR. If that's what's already said in this spec, then nothing more.

sjrd · 2023-09-04T17:59:34Z

The provablyDisjoint (⋔) relation is defined in terms of all the possible cases that make it true. When one of the sides is AnyKind, the only possible rules that can apply are Nothing ⋔ T and S ⋔ Nothing, as well as rules on union/intersection types, which recurse. So AnyKind ⋔ Nothing and conversely, but no other type can ever be provably disjoint from AnyKind. There is no need for a rule that says something about when provablyDisjoint is not true.

soronpo · 2023-10-17T08:44:57Z

@bjornregnell @Kordyjan Do you agree to request a vote in our next SIP meeting? If yes, what would be your recommendation?

bjornregnell · 2023-10-17T11:40:41Z

Do you agree to request a vote in our next SIP meeting? If yes, what would be your recommendation?

Yes, I recommend accepting ~~iff~~ as @sjrd is happy with the review feedback given so far and I feel fairly certain that the spec does not contain major problems that cannot be fixed later. I guess we are voting for it to be "experimental" and that we later vote for it to be accepted as "stable" when a tested implementation is available.

I have read through the proposal and find the motivation for the need of a spec very well written and understandable. However, my knowledge of the underlying theory, type systems, meta-programming, etc., is too limited to give deeper feedback on the actual specification, esp. the section "Matching".

If I mark the entire natural language bullet list after "matchPattern behaves according to what kind is P:" and copy-paste it into a val s = """...""" and calculate

scala> s.split("\n").map(_.trim).filter(_.startsWith("If ")).length

then I get 21 (sic!) "If "-clauses that are up for review. I humbly wonder if there is some other human with at least the knowledge of @sjrd who has double-checked each rule for omissions or bugs?
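
The count above can be reproduced outside the REPL with a small self-contained sketch. The text in `sample` is a made-up placeholder, not the actual bullet list from the spec, and `countIfClauses` is a hypothetical helper name; the real input would be the copy-pasted "Matching" rules.

```scala
// Counts the lines that, once trimmed, start with "If ".
def countIfClauses(text: String): Int =
  text.split("\n").map(_.trim).count(_.startsWith("If "))

// Placeholder text only; it does not reproduce any rule from the spec.
val sample =
  """If the pattern is of placeholder kind A, do X.
    |  If a nested placeholder condition holds, do Y.
    |  Otherwise, do Z.
    |If the pattern is of placeholder kind B, do W.""".stripMargin

val sampleCount = countIfClauses(sample)
```

Note that this counts lines rather than clauses, so an "If" that appears in the middle of a line is not counted.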

I have read all 21 of them and "LGTM", but I cannot say that I am able to check them, although they look reasonable to me on a surface-understanding level.

Also, perhaps it could be explicitly stated in the SIP preamble that this proposed spec forms the requirements of a complete re-implementation from scratch of this language feature in a coming compiler version (if that is the case (?) - just to make it clear upfront).

(And then I wonder how to verify that the new implementation complies with this spec - but that is a matter of how to implement this and how to accept it as "stable" at a later stage...)

Anyway, thank you very much indeed @sjrd for embarking on this seemingly very intricate and tedious but also very important work.

bjornregnell · 2023-10-17T11:44:30Z

(And I guess that when implementing this in the compiler based on the spec, omissions and bugs in the spec are likely to be found, if any.)

sjrd · 2023-10-17T11:52:59Z

I humbly wonder if there is some other human with at least the knowledge of @sjrd that have double-checked each rule for omissions or bugs?

FWIW, there is at least one other human: Olivier Blanvillain, who created match types in the first place.

I hoped there would be fewer "If"s. Unfortunately, it seems each one of them is necessary to have something complete enough. 🤷‍♂️

(And I guess that when implementing this in the compiler based on the spec, omissions and bugs in the spec are likely to be found, if any.)

There is already an implementation at scala/scala3#18262, which indeed, discovered omissions in the early versions of the spec. The latest spec is up-to-date with that implementation, and conversely. The implementation passes the (small) community build, but we have not tried it yet on the open community build.

bjornregnell · 2023-10-17T11:58:43Z

The implementation passes the (small) community build, but we have not tried it yet on the open community build.

OK great. Perhaps each "If" can be formulated as a test case in the code base?

bjornregnell · 2023-10-17T12:12:15Z

I looked at scala/scala3#18262 and indeed there are tests, but I could not trace them to the rules in the spec - would that kind of traceability be helpful? @sjrd

bjornregnell · 2023-10-17T12:13:00Z

Perhaps the rules in the spec should be numbered or named?

soronpo · 2023-11-17T15:14:19Z

The committee voted today and accepted this SIP to move into the implementation stage.
A few comments:

- It would be great to have a "running proof" disjointness example against a basic Scala match type example. One possible candidate is Match type regression scala3#18448
- To move it to official from experimental, it's important to at least go through a green community build.

sjrd force-pushed the match-types-spec branch 2 times, most recently from e0266f2 to 78254ae, August 11, 2023 09:37

Add SIP-56: Proper Specification for Match Types.

4ae0ee0

sjrd force-pushed the match-types-spec branch from 78254ae to 4ae0ee0, August 11, 2023 09:43

SethTisue reviewed Aug 11, 2023

sjrd mentioned this pull request Aug 14, 2023

Make ext.EnumValue[E] compatible with SIP-56 match types. json4s/json4s#1347

Merged

anatoliykmetyuk added stage:design status:under-review labels Aug 14, 2023

anatoliykmetyuk assigned bjornregnell, soronpo and Kordyjan Aug 14, 2023

sjrd added 2 commits August 14, 2023 16:11

Address most of Seth's comments.

cb6b892

Allow class members to be matched by type member extractors.

afbd365

This enables a workaround for the `scala.Enumeration#Value` extractor use case, found notably in json4s.

sjrd mentioned this pull request Aug 14, 2023

SIP-56: Better foundations for match types scala/scala3#18262

Merged

bjornregnell reviewed Aug 14, 2023

bishabosha reviewed Aug 15, 2023

sjrd force-pushed the match-types-spec branch 2 times, most recently from d53fa51 to 88bd807, August 23, 2023 13:22

Add proposed specification of provably disjoint.

17fe545

sjrd force-pushed the match-types-spec branch from 88bd807 to 17fe545, August 23, 2023 13:59

sjrd added 2 commits August 30, 2023 14:04

Include type lambdas in the simple types for provablyDisjoint.

cc9c90a

I discovered that it is necessary for some existing, reasonable use cases of `provablyDisjoint`.

Add the complete reduction algorithm.

3c6b360

Tweaks to the match types proposal.

580d30b

dwijnand reviewed Sep 4, 2023

anatoliykmetyuk added the manager:soronpo label Sep 22, 2023

anatoliykmetyuk added status:vote-requested and removed status:under-review labels Nov 15, 2023

soronpo added stage:implementation and removed stage:design status:vote-requested labels Nov 17, 2023

sjrd mentioned this pull request Nov 27, 2023

Make the source compile with SIP-56 match types, for Scala 3.4. iheartradio/ficus#286

Open

anatoliykmetyuk added the status:under-review label Dec 11, 2023

anatoliykmetyuk merged commit 3a7fef4 into scala:main Dec 11, 2023

sjrd deleted the match-types-spec branch December 11, 2023 09:24

anatoliykmetyuk added manager:soronpo and removed manager:soronpo labels Dec 11, 2023

anatoliykmetyuk added stage:completed status:shipped and removed status:under-review stage:implementation labels Jan 19, 2024
