-
Notifications
You must be signed in to change notification settings - Fork 28
SIP-69: Existential containers #101
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: main
Are you sure you want to change the base?
Changes from all commits
1d78df7
1790303
3e6fc1b
4642b49
03c23a5
8b0807c
5ac6ae4
ef4829d
45f77d9
dacee46
85af31b
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,288 @@ | ||
--- | ||
layout: sip | ||
permalink: /sips/:title.html | ||
stage: design | ||
status: submitted | ||
presip-thread: n/a | ||
title: SIP-NN - Existential Containers | ||
--- | ||
|
||
**By: Dimi Racordon and Eugene Flesselle and Matt Bovel** | ||
|
||
## History | ||
|
||
| Date | Version | | ||
|---------------|--------------------| | ||
| Nov 25th 2024 | Initial Draft | | ||
|
||
## Summary | ||
|
||
Type classes have become a well-established feature in the Scala ecosystem to escape some of the shortcomings of subtyping with respect to extensibility. | ||
Unfortunately, type classes do not support run-time polymorphism and dynamic dispatch, two features typically taken for granted in Scala. | ||
|
||
This SIP proposes a minimal change to the language to support *existential containers*, which address this problem. | ||
An existential container wraps a value together with a witness of its conformance to one or several type classes into an object exposing the API defined by these type classes. | ||
|
||
## Motivation | ||
|
||
Type classes can address some of the well-known limitations of subtyping with respect to extensibility, such as the ability to extend existing data types with new behaviors [1]. | ||
A type class describes the interface of a generic _concept_ as a set of requirements, expressed in the form of operations and associated types. | ||
These requirements can be implemented for a specific type, thereby specifying how this type _models_ the concept. | ||
The following illustrates: | ||
|
||
```scala | ||
import shapes.{Square, Hexagon} | ||
|
||
trait TypeClass: | ||
type Self | ||
|
||
trait Polygon extends TypeClass: | ||
extension (self: Self) | ||
def area: Double | ||
|
||
given Square is Polygon: ... | ||
given Hexagon is Polygon: ... | ||
``` | ||
|
||
Defining `Polygon` as a type class rather than an abstract class to be inherited allows us to retroactively state that squares are polygons without modifying the definition of `Square`. | ||
Sticking to subtyping would require the definition of an inefficient and verbose wrapper class. | ||
|
||
Alas, type classes offer limited support for eliding type information at compile-time. | ||
Hence, it is difficult to manipulate heterogeneous collections or write procedures returning arbitrary values known to model a particular concept. | ||
The following illustrates: | ||
|
||
```scala | ||
def largest[T: Polygon](xs: Seq[T]): Option[T] = | ||
xs.maxByOption(_.area) | ||
|
||
largest(List(Square(), Hexagon())) | ||
// error: No given instance of type Polygon{type Self = Square | Hex} was found for a context parameter of method largest | ||
``` | ||
|
||
The call to `largest` is illegal because, although there exist witnesses of the `Polygon` and `Hexagon`'s conformance to `Polygon`, no such witness exists for their least common supertype. | ||
In other words, it is impossible to call `largest` with an heterogeneous sequence of polygons. | ||
|
||
## Proposed solution | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This section is under-specified. Reading it, I have no idea what the proposed solution is, even though I generally agree on the value of the feature being proposed We need several more use case examples, which are less contrived than Someone should be able to read the proposed solution have zero idea how it is implemented, and still get the general idea of what the proposed language feature is about. Right now, that is not the case, so this section of the proposal is incomplete There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I can write other examples but I disagree that the one in the document is contrived. It also generalizes very easily to any situation involving a set of classes conforming to the same type class. Any instance of a situation where the "upper bound" of a set of types is defined retroactively would look like the proposed example.
Instead of justifying existential containers, however, I propose to clarify that the SIP is about sugaring the selection of a method rather than proposing the containers themselves, as those can be defined in library space. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. You really need to justify both the language changes and library support. The language change is useless without the library support, but most people would not be familiar with this library-level technique.
|
||
|
||
The problems raised above can be worked around if, instead of using generic parameters with a context bound, we use pairs bundling each value with its conformance witness. | ||
In broad strokes, a solution generalizes the following possible implementation of `largest`: | ||
|
||
```scala | ||
trait AnyPolygon: | ||
type Value: Polygon as witness | ||
val value: Value | ||
|
||
def largest(xs: Seq[AnyPolygon]): Option[AnyPolygon] = | ||
xs.maxByOption((a) => a.witness.area(a.value)) | ||
``` | ||
|
||
The type `AnyPolygon` conceptually represents an arbitrary polygon. | ||
It consists of a pair containing some arbitrary value as well as a witness of that value's type being a polygon. | ||
We call this pair an _existential container_, as a nod to a similar feature in Swift. | ||
|
||
While the above example "hardcodes" the type class `Polygon`, Scala's type system is actually rich enough to safely define a generic abstraction represeting existential containers parameterized by a type class. | ||
A possible implementation is presented in the appendix but its details are actually irrelevant to for the proposed change. | ||
The purpose of this SIP is _only_ to support the selection of an existential container's `value` field implicitly. | ||
That way, one could simply write `xs.maxByOption(_.area)` in the above example, resulting in quite idiomatic scala. | ||
|
||
To illustrate further, assume the existence of an abstraction named `Containing[TC]` for representing containers pairing an arbitrary value with a witness of its conformance to some type class `TC`. | ||
The proposed change would let the compiler accept the following example: | ||
|
||
```scala | ||
trait Polygon extends TypeClass: | ||
extension (self: Self) def area: Double | ||
|
||
def largest(xs: Seq[Containing[Polygon]]): Option[Containing[Polygon]] = | ||
xs.maxByOption(_.area) | ||
``` | ||
|
||
This implementation of `largest` requires existential containers to take and return arbitrary polygons. | ||
Indeed, we wish to operate on a _heterogeneous_ list of polygons (i.e., types conforming to `Polygon`), not a list of a particular type happing to have an instance of the type class. | ||
On the return side, we wish to return any type known to be a polygon paired with the witness of its conformance. | ||
Again, doing so (conveniently) is not possible without existential containers. | ||
|
||
The above example generalizes to any occurrence of heterogeneous collection. | ||
For instance: | ||
|
||
```scala | ||
trait CustomHashable extends TypeClass: | ||
extension (self: Self) def hashInto(hasher: Hasher) | ||
|
||
def customHashValue(xs: List[Containing[CustomHashable]]): Int = | ||
val h = Hasher() | ||
for x <- xs do xs.hashInto(h) | ||
h.finalize() | ||
``` | ||
|
||
Returning a value paired with its witness generalizes similarly. | ||
|
||
```scala | ||
trait Sizeable extends TypeClass: | ||
extension (self: Self) def size: Int | ||
|
||
def shortest[A: Sizeable, B: Sizeable](a: A, b: B): Containing[Sizeable] = | ||
if b.size < a.size then Containing(a) else Containing(b) | ||
``` | ||
|
||
Further motivation for existential containers in Scala have been described in a research paper [2]. | ||
|
||
### Specification | ||
|
||
Assuming the existence of an abstraction named `Containing[TC]` for representing containers pairing an arbitrary value with a witness of its conformance to some type class `TC` in the standard library, the compiler injects the selection of the `value` field implicitly when a method of `Containing[TC]` is selected. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Since this is in the
Otherwise, this should be in a separate section (example or appendix) |
||
|
||
Illustrating with our running example: | ||
|
||
```scala | ||
// Version with subtyping: | ||
trait Polygon1: | ||
def area: Double | ||
def largest1(xs: Seq[Polygon1]): Option[Polygon1] = | ||
xs.maxByOption(_.area) | ||
|
||
// Version with existential containers: | ||
trait Polygon2 extends TypeClass: | ||
extension (self: Self) def area: Double | ||
def largest2(xs: Seq[Containing[Polygon2]]): Option[Containing[Polygon2]] = | ||
xs.maxByOption(_.area) // <- sugared form of `xs.maxByOption(_.value.area)` | ||
``` | ||
|
||
### Compatibility | ||
|
||
The change in the syntax does not affect any existing code and therefore this proposal has no impact on source compatibility. | ||
|
||
The semantics of the proposed feature is fully expressible in Scala. | ||
Save for the implicit addition of `.value` on method selection when the receiver is an instance of `Containing[C]`, this proposal requires no change in the language. | ||
As a result, it has no backward binary or TASTy compatibility consequences. | ||
|
||
### Feature interactions | ||
|
||
The proposed feature is meant to interact with implicit search, as currently implemented by the language. | ||
More specifically, given an existential container `c`, accessing `c.value` _opens_ the existential while retaining its type `c.Value`, effectively keeping an _anchor_ (i.e., the path to the scope of the witness) to the interface of the type class. | ||
|
||
Since no change in implicit resolution is needed, this proposal cannot create unforeseen negative interactions with existing features. | ||
|
||
### Other concerns | ||
|
||
This document has been written under the experimental modularity improvements for Scala 3. | ||
Although the proposed feature is fully expressible without those changes, the encoding of existential containers can only work with the "old" (i.e., the one currently used in production) or "new" type class style. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I had to read this sentence a few times to get the point so maybe it would be good to rephrase it somehow (it's not clear at first which expression |
||
|
||
### Open questions | ||
|
||
One problem not addressed by the proposed encoding is the support of multiple type classes to form the interface of a specific container. | ||
For example, one may desire to create a container of values whose types conform to both `Polygon` _and_ `Show`. | ||
We have explored possible encodings for such a feature but decided to remove them from this proposal, as support for multiple type classes can most likely be achieved without any additional language change. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Is the intent to answer this more fully before the proposal comes to a vote? Or do we expect it to remain an open question until some future iteration after the SIP lands? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I would propose that as a future proposal, if necessary. That said it is likely that the SIP committee would not have to formally weigh in this addition because it would only relate to the way existential containers are encoded in the library, without any other change to the language. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more.
you suggest that you can synthesize a single I guess the construction is the harder part - resolution would "just work" |
||
|
||
Another open question relates to possible language support for shortening the expression of a container type and/or value. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Ditto on whether this might get filled in soon, or it's definitely a "someday" thing? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. That's a someday thing but, again, these open questions can hopefully be answered without introducing more changes to the language. |
||
|
||
## Related work | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Apart from Swift, what other languages have a feature like this? Does Haskell have something similar? Or Rust? Both of those languages have typeclass-like features and presumably would encounter the same issues around e.g. hetoregenous collections. How do they solve it, or do they not? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The proposal mentions dynamic traits in Rust. Those work almost exactly like Swift but are slightly less expressive. The proposal also cite a paper documenting other related work from scientific literature. I'll add a note mentioning Haskell's |
||
|
||
Swift supports existential containers. | ||
For instance, `largest` can be written as follows in Swift: | ||
|
||
```swift | ||
func largest(_ xs: [any Polygon]) -> (any Polygon)? { | ||
xs.max { (a, b) in a.area < b.area } | ||
} | ||
``` | ||
|
||
Unlike in this proposal, existential containers in Swift are built-in and have a dedicated syntax (i.e., `any P`). | ||
One advantage of Swift's design is that the type system can treat an existential container as supertype of types conforming to that container's interface. | ||
For example, `any Polygon` is supertype of `Square` (assuming the latter conforms to `Polygon`): | ||
|
||
```swift | ||
print(largest([Square(), Hexagon()])) | ||
``` | ||
|
||
In contrast, to avoid possible undesirable complications, this proposal does not suggest any change to the subtyping relation of Scala. | ||
|
||
Rust also supports existential containers in a similar way, writing `dyn P` to denote a container bundling some value of a type conforming to `P`. | ||
Similar to Swift, existential containers in Rust are considered supertypes of the types conforming to their bound. | ||
|
||
Existential contains are also featured in Haskell, under the [`ExistentialQuantification`](https://wiki.haskell.org/Heterogenous_collections) extension. | ||
Unlike in Swift and Rust, packing and unpacking in and out of existential containers requires more boilerplate: | ||
|
||
```haskell | ||
{-# LANGUAGE ExistentialQuantification #-} | ||
|
||
class Polygon a where | ||
area :: a -> Double | ||
|
||
data AnyPolygon = forall a . Polygon a => MakePolygon a | ||
pack :: Polygon a => a -> AnyPolygon | ||
pack = MakePolygon | ||
|
||
instance Polygon Square where | ||
area s = 1.0 | ||
instance Polygon Hexagon where | ||
area h = 1.0 | ||
|
||
largest :: [AnyPolygon] -> Maybe AnyPolygon | ||
largest (x : xs) = case (largest xs) of | ||
Nothing -> Just x | ||
Just(y) -> Just (if (f x) < (f y) then y else x) | ||
where f (MakePolygon a) = area a | ||
largest [] = Nothing | ||
``` | ||
|
||
A more formal exploration of the state of the art as been documented in a research paper presented prior to this SIP [2]. | ||
|
||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This proposal is missing a section on
Why is the proposed solution better than the alternatives I listed here? What other alternatives should we be aware of, if any? |
||
## Alternatives considered | ||
|
||
### No change to the language | ||
|
||
As already mentioned, Scala's type system is currently strong enough to support the definition of existential containers. | ||
Hence, no change to the language is strictly necessary to support them. | ||
The gain offered by the proposal is that method selection on an existential container will look more familiar. | ||
|
||
### Converting to and from wrappers | ||
|
||
As mentioned in our motivations, the problem of creating a heterogeneous collections can be addressed by defining custom wrappers. | ||
One can further define implicit conversions to alleviate the syntactic burden. | ||
This approach is nonetheless strictly more verbose since it requires the definition of a specific container for each type class used in conjunction with heterogeneous collections. | ||
In that sense, existential containers can be understood as a generalized wrapper. | ||
|
||
## FAQ | ||
|
||
#### Is there any significant performance overhead in using existential containers? | ||
|
||
On micro benchmarks testing method dispatch specifcally, we have measured that dispatching through existential containers in Scala was about twice as slow as traditional virtual method dispatch, which is explained by the extra pointer indirection introduced by an existential container. | ||
This overhead drops below 10% on larger, more realistic benchmarks [2]. | ||
|
||
## References | ||
|
||
1. Stefan Wehr and Peter Thiemann. 2011. JavaGI: The Interaction of Type Classes with Interfaces and Inheritance. ACM Transactions on Programming Languages and Systems 33, 4 (2011), 12:1–12:83. https://doi.org/10.1145/1985342.1985343 | ||
2. Dimi Racordon and Eugene Flesselle and Matt Bovel. 2024. Existential Containers in Scala. ACM SIGPLAN International Conference on Managed Programming Languages and Runtimes, pp. 55-64. https://doi.org/10.1145/3679007.3685056 | ||
|
||
|
||
## Appendix | ||
|
||
The following is a possible implementation of existential containers. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Here's a version that doesn't rely on the experimental modularity language import (and doesn't require the typeclass to extend /** A value together with an evidence of its type conforming to some type class. */
sealed trait Containing[TC[_]]:
/** The type of the contained value. */
type Value: TC as witness
/** The contained value. */
val value: Value
object Containing:
/** Wraps a value of type `V` and a corresponding typeclass instance `TC[V]` into a `Containing[TC]`. */
def apply[TC[_]](v: Any)[V >: v.type](using TC[V]): Containing[TC] =
new Containing[TC]:
type Value >: V <: V
val value: Value = v |
||
|
||
```scala | ||
import language.experimental.{clauseInterleaving, modularity} | ||
|
||
/** A type class. */ | ||
trait TypeClass: | ||
type Self | ||
|
||
/** A value together with an evidence of its type conforming to some type class. */ | ||
sealed trait Containing[Concept <: TypeClass]: | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. does it only work with this upper bound or can it be |
||
|
||
/** The type of the contained value. */ | ||
type Value: Concept as witness | ||
|
||
/** The contained value. */ | ||
val value: Value | ||
|
||
object Containing: | ||
|
||
/** Wraps a value of type `V` into a `Containing[TC]` provided a witness that `V is TC`. */ | ||
def apply[TC <: TypeClass](v: Any)[V >: v.type](using V is TC) = | ||
new Containing[TC]: | ||
type Value >: V <: V | ||
val value: Value = v | ||
``` | ||
|
||
Given a type class `C`, an instance `Containing[TC]` is an existential container. | ||
The context bound on the definition of the `Value` member provides a witness of `Value`'s conformance to `TC` during implicit resolution when a method of the `value` field is selected. | ||
The companion object of `Containing` provides basic support to create containers ergonomically. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The SIP number is already known