[WIP] Scala with Explicit Nulls #5747

abeln · 2019-01-18T23:24:26Z

This PR sketches how to change the Scala type system so that reference types are no longer implicitly nullable. Instead, nullability can be recovered via union types (e.g. nullable strings have type String|Null).

See the accompanying doc for a description of the design: https://gist.github.com/abeln/9f79774bac111d99b3ae2cb9016a33e6

The changes include

a new type hierarchy where Null is a direct subtype of Any, and of no other type
a type "translation" layer for Java interop, to expose the true nullability of types in Java fields/methods
a simple form of flow-sensitive type inference to make it easier to work with nullable values
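To make the first point concrete, here is a minimal sketch of how nullable values look when nullability is spelled as a union type. It uses current Scala 3 syntax rather than the prototype in this PR, and `NullableStrings`, `describe` and `toOption` are hypothetical names chosen for illustration. The sketch sticks to pattern matching so that it does not depend on flow-sensitive inference.

```scala
object NullableStrings {
  // A nullable string is written as a union with Null.
  def describe(s: String | Null): String = s match {
    case null        => "<null>"
    case str: String => s"${str.length} chars"
  }

  // Converting to Option is one way to leave the nullable world.
  def toOption(s: String | Null): Option[String] = s match {
    case null        => None
    case str: String => Some(str)
  }
}
```

A plain `String` argument is accepted wherever `String | Null` is expected, since `String` is one of the alternatives of the union.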

Unfortunately, the changes end up touching many components within the compiler (parser, typer, implicits, etc.), since assumptions about Null are baked in in many places. However, there is not much code that does the actually interesting work; most of the modified files are fixed tests.

There are still a few (very) important TODOs:

1. fix all the 37 remaining failing tests (listed here https://pastebin.com/k5qbsrGn)
2. port the Scala standard library to the new type system
3. bootstrap Dotty with the new type system

However, before embarking on 2. and 3. we wanted to get feedback from the Dotty team and the community on how things are looking.

Any and all feedback is greatly appreciated!

This is the initial change disconnecting Null from the bottom of the type hierarchy. Modified multiple places in the compiler where the notion that Null is a subtype of any reference type is hardcoded. At this point, there are many failing tests and the compiler can no longer bootstrap.

The two replaced cases are:
* ascriptions: (null: String) => (??? : String)
* vals/vars: val x: String = null => val x: String = ???

The JLS (https://docs.oracle.com/javase/specs/jls/se7/html/jls-14.html#jls-14.18) explains that "throw" should accept a null argument (in which case it throws an NPE):
```
If evaluation of the Expression completes normally, producing a null value, then an instance V' of class NullPointerException is created and thrown instead of null. The throw statement then completes abruptly, the reason being a throw with value V'.
```
So changed the typer so the prototype allows for null.

This time we convert defs as well: def foo(x: String): String = null => def foo(x: String): String = ???

Commit cc64374 incorrectly updated `isBottomClass` in the backend interface. Revert the change because in the backend types are nullable.

This first version of the transform adds "|Null" to field types and method argument and return types of Java classes. e.g. `class C { String foo(String x); }` becomes `class C { String|Null foo(String|Null x); }`. Type parameters also get nullified (e.g. "ArrayList[T] => ArrayList[T|Null]").
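The transform described above can be sketched on a toy model of types. This is only an illustration under stated assumptions: `Tpe`, `Method`, `NullifySketch` and its methods are hypothetical and are not the compiler's own data structures, and the rule for already-nullable types is a guess at reasonable behavior rather than something the commit message specifies.

```scala
// Toy model of Java types; this is NOT the compiler's own representation.
sealed trait Tpe
final case class Ref(name: String) extends Tpe                        // e.g. String, T
final case class Applied(tycon: String, args: List[Tpe]) extends Tpe  // e.g. ArrayList[T]
final case class Or(left: Tpe, right: Tpe) extends Tpe                // e.g. String|Null
case object NullTpe extends Tpe

// A Java method signature: parameter types and a result type.
final case class Method(name: String, params: List[Tpe], result: Tpe)

object NullifySketch {
  // Add "|Null" to a type, nullifying type arguments along the way.
  def nullify(t: Tpe): Tpe = t match {
    case NullTpe           => NullTpe
    case Or(_, NullTpe)    => t // assumption: already nullable, leave as is
    case Applied(tc, args) => Or(Applied(tc, args.map(nullify)), NullTpe)
    case other             => Or(other, NullTpe)
  }

  // Nullify both the argument types and the return type of a method.
  def nullifyMethod(m: Method): Method =
    m.copy(params = m.params.map(nullify), result = nullify(m.result))

  // Render a type in the notation used in this PR.
  def show(t: Tpe): String = t match {
    case Ref(n)            => n
    case Applied(tc, args) => tc + args.map(show).mkString("[", ", ", "]")
    case Or(l, r)          => show(l) + "|" + show(r)
    case NullTpe           => "Null"
  }
}
```

In this model, `nullifyMethod` applied to `String foo(String x)` yields a signature whose parameter and result are both `String|Null`, matching the example in the commit message.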

JavaNull is defined as `type JavaNull = Null @JavaNullAnnot`. On selections from an expression of type `T | JavaNull`, we select as if we were selecting from T. This is intended to make Java interop more user-friendly, because null values coming from Java are typed as `T | JavaNull`. Of course, this means selections on Java-returned values can fail with NPEs.

When the compiler encounters a method with a varargs argument, the type of the varargs is initially represented as an Array[T]. Later, it is transformed into a RepeatedParamType[T]. However, the nullability transform makes it so that the varargs has type `Array[T|JavaNull]|JavaNull`. We need to teach `arrayToRepeated` how to handle that case so we can get `RepeatedParamType[T|JavaNull]|JavaNull` as the result.

Instead of changing typedSelect, add the special case for JavaNull in Types#findMember. Additionally, cleaned up the tests, which now pass without -Ychecks but fail with -Ychecks. Additionally, there's a problem where the compiler won't infer a union type: e.g.
```
val x = new ArrayList[String]()
val r = x.get(0)
```
The compiler will infer `r: Object` and not `r: String|JavaNull`. Need to address separately.

Local unit tests now pass.

Tag as nullable TypeParamRefs, so we can handle polymorphic Java methods.

When an enum is read from Java code, the compiler synthesizes a bunch of classes/modules/fields for it. One of the synthesized entities is a class that extends java.lang.Enum and calls its constructor. The first argument of the constructor is a string, so we were passing null which failed. Pass the empty string instead. This is ok because the synthesized Java code isn't run: it's just there for typechecking.

Before erasure, reference types are non-nullable, but after it they should be nullable again, because JVM types are nullable. This fixes tests/pos/i536 by changing the notion of a nullable type to take into consideration the current phase id. A similar thing is already done in TypeComparer in a different case: https://github.com/lampepfl/dotty/blob/master/compiler/src/dotty/tools/dotc/core/TypeComparer.scala#L676

@unchecked

… type `Nothing`

When desugaring pattern matching code for expressions where the matched value has type `Null` or `Nothing`, we used to generate code that's type-incorrect. Example:
```
val Some(x) = null
```
got desugared into
```
val x: Nothing =
  matchResult1[Nothing]: {
    case val x1: Null @unchecked = null: Null @unchecked
    if x1.ne(null) then {
      case val x: Nothing = x1.value.asInstanceOf[Nothing]
      return[matchResult1] x: Nothing
    } else ()
    return[matchResult1] throw new MatchError(x1)
  }
```
There were two problems here:
1) `x1.ne(null)`
2) `x1.value`

In both cases, we're trying to invoke methods that don't exist for type `Nothing` (and #2 doesn't exist for `Null`). This commit changes the desugaring so we generate a no-op for unapply when the value matched has type `Nothing` or `Null`. This works because the code we used to generate is never executed (because of the `x1.ne(null)` check).

This adds a second TypeMap, specifically for constructors. For constructors, we nullify the argument types, but not the return type. Once we nullified the arguments of constructors, all case classes were breaking, because the logic to generate synthetic methods for case classes relies on finding the symbol for IndexOutOfBoundsException, which changed with this CL. Patch up that logic as well.

Null <: Any makes for a cleaner type hierarchy. For example, we can now abstract over non-nullable types with `def foo[T <: AnyRef](x: T) = ...` However, both AnyRef and Null need to be comparable with reference equality, so we add a new trait RefEq
```
trait RefEq {
  def eq(that: RefEq): Boolean
  def ne(that: RefEq): Boolean
}
```
and make both AnyRef and Null extend RefEq. RefEq is completely synthetic, and it gets erased to Object.

Go back to _not_ ignoring nullability during override checks. The original motivation was twofold:
1) make migration easier (fewer type errors to fix in code that overrides Java classes)
2) enable binary compatibility between pre- and post-nullability versions of a Scala library

However, since our current approach to binary compatibility is to not do anything, point 2 is now moot. If and when we need to re-enable this in the future we can always do so. Until then, this eliminates a source of unsoundness.

In some cases during the null transform we see Java types of the form `A & B`, which weren't previously handled. Handle intersections by `nf(A & B) = nf(A) & nf(B) | JavaNull` (`&` binds stronger), but take care not to add JavaNull again while nullifying A and B.
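The intersection rule can be sketched on a toy type model as follows. Everything here is hypothetical (`JType`, `IntersectionSketch`, the `atTop` flag); in particular, the choice to keep nullifying type arguments inside the operands of an intersection is an assumption made for illustration, not something the commit message states.

```scala
// Toy model of Java types; NOT the compiler's own representation.
sealed trait JType
final case class JRef(name: String) extends JType
final case class JApplied(tycon: String, args: List[JType]) extends JType
final case class JAnd(left: JType, right: JType) extends JType
final case class JOr(left: JType, right: JType) extends JType
case object JavaNullTpe extends JType

object IntersectionSketch {
  private def isNullable(t: JType): Boolean = t match {
    case JavaNullTpe         => true
    case JOr(_, JavaNullTpe) => true
    case _                   => false
  }

  // atTop = false means "we are inside an intersection: do not add JavaNull here".
  def nf(t: JType, atTop: Boolean = true): JType = {
    val inner = t match {
      case JAnd(a, b)         => JAnd(nf(a, atTop = false), nf(b, atTop = false))
      case JApplied(tc, args) => JApplied(tc, args.map(arg => nf(arg))) // assumption: type arguments are still nullified
      case other              => other
    }
    // JavaNull is added once, at the outermost level only.
    if (atTop && !isNullable(inner)) JOr(inner, JavaNullTpe) else inner
  }
}
```

With this definition, `nf(A & B)` produces `(A & B) | JavaNull` rather than `((A | JavaNull) & (B | JavaNull)) | JavaNull`.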

dottybot

Hello, and thank you for opening this PR! 🎉

All contributors have signed the CLA, thank you! ❤️

Commit Messages

We want to keep history, but for that to actually be useful we have
some rules on how to format our commit messages (relevant xkcd).

Please stick to these guidelines for commit messages:

Separate subject from body with a blank line

When fixing an issue, start your commit message with Fix #<ISSUE-NBR>:

Limit the subject line to 72 characters

Capitalize the subject line

Do not end the subject line with a period

Use the imperative mood in the subject line ("Add" instead of "Added")

Wrap the body at 80 characters

Use the body to explain what and why vs. how

adapted from https://chris.beams.io/posts/git-commit

Have an awesome day! ☀️

smarter · 2019-01-19T17:09:09Z

Very cool! Before we get to reviewing this PR, it'd be helpful if it was cleaned up a bit: there are many commits like "fix tests" which are not really meaningful on their own. Ideally, commits in a PR should be atomic: serve a clear purpose detailed in their commit message, and pass all tests (we don't actually enforce that in the dotty repo currently). This PR also needs to be rebased.

Would it be possible to gate the invasive semantic changes of this PR behind a compiler flag? This way we could merge it even if we're not sure if we'll accept it, and it'll allow more experimentation in the wild.

https://gist.github.com/abeln/9f79774bac111d99b3ae2cb9016a33e6 states that JavaNull is non-denotable. Could the documentation be expanded to justify this? I'm personally very wary of inferring types that the users cannot write down or talk about; it makes it much harder for users to reason about their code.

odersky · 2019-01-19T17:46:08Z

@abeln I am excited about this PR! Its timing was a bit unfortunate, since we just merged a large change to how positions get computed, which affected many lines. So it will take some effort to rebase, I am afraid.

smarter · 2019-01-19T19:06:40Z

It looks like the behavior of this proposal is being discussed at https://contributors.scala-lang.org/t/wip-scala-with-explicit-nulls/2761 which is just as well since it means we can limit the comments on this PR to discussing the implementation.

abeln · 2019-01-22T00:20:02Z

@smarter I think we should be able to gate the changes behind a flag.

I don't know how to make the changes atomic, since the algorithm for developing the feature so far has been

change the type hierarchy
a million tests break
add fixes to gradually fix the tests

Some test fixes involve changing just the tests, but others modify the compiler.
Even now there are ~30 broken tests, one of which is the standard library.

The one way to have a less atomic change, but one that keeps the tests passing, would be to squash all the commits into one.

For reviewing the PR, I think the best way is to go file-by-file, and not commit-by-commit. There's really not that much code to review within the compiler.

So on my end, I can

gate the feature behind a flag
update the PR as per the changes proposed in the Scala contributors thread
fix all remaining positive and run tests
rebase

How does that sound?

smarter · 2019-01-22T02:16:31Z

Sounds good. If having atomic commits requires big commits that's fine too, as long as the commit message is detailed enough. Think about someone doing git blame on your code in five years and what kind of things could help them understand what's going on :).

This improves flow-sensitive inference so that it handles a bunch of previously unsupported cases:
* conditions inside blocks
* inlined code
* isInstanceOf checks
* reference equality checks: eq and ne
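As a rough illustration of what flow-sensitive inference buys the user, the sketch below narrows a nullable value after a null check. It is written against current Scala 3 with explicit nulls enabled, not against the prototype in this PR, and covers only plain `!= null` tests, not every case listed above. `FlowSketch`, `lengthOrZero` and `isLong` are hypothetical names.

```scala
object FlowSketch {
  def lengthOrZero(s: String | Null): Int =
    if (s != null) {
      // With explicit nulls enabled, `s` is narrowed to String in this branch,
      // so the assignment below needs no cast.
      val nonNull: String = s
      nonNull.length
    } else 0

  // The same narrowing applies to the right-hand side of `&&`.
  def isLong(s: String | Null): Boolean =
    s != null && {
      val nonNull: String = s
      nonNull.length > 3
    }
}
```

Without this kind of inference, each use of `s` after the check would need a cast or a pattern match.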

abeln · 2019-01-22T22:39:43Z

Closing this for now while I rebase and fix the rest of the failing tests.

abgruszecki · 2019-01-29T12:49:19Z

@abeln - it might be useful for you to be aware of #4004. Current status quo is that isInstanceOf[Null] is permitted by Dotty, but should be rejected with an error (as is the case in Scalac). I saw suggestions on Scala contributors to equate x == null with x.isInstanceOf[Null] so in the case you go in that direction, please remember to update #4004 as well.

abeln added 30 commits January 15, 2019 16:53

Automatically replace "null" by "???" in positive tests

2a22e7a

The two replaced cases are:
* ascriptions: (null: String) => (??? : String)
* vals/vars: val x: String = null => val x: String = ???

Revert improperly modified test

3eb3874

Additional automated conversion of null to ???

fbfbd4f

This time we convert defs as well: def foo(x: String): String = null => def foo(x: String): String = ???

Fix additional positive tests

8ebd3eb

Revert incorrect change to backend interface

73ffa5d

Commit cc64374 incorrectly updated `isBottomClass` in the backend interface. Revert the change because in the backend types are nullable.

Refactor tests and add new source test

e4ee72c

Nullify applied types too

1217b9b

Use helper methods when typing Select trees

1f99138

Teach FirstTransform that |JavaNull is see-through

5701392

Local unit tests now pass.

Improve nullability transform

f6d116b

Tag as nullable TypeParamRefs, so we can handle polymorphic Java methods.

Moar easy test fixes

8ffaea3

Breakup test suite into smaller tests

9ee807c

Better check for whether we're before erasure

b048428

Add test for nullifying methods

9491c16

Don't nullify the inside of Class[] and the special TYPE field

270960c

Fix a couple of failing tests

2b4a7d6

Fix two more tests

9a86c34

Handle repeated parameters

9f4a6d4

Fix another test

c1df4c7

Don't nullify the return type of toString

02cefbc

abeln added 18 commits January 15, 2019 17:00

fix tests

42b49a4

fix tests

e4e66df

fix typo

f5fcaa0

Fix tests

246f70b

Fix array tests to use implicit conversions

320e38a

More robust handling of union types in JavaNull transform

5b01205

Add test exercising array conversions

fa9e5d8

Fixes for PR

c06bcc9

Add correctness proof to flow inference

830e97b

Fix typo

96021ef

PR fixes

6e75ea8

Add test

ee321cf

Move null erasure to better location

ab6934d

Comments and style fixes

fcbdf0f

More polish

ff2be5c

dottybot reviewed Jan 18, 2019


petrpan26 and others added 2 commits January 22, 2019 14:59

Modified test

8823c43

Expand flow sensitive inference

fc56d9e

This improves flow-sensitive inference so that it handles a bunch of previously unsupported cases:
* conditions inside blocks
* inlined code
* isInstanceOf checks
* reference equality checks: eq and ne

abeln closed this Jan 22, 2019

abgruszecki mentioned this pull request Jan 31, 2019

Allow pattern matching to constrain type members of function arguments #5828

Open
