Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update FAQ on initialization order #3017

Open
wants to merge 1 commit into
base: main
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
148 changes: 72 additions & 76 deletions _overviews/FAQ/initialization-order.md
Original file line number Diff line number Diff line change
Expand Up @@ -7,7 +7,7 @@ permalink: /tutorials/FAQ/:title.html

## Example

To understand the problem, let's pick the following concrete example.
The following example illustrates the problem:

abstract class A {
val x1: String
Expand All @@ -26,7 +26,7 @@ To understand the problem, let's pick the following concrete example.
println("C: " + x1 + ", " + x2)
}

Let's observe the initialization order through the Scala REPL:
In the Scala REPL we observe:

scala> new C
A: null, null
Expand All @@ -36,129 +36,125 @@ Let's observe the initialization order through the Scala REPL:
Only when we get to the constructor of `C` are both `x1` and `x2` initialized. Therefore, constructors of `A` and `B` risk running into `NullPointerException`s.

## Explanation
A 'strict' or 'eager' val is one which is not marked lazy.

In the absence of "early definitions" (see below), initialization of strict vals is done in the following order.
A "strict" or "eager" val is one which is not marked lazy.
Initialization of strict vals is done in the following order:

1. Superclasses are fully initialized before subclasses.
2. Otherwise, in declaration order.

Naturally when a val is overridden, it is not initialized more than once. So though x2 in the above example is seemingly defined at every point, this is not the case: an overridden val will appear to be null during the construction of superclasses, as will an abstract val.
When a `val` is overridden, in fact its accessor method (the "getter") is overridden.
So the access to `x2` in class `A` in fact invokes the overridden getter in class `C` which reads the underlying field `C.x2`.
This field is not yet initialized during the construction of `A`.

There is a compiler flag which can be useful for identifying this situation:
## Mitigation

**-Xcheckinit**: Add runtime check to field accessors.
The [`-Ysafe-init` compiler flag](https://docs.scala-lang.org/scala3/reference/other-new-features/safe-initialization.html) in Scala 3 enables compiler warnings for accesses to uninitialized fields:
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

suggest mentioning here that the flag is experimental, like all -Y flags

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That's like saying use it, but don't like use it. How about instead of experimental, we start saying "cutting-edge". Also, experimental has the narrow meaning for dotty that you need a flag and an experimental compiler to use the feature.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it will also be -Wsafe-init from 3.5


It is inadvisable to use this flag outside of testing. It adds significantly to the code size by putting a wrapper around all potentially uninitialized field accesses: the wrapper will throw an exception rather than allow a null (or 0/false in the case of primitive types) to silently appear. Note also that this adds a *runtime* check: it can only tell you anything about code paths which you exercise with it in place.
-- Warning: Test.scala:8:6 ------------------
8 | val x1: String = "hello"
| ^
| Access non-initialized value x1. Calling trace:
| ├── class B extends A { [ Test.scala:7 ]
| │ ^
| ├── abstract class A { [ Test.scala:1 ]
| │ ^
| └── println("A: " + x1 + ", " + x2) [ Test.scala:5 ]
| ^^

Using it on the opening example:
In Scala 2, the `-Xcheckinit` flag adds runtime checks in the generated bytecode to identify accesses of uninitialized fields.
The code then throws an exception rather than allowing a `null` (or `0` / `false` in the case of primitive types) to silently appear.
Note that these runtime checks only test code that is actually exectued at runtime.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
Note that these runtime checks only test code that is actually exectued at runtime.
Note that these runtime checks only test code that is actually executed at runtime.

The flag can be helpful to find accesses to uninitialized fields, but it should never be used in production due to its performance overhead.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
The flag can be helpful to find accesses to uninitialized fields, but it should never be used in production due to its performance overhead.
The flag can be helpful to find accesses to uninitialized fields, but it should not be used in production due to its performance overhead.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

"must never", and spoken with Germanic authority. People run debug in production all the time just in case there's a crash, which there always is.


% scalac -Xcheckinit a.scala
% scala -e 'new C'
scala.UninitializedFieldError: Uninitialized field: a.scala: 13
at C.x2(a.scala:13)
at A.<init>(a.scala:5)
at B.<init>(a.scala:7)
at C.<init>(a.scala:12)

### Solutions ###
## Solutions

Approaches for avoiding null values include:

#### Use lazy vals ####

abstract class A {
val x1: String
lazy val x2: String = "mom"
### Use class / trait parameters

abstract class A(val x1: String, val x2: String = "mom") {
println("A: " + x1 + ", " + x2)
}
class B extends A {
lazy val x1: String = "hello"

class B(x1: String = "hello", x2: String = "mom") extends A(x1, x2) {
println("B: " + x1 + ", " + x2)
}
class C extends B {
override lazy val x2: String = "dad"

class C(x2: String = "dad") extends B(x2 = x2) {
println("C: " + x1 + ", " + x2)
}
// scala> new C
// A: hello, dad
// B: hello, dad
// C: hello, dad

Usually the best answer. Unfortunately you cannot declare an abstract lazy val. If that is what you're after, your options include:
Values passed as parameters to the superclass constructor are available in its body.

1. Declare an abstract strict val, and hope subclasses will implement it as a lazy val or with an early definition. If they do not, it will appear to be uninitialized at some points during construction.
2. Declare an abstract def, and hope subclasses will implement it as a lazy val. If they do not, it will be re-evaluated on every access.
3. Declare a concrete lazy val which throws an exception, and hope subclasses override it. If they do not, it will... throw an exception.
Scala 3 also [supports trait parameters](https://docs.scala-lang.org/scala3/reference/other-new-features/trait-parameters.html).

An exception during initialization of a lazy val will cause the right-hand side to be re-evaluated on the next access: see SLS 5.2.
Note that overriding a `val` class parameter is deprecated / disallowed in Scala 3.
Doing so in Scala 2 can lead to surprising behavior.

Note that using multiple lazy vals creates a new risk: cycles among lazy vals can result in a stack overflow on first access.
### Use lazy vals

#### Use early definitions ####
abstract class A {
val x1: String
val x2: String = "mom"
lazy val x1: String
lazy val x2: String = "mom"

println("A: " + x1 + ", " + x2)
}
class B extends {
val x1: String = "hello"
} with A {
class B extends A {
lazy val x1: String = "hello"

println("B: " + x1 + ", " + x2)
}
class C extends {
override val x2: String = "dad"
} with B {
class C extends B {
override lazy val x2: String = "dad"

println("C: " + x1 + ", " + x2)
}
// scala> new C
// A: hello, dad
// B: hello, dad
// C: hello, dad

Early definitions are a bit unwieldy, there are limitations as to what can appear and what can be referenced in an early definitions block, and they don't compose as well as lazy vals: but if a lazy val is undesirable, they present another option. They are specified in SLS 5.1.6.
Note that abstract `lazy val`s are supported in Scala 3, but not in Scala 2.
In Scala 2, you can define an abstract `val` or `def` instead.

Note that early definitions are deprecated in Scala 2.13; they will be replaced by trait parameters in Scala 3. So, early definitions are not recommended for use if future compatibility is a concern.
An exception during initialization of a lazy val will cause the right-hand side to be re-evaluated on the next access: see SLS 5.2.

#### Use constant value definitions ####
abstract class A {
val x1: String
val x2: String = "mom"
Note that using multiple lazy vals creates a new risk: cycles among lazy vals can result in a stack overflow on first access.
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Or dead lock in Scala 3 for thread-safe lazy vals.


println("A: " + x1 + ", " + x2)
}
class B extends A {
val x1: String = "hello"
final val x3 = "goodbye"
### Use a nested object
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Isn't that basically the same solution as a lazy val?


println("B: " + x1 + ", " + x2)
}
class C extends B {
override val x2: String = "dad"
Sometimes, uninitialized state in a subclass is accessed during construction of a superclass:

println("C: " + x1 + ", " + x2)
class Adder {
var sum = 0
def add(x: Int): Unit = sum += x
add(1)
}
abstract class D {
val c: C
val x3 = c.x3 // no exceptions!
println("D: " + c + " but " + x3)
class LogAdder extends Adder {
private var added: Set[Int] = Set.empty
override def add(x: Int): Unit = { added += x; super.add(x) }
}
class E extends D {
val c = new C
println(s"E: ${c.x1}, ${c.x2}, and $x3...")

In this case the state can be initialized on demand by wrapping it into a local object:

class Adder {
var sum = 0
def add(x: Int): Unit = sum += x
add(1)
}
//scala> new E
//D: null but goodbye
//A: null, null
//B: hello, null
//C: hello, dad
//E: hello, dad, and goodbye...
class LogAdder extends Adder {
private object state {
var added: Set[Int] = Set.empty
}
import state._
override def add(x: Int): Unit = { added += x; super.add(x) }
}

### Early definitions: deprecated

Sometimes all you need from an interface is a compile-time constant.
Scala 2 supports early definitinos, but they are deprecated in Scala 2.13 and unsupported in Scala 3.
See the [migration guide](https://docs.scala-lang.org/scala3/guides/migration/incompat-dropped-features.html#early-initializer) for more information.

Constant values are stricter than strict and earlier than early definitions and have even more limitations,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Omitting compile-time mechanisms to work around runtime issues seems important. Probably more is available in Scala 3 besides final vals.

Let's also respect that this was the original one-question FAQ. Maybe a minute of silence?

as they must be constants. They are specified in SLS 4.1.