Separation between Felt, Bool, and Uint types #423

Soulthym · 2025-07-18T10:06:58Z

This PR aims to solve #334
It adds proper type annotations, checking, and inference

Mainly, it splits scalar types into 4 possibilities:

_: an Untyped (scalar) type hole, used to represent a ScalarType that is not yet known or defined
uint: a constant unsigned integer, aimed to be used only as an index and type for RangeExpression
felt: a single field element
bool: a boolean value formed from a felt value

Aggregate types have a new syntax:

ScalarType[len]: a Vector containing len scalars of type ScalarType, where len is a positive integer:
- _[len]: a Vector of len scalar type holes
- uint[len]: a Vector of len unsigned integers
- felt[len]: a Vector of len field elements
- bool[len]: a Vector of len boolean values
ScalarType[rows, cols]: a Matrix containing rows rows and cols columns of type ScalarType, where rows and cols are positive integers:
- _[rows, cols]: a Matrix of rows rows and cols columns of scalar type holes
- uint[rows, cols]: a Matrix of rows rows and cols columns of unsigned integers
- felt[rows, cols]: a Matrix of rows rows and cols columns of field elements
- bool[rows, cols]: a Matrix of rows rows and cols columns of boolean values
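
To make the notation above concrete, here is a minimal Rust sketch of how these types could be represented and printed. The names `ScalarType`, `Type`, and `Type::vector` are assumptions made for illustration and are not taken from the PR's typing crate; only the printed syntax (`felt[3]`, `bool[2, 4]`, `_`) follows the description above.

```rust
use std::fmt;

/// Scalar types from the description above (`Hole` stands for `_`).
/// Hypothetical names, not the PR's actual API.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ScalarType {
    Hole,
    UInt,
    Felt,
    Bool,
}

/// A scalar, a vector `T[len]`, or a matrix `T[rows, cols]`.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Type {
    Scalar(ScalarType),
    Vector(ScalarType, usize),
    Matrix(ScalarType, usize, usize),
}

impl Type {
    /// Builds a vector type, rejecting a zero length
    /// (the description requires `len` to be positive).
    pub fn vector(scalar: ScalarType, len: usize) -> Option<Type> {
        (len > 0).then_some(Type::Vector(scalar, len))
    }
}

impl fmt::Display for ScalarType {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        let s = match self {
            ScalarType::Hole => "_",
            ScalarType::UInt => "uint",
            ScalarType::Felt => "felt",
            ScalarType::Bool => "bool",
        };
        f.write_str(s)
    }
}

impl fmt::Display for Type {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        match self {
            Type::Scalar(s) => write!(f, "{s}"),
            Type::Vector(s, len) => write!(f, "{s}[{len}]"),
            Type::Matrix(s, rows, cols) => write!(f, "{s}[{rows}, {cols}]"),
        }
    }
}
```

With this sketch, `Type::Matrix(ScalarType::Bool, 2, 4).to_string()` yields `bool[2, 4]`, matching the `ScalarType[rows, cols]` form.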

I am unsure of which syntax to use for Matrices. I have re-used the one used in the Display implementation.
An alternative would be: SCALAR_TYPE[[cols], rows], which has the advantage of matching our notation for Tables (Matrices of unknown row length).
Both are possible and not an issue to change.

It also adds optional type annotations in the parser for:

const expressions: const x: TYPE = EXPR, where TYPE is one of the above types.
let expressions: let x: TYPE = EXPR;, where TYPE is one of the above types.
functions: fn func(x: TYPE) -> TYPE, where TYPE is one of the above types.
evaluator: ev evaluator(SCALAR_TYPE[a, b]), where SCALAR_TYPE is one of the above ScalarTypes and a, b are column labels
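
As a rough illustration of what accepting the `TYPE` part of these annotations involves, the following sketch parses the type syntax from a string. It is a standalone toy, not the PR's parser: `parse_type`, `parse_scalar`, and `parse_dim` are hypothetical helpers, and the real grammar may differ.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ScalarType {
    Hole,
    UInt,
    Felt,
    Bool,
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Type {
    Scalar(ScalarType),
    Vector(ScalarType, usize),
    Matrix(ScalarType, usize, usize),
}

/// Parses one of the four scalar type names.
fn parse_scalar(s: &str) -> Option<ScalarType> {
    match s {
        "_" => Some(ScalarType::Hole),
        "uint" => Some(ScalarType::UInt),
        "felt" => Some(ScalarType::Felt),
        "bool" => Some(ScalarType::Bool),
        _ => None,
    }
}

/// Parses a dimension, which must be a positive integer.
fn parse_dim(s: &str) -> Option<usize> {
    s.trim().parse::<usize>().ok().filter(|&n| n > 0)
}

/// Parses `SCALAR`, `SCALAR[len]`, or `SCALAR[rows, cols]`.
pub fn parse_type(src: &str) -> Option<Type> {
    let src = src.trim();
    // No bracket: the annotation is a plain scalar type.
    let Some((head, rest)) = src.split_once('[') else {
        return parse_scalar(src).map(Type::Scalar);
    };
    let scalar = parse_scalar(head.trim())?;
    let dims = rest.strip_suffix(']')?;
    let parts: Vec<&str> = dims.split(',').collect();
    match parts.as_slice() {
        [len] => Some(Type::Vector(scalar, parse_dim(len)?)),
        [rows, cols] => Some(Type::Matrix(scalar, parse_dim(rows)?, parse_dim(cols)?)),
        _ => None,
    }
}
```

Under these assumptions, `parse_type("felt[3]")` returns a vector type, while `parse_type("felt[0]")` and `parse_type("int[3]")` are rejected.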

Most of the individual pieces are implemented, but they rely on a few fixes and improvements to work properly together. Those will be merged as soon as the underlying issues are resolved.
I've marked those as [WIP] below.

Check-list:

bobbinth · 2025-07-21T05:24:58Z

const expressions: const x: TYPE = EXPR, where TYPE is one of the above types.

Would we be able to verify at compile time that const expressions match the declared type? I'm mainly thinking of declaring a constant to be a bool type.

let expressions: let x: TYPE = EXPR;, where TYPE is one of the above types.

Similar question to the one above, but also, if we are able to verify the type of EXPR, I'm assuming that we can infer it as well. In such a case, do we really need this syntax?

Soulthym · 2025-07-21T11:27:27Z

const expressions: const x: TYPE = EXPR, where TYPE is one of the above types.

Would we be able to verify at compile time that const expressions match the declared type? I'm mainly thinking of declaring a constant to be a bool type.

Yes, it should be able to verify annotations on both const and let expressions.
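
For the bool case specifically, a compile-time check on a constant could look roughly like the sketch below. It assumes constants are plain integer literals and that `uint` and `felt` accept any literal; `check_const` is a hypothetical helper, not a function from this PR.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ScalarType {
    UInt,
    Felt,
    Bool,
}

/// Checks a constant integer literal against its declared scalar type.
/// Assumption: only `bool` restricts the value, to 0 or 1.
pub fn check_const(declared: ScalarType, value: u64) -> Result<(), String> {
    match declared {
        ScalarType::Bool if value > 1 => Err(format!(
            "constant {value} is not a valid bool (expected 0 or 1)"
        )),
        _ => Ok(()),
    }
}
```

In this sketch, `const x: bool = 2` would be rejected, while `const x: felt = 2` would pass.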

let expressions: let x: TYPE = EXPR;, where TYPE is one of the above types.

Similar question to the one above, but also, if we are able to verify the type of EXPR, I'm assuming that we can infer it as well. In such a case, do we really need this syntax?

I do agree in principle that we don't really need it.
However, my thinking was that having those annotations provided more benefits than not having them, mainly:

they make testing the inference logic a lot easier, since the tests can be self-contained within a .air script. There's no need to explicitly test every type in its AST representation, only some cases to make sure inference code paths get triggered; the rest can be done "in-line" in test scripts.
they allow greater expressivity about what should be a valid type, and hence more guardrails a module can put in place against misuse
they provide an escape hatch¹, should the inference be wrong (or maybe too restrictive/permissive) but still reach production. Code with added annotations should remain compatible with a future inference fix, without requiring modifications or new audits.

¹ Note about the escape hatch: you can already do this by declaring a function with your expression as its body and calling it inline where the expression used to be; its arguments then get checked against the argument types. I find that to be a somewhat hacky fix and prefer optional typing on let/const.

Soulthym · 2025-07-22T08:10:14Z

On second thought, I don't think inference can be done perfectly in all cases anyway without making the type system too complex.

Take the following example with 2 mutually exclusive flags a and b:

1: let a: bool = main[0];
2: let b: bool = main[1];
3: enf a^2 = a;
4: enf b^2 = b;
5: let c: TYPE = a + b;
6: enf c^2 = c;
7: let d: TYPE2 = a + b;

What should TYPE and TYPE2 be? They could be both felt and bool, depending on context: line 5 and 6 could come from a different module than line 7, after inlining. The types may not even be equal, where c: TYPE could be inferred to be a bool and d: TYPE2 would be a felt before CSE (which happens last, after type-inference).

I think ensuring inference is conservative when ambiguous + adding an explicit type cast via let annotations that respect subtyping rules under variance would let the user disambiguate in case both interpretations were valid.

Here that would mean inferring TYPE = TYPE2 = felt (because (a: bool) + (b: bool) falls outside {0, 1} iff a == 1 and b == 1), and letting the user cast it down to a bool where appropriate. bool is a subtype of felt under covariance, so the cast would be valid.
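
The proposal can be sketched in a few lines of Rust. This is an illustration under stated assumptions, not the typing crate's code: `is_subtype`, `infer_add`, `Cast`, and `classify_annotation` are hypothetical names, and only `felt` and `bool` are modelled.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ScalarType {
    Felt,
    Bool,
}

/// How an annotation relates to the inferred type.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Cast {
    Same,
    Widen,  // bool -> felt, always safe
    Narrow, // felt -> bool, relies on the user's claim
}

/// `bool` is treated as a subtype of `felt`; every type is a subtype of itself.
pub fn is_subtype(sub: ScalarType, sup: ScalarType) -> bool {
    sub == sup || (sub == ScalarType::Bool && sup == ScalarType::Felt)
}

/// Conservative inference for `lhs + rhs`: always `felt`,
/// because 1 + 1 = 2 is not a boolean value.
pub fn infer_add(_lhs: ScalarType, _rhs: ScalarType) -> ScalarType {
    ScalarType::Felt
}

/// Classifies `let x: declared = expr`, where `expr` was inferred as `inferred`.
pub fn classify_annotation(declared: ScalarType, inferred: ScalarType) -> Cast {
    if declared == inferred {
        Cast::Same
    } else if is_subtype(inferred, declared) {
        Cast::Widen
    } else {
        Cast::Narrow
    }
}
```

With this model, `let c: bool = a + b` is classified as a narrowing cast from the inferred `felt`, which is the case where the user disambiguates.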

bobbinth · 2025-07-22T20:54:51Z

I think ensuring inference is conservative when ambiguous + adding an explicit type cast via let annotations that respect subtyping rules under variance would let the user disambiguate in case both interpretations were valid.

I think allowing the user to specify the type may lead to some confusion because the user may assume that specifying the type somehow enforces this type. If we do this, we should make sure to report very clear warnings that we don't actually know if what the user has specified is correct.

Take the following example with 2 mutually exclusive flags a and b:

1: let a: bool = main[0];
2: let b: bool = main[1];
3: enf a^2 = a;
4: enf b^2 = b;
5: let c: TYPE = a + b;
6: enf c^2 = c;
7: let d: TYPE2 = a + b;

A couple of comments here:

We should use a special is_bool(x) primitive to allow the user to "narrow" the type. Under the hood, this will be translated into an enf x^2 = x constraint.
The type of a main trace column should always be felt and the user should not be able to change it without an extra constraint. So, for example let a: bool = main[0]; should not be valid.
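
The reason x^2 = x works as a boolean check is that, in a field, x(x - 1) = 0 forces x = 0 or x = 1. The small sketch below confirms this by brute force. It assumes `p` is a small prime so that `x * x` fits in a `u64`; `boolean_solutions` is a hypothetical helper used only for this illustration.

```rust
/// Returns every x in the prime field of order `p` that satisfies x^2 = x.
/// Assumption: `p` is prime and small enough that `x * x` does not overflow.
pub fn boolean_solutions(p: u64) -> Vec<u64> {
    (0..p).filter(|&x| (x * x) % p == x).collect()
}
```

For any prime `p`, the only solutions are 0 and 1, so a column constrained this way can safely be treated as bool.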

So, I would re-write the above example as:

1: let a = main[0];
2: let b = main[1];
3: is_bool(a);
4: is_bool(b);
5: let c: TYPE = a + b;
6: is_bool(c);
7: let d: TYPE2 = a + b;

What should TYPE and TYPE2 be? They could be both felt and bool, depending on context: line 5 and 6 could come from a different module than line 7, after inlining. The types may not even be equal, where c: TYPE could be inferred to be a bool and d: TYPE2 would be a felt before CSE (which happens last, after type-inference).

In the above, TYPE would be bool (because we have is_bool(c)) and TYPE2 would be felt (because nothing is constraining it).

Is the issue here that is_bool(c) comes after let c = a + b? If so, I wonder if we could modify the way is_bool() works specifically for type inference purposes. For example, could we write the above as:

1: let a = is_bool(main[0]);
2: let b = is_bool(main[1]);
3: let c = a + b;
4: let c = is_bool(c); // not sure if we allow variable shadowing, if not, we could use a different name
5: let d = a + b;

The constraints generated would be exactly the same as in the original example. Here, is_bool() is still the only way to narrow the type from felt to bool, but from an inference standpoint it acts like a function that takes an expression and returns a boolean value.
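
A minimal sketch of that idea, treating is_bool as an expression node that both returns bool and records a constraint, could look as follows. The `Expr` shape and the `infer` function are assumptions for illustration only; the recorded entries stand for "this expression must satisfy x^2 = x".

```rust
#[derive(Debug, Clone, PartialEq, Eq)]
pub enum Expr {
    Column(usize), // stands for main[i]
    Add(Box<Expr>, Box<Expr>),
    IsBool(Box<Expr>),
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ScalarType {
    Felt,
    Bool,
}

/// Infers the type of `expr`. Each `IsBool(x)` pushes `x` onto `constraints`,
/// meaning "enforce x^2 = x", and is the only node that yields `bool`.
pub fn infer(expr: &Expr, constraints: &mut Vec<Expr>) -> ScalarType {
    match expr {
        // Trace columns are always felt.
        Expr::Column(_) => ScalarType::Felt,
        // Addition is inferred conservatively as felt.
        Expr::Add(lhs, rhs) => {
            infer(lhs, constraints);
            infer(rhs, constraints);
            ScalarType::Felt
        }
        Expr::IsBool(inner) => {
            infer(inner, constraints);
            constraints.push((**inner).clone());
            ScalarType::Bool
        }
    }
}
```

In this model, `a + b` with both operands narrowed is still inferred as felt, and only wrapping the sum in another is_bool node makes it bool, at the cost of one extra recorded constraint.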

Soulthym · 2025-07-23T09:51:26Z

Is the issue here that is_bool(c) comes after let c = a + b? If so, I wonder if we could modify the way is_bool() works specifically for type inference purposes.

The order wasn't the issue. The example was meant to show valid type conversions and their consequences before and after CSE (mainly that d would be detected as equal to c after CSE, and hence could be cast both as a bool and as a felt, but not before CSE).

If we narrow it down to bool only in the presence of an is_bool(x), that should work as well, and shouldn't require explicit casting via type annotations.

On the other points you've mentioned, I do agree with that design, and it shouldn't be too big of a change to incorporate in this PR.

bobbinth · 2025-07-23T20:14:16Z

The order wasn't the issue. The example was meant to show valid type conversions and their consequences before and after CSE (mainly that d would be detected as equal to c after CSE, and hence could be cast both as a bool and as a felt, but not before CSE).

Makes sense. I think the user can always do is_bool(d) and then (I'm hoping) common sub-expression elimination would be able to remove the duplicate constraints.
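
As a rough illustration of why the duplicate would disappear: if c and d are both a + b, the constraints produced by is_bool(c) and is_bool(d) are structurally identical. The sketch below uses plain structural deduplication as a stand-in for CSE; it is not the compiler's actual pass, and `Expr` and `dedup_constraints` are hypothetical names.

```rust
use std::collections::HashSet;

#[derive(Debug, Clone, PartialEq, Eq, Hash)]
pub enum Expr {
    Column(usize), // stands for main[i]
    Add(Box<Expr>, Box<Expr>),
}

/// Drops structurally identical boolean constraints,
/// keeping the first occurrence of each in its original order.
pub fn dedup_constraints(constraints: Vec<Expr>) -> Vec<Expr> {
    let mut seen = HashSet::new();
    constraints
        .into_iter()
        // `insert` returns false when the expression was already seen.
        .filter(|c| seen.insert(c.clone()))
        .collect()
}
```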

If we narrow it down to bool only in the presence of an is_bool(x), that should work as well, and shouldn't require explicit casting via type annotations.

The only thing I'm wondering is whether it should be just is_bool(x) (as I had in my example) or enf is_bool(x). The latter would be more consistent, I think.

Soulthym · 2025-07-29T14:17:58Z

Hey @bobbinth!

I have rebased on next, and made and merged the tweaks we discussed to the type system itself, under the typing crate.

I am still merging in the integration into the existing codebase, based on next. That part should come fairly soon.

When implementing binary operations, I've noticed a few cases that, while technically correct, lose some information that could be used to refine the type system.

I opened a separate issue #432 to track it for now, since that shouldn't affect the soundness, while only adding precision in certain cases.

Do let me know if you'd like us to include it (or parts of it) in this PR!

bobbinth · 2025-07-30T05:07:41Z

I opened a separate issue #432 to track it for now, since that shouldn't affect the soundness, while only adding precision in certain cases.

Do let me know if you'd like us to include it (or parts of it) in this PR!

This PR is pretty big as is - so, I'd probably leave it for a follow-up (unless including it in this PR will simplify things somehow).

Soulthym · 2025-08-01T15:36:13Z

bitwalker

Looks like there are still a number of stubbed-out bits/TODOs, but I'm approving as-is for now, contingent on my couple of suggestions around the typing crate rename and how we manage dependencies within the workspace.

typing/Cargo.toml

parser/Cargo.toml

Cargo.toml

Soulthym · 2025-08-26T13:16:37Z

Hey @bitwalker, thanks for the review!

I've implemented your suggestions in 9419d39, and renamed the directory accordingly to improve clarity.

Soulthym · 2025-09-01T13:31:47Z

Hey!

We've been able to sync on how Separation between Felt, Bool, and Uint types (#423) and Allow support for computed indices (#444) interact, and we think the best way forward would be to:

merge the Allow support for computed indices PR into next. This one should be prioritized for review, so that we can proceed with the following items.
merge next into Separation between Felt, Bool, and Uint types
update Separation between Felt, Bool, and Uint types to add the relevant checks for computed index types

In the meantime, I'm merging other missing features/integrations into Separation between Felt, Bool, and Uint types, so while I won't be blocked immediately waiting for Allow support for computed indices, I will be in a few days, once the remaining features are integrated.

adr1anh · 2025-09-02T06:18:03Z

Given the changes you are planning to make to this branch, are there portions that are reviewable now/are not expected to change much?

Soulthym · 2025-09-02T07:31:15Z

Given the changes you are planning to make to this branch, are there portions that are reviewable now/are not expected to change much?

The next commit should be just that: while it is backward-compatible at the moment, there are still overlaps to resolve before I integrate it fully into most crates (ast, sema, mir, air-types). I'll ping you as soon as it's at that stage!

Soulthym · 2025-09-19T14:00:32Z

Hey @adr1anh

The tests are currently failing due to the missing type-checking pass (all errors seem to stem from some types being None); otherwise, most changes are fully merged in.

From this point, not much should change besides the new pass, a few tests, and the assert_bool handling once Mir::Cast is merged in.

I've updated the task lists above, for a quick recap here is what's left:

Leftover tasks:

type inference + check for uint indices pass (wip)
cast primitive in Mir (wip)
test new features (wip)
documentation

Leo-Besancon mentioned this pull request Jul 23, 2025

Enforce selector rules for match statements #428

Open

Soulthym force-pushed the feat-numeric-types branch from 65d0047 to 3d22950 on July 29, 2025 13:38

Merge branch 'feat-typing' into feat-numeric-types

7f30ca7

Soulthym force-pushed the feat-numeric-types branch from 3d22950 to 7f30ca7 on July 29, 2025 13:46

Soulthym mentioned this pull request Jul 29, 2025

Refine type inference #432

Open

Soulthym added 2 commits July 29, 2025 17:03

docs(typing): add a NOTE for BinType::infer_bin_ty_sub, for issue 0xM…

474de8a

…iden#432

chores(typing): make format

1255bc9

Soulthym added 12 commits July 30, 2025 12:39

fix(typing): adapt to old parser/ast/types api

9f17e24

feat(typing): impl Typing for Span<T: Typing>

926d666

feat(typing): allow forwarding of idents in sty! macro

74af2b2

refactor(typing): default impl for Kind for Kind::Value Types

5a8af93

feat(typing): impl Typing for Vec<T>

bdec887

feat(typing): Aggregate Kind variant + rework Show + impl Typing for …

cc98758

…Box + Vec<T>

refactor(typing): rename int to uint

41a3ff4

feat(typing): add an optional span to the TypeError enum

4891f5a

fix(typing): properly rename Int to UInt

3ad4b90

feat(typing): impl Display for TypeError

3402d8f

refactor(typing): integrate into codebase

60fb953

chores: cargo clippy + fmt

e7fadd4

Soulthym added 2 commits August 5, 2025 10:15

fix(typing): fix access::Default on TraceBinding

ee8b4a7

fix(typing): fix TraceSegment::kind()

6fee06e

bobbinth requested review from bitwalker and bobbinth August 7, 2025 17:48

feat(typing): assert_bool primitive

b2c75b6

Soulthym force-pushed the feat-numeric-types branch from 95f7393 to b2c75b6 on August 8, 2025 10:06

fix(mir/translate): fix inserted enf expr

c8cfad3

adr1anh self-requested a review August 22, 2025 12:47

bitwalker approved these changes Aug 25, 2025

refactor(typing): rename typing crate to air_types

9419d39

bobbinth mentioned this pull request Sep 7, 2025

Add support for more general boundary constraints #433

Open

Soulthym and others added 13 commits September 10, 2025 10:19

Merge remote-tracking branch 'upstream/next' into feat-numeric-types-…

16195e1

…update

feat(types): FunctionType::check_args_kinds

3210f36

feat: implement typing for MIR nodes

1ceaadc

refactor: rename none::None to stale::Stale

46f8065

fix: us correct op for infer_bin_ty_*

f68eb25

refactor(types): change *_mut api + support for RefCell, Ref, and RefMut

7efee31

refactor(types): update ast, expose Typ* traits through Link

e21c2d2

Merge branch 'feat-numeric-types-update-leo' into feat-numeric-types-…

3eaf4d5

…update-thy

feat(types): implement typing for the whole pipeline

d57ae6e

fix(types): fix Trace* Typing impl

fe751bb

fix(types): various bug fixes

3c18d05

fix(types): more bug fixes, remove unused debugging

49339d6

fix(types): fix Evaluator argument types

7d49a85

Leo-Besancon mentioned this pull request Sep 23, 2025

Allow support for computed indices #444

Merged

14 tasks

Soulthym added 2 commits September 24, 2025 12:32

feat(mir): Cast primitive + translate assert_bool

4020b8a

refactor(types): rename assert_bool -> as_bool

bd2df16
