Vectorised automatic differentiation by athas · Pull Request #2471 · diku-dk/futhark

athas · 2026-05-29T07:28:09Z

I have been sitting on this for a while, but maybe it's time to get it finished and merged. It adds facilities for vectorised AD, exposed as the following two functions:

-- | As `jvp`, but accepts a vector of seed values. Semantically
-- equivalent to mapping, but may be more efficient.
def jvp_vec 'a 'b [n] (f: a -> b) (x: a) (x': [n]a) : [n]b =
  ...

-- | As `vjp`, but accepts a vector of seed values. Semantically
-- equivalent to mapping, but may be more efficient.
def vjp_vec 'a 'b [n] (f: a -> b) (x: a) (y': [n]b) : [n]a =
  ...

The main advantage is that it allows amortisation of the primal computation over multiple tangent/adjoint computations. I forgot what state I left it in, but all tests work. In principle it is not so difficult to support, actually, but a core trick is that the transformation can always fall back to explicit looping.

athas · 2026-06-10T21:09:29Z

The last failing test is unrelated to AD, but must of course be fixed before this can be merged. I have not finished diagnosing or fixing the test, but it occurs for the "subhistogram" case of code generation for SegHist in the multicore backend - I believe it may be related to multi-versioning in the kernel body, but I'm not sure. It took a while to reproduce because that case is only hit with the right combination of thread count and input size.

athas · 2026-06-11T18:38:01Z

The program from this blog post does benefit from vectorized forward-mode AD (although reverse mode is still faster):

-- The function to approximate.
def f (x: f32) =
  if x == 0 then 0 else f32.exp (-1 / (x * x))

def poly_eval [d] (P: [d + 1]f32) (x: f32) =
  f32.sum (map2 (*) P (map (x **) (map f32.i64 (indices P))))

def N : i64 = 1000

def START : f32 = (-1)

def END : f32 = 1

def riemann_integral [d] (P: [d + 1]f32) =
  let step_size = (END - START) / f32.i64 N
  let g j =
    let x = START + f32.i64 j * step_size
    let delta = poly_eval P x - f x
    in delta * delta * step_size
  in f32.sum (tabulate N g)

def poly_init (d: i64) =
  tabulate (d + 1) (\i -> f32.i64 (i + 1) * (1 / (f32.i64 d + 1)))

entry fwd d =
  tabulate_2d (d + 1) (d + 1) (\i j -> f32.bool (j == i))
  |> map (jvp riemann_integral (poly_init d))

entry fwd_vec (gradlen: i64) d =
  let num_grads = (d + 11 + gradlen - 1) / gradlen
  let seeds gradstart =
    tabulate_2d gradlen
                (d + 1)
                (\i j -> f32.bool (j == i + gradstart))
  in map (seeds <-< (gradlen *)) (iota num_grads)
     |> map (#[unroll] jmp riemann_integral (poly_init d))
     |> flatten
     |> take d

entry rev d =
  vjp riemann_integral (poly_init d) 1

-- ==
-- entry: fwd rev
-- random input { 8i64 }
-- random input { 128i64 }

-- ==
-- entry: fwd_vec
-- random input { 1i64 8i64 }
-- random input { 1i64 128i64 }
-- random input { 2i64 128i64 }
-- random input { 4i64 128i64 }
-- random input { 8i64 128i64 }
-- random input { 16i64 128i64 }
-- random input { 32i64 128i64 }
-- random input { 64i64 128i64 }
-- random input { 128i64 128i64 }

This means that all I just need to decide on the best nomenclature for this feature, and then it is ready.

athas · 2026-06-12T12:10:01Z

I had a great idea, inspired by the Jax documentation: the surface-level functions should be mjp and jmp, for matrix-Jacobian-product and Jacobian-matrix-product, respectively. For that is exactly what it is! I still need a good term for the overall concept - currently I'm stuck on "vector AD".

athas added 30 commits December 5, 2024 20:50

Create frontend and IR for vectorised AD.

5777e79

Hook it up in internalisation, too.

a54fa53

Basic support for vectorised forward-mode AD.

7ecd907

Forgot to add tests.

a8e616e

Merge branch 'master' into ad-vec

91465c3

Scan test.

9b488e1

Add jvp_vec and vjp_vec.

9d15ec6

Merge branch 'master' into ad-vec

9a1eb4b

Merge branch 'master' into ad-vec

bc01f7d

This should not need modification.

e75f617

Change how accumulators are handled.

cbd98fb

Implement vectorised scatter.

8ae043a

Add map test.

6760758

More tests, some that fail.

240edfe

Tweak the tests.

644b8c2

Another test.

e9eac0a

Some hackyish fixes.

d1438bc

Merge branch 'master' into ad-vec

5a4128c

Fix a handful of things.

41e2738

More things work.

e8c0c14

Minor fixes.

f67a2b0

Fix vjp2_vec in interpreter.

0a10a5c

Start work on vectorised reverse mode AD.

b21f5f6

Support primitive functions properly.

ebfae42

Make unops and primfuns work.

1684541

Work on vectorised reductions.

4819c5c

Start on scan.

512c0ff

Merge branch 'master' into ad-vec

47386d2

More work.

bee9ae3

Some tests, some of which fail.

7d4f77d

athas added 4 commits June 10, 2026 10:39

More robust equality checking.

d1f4d31

Individual tests.

eeff1c7

Lower tolerance.

055da84

Rewrite this test to be less crazy.

4a94616

athas added 3 commits June 11, 2026 08:06

Don't need this.

2e40acd

No ISPC for this one.

27bd8a3

Merge branch 'master' into ad-vec

7d8c5d6

athas marked this pull request as ready for review June 11, 2026 18:38

athas added 2 commits June 12, 2026 12:48

Merge branch 'master' into ad-vec

3b7698f

Remove duplicate tests.

1d441f9

athas added 12 commits June 12, 2026 14:17

Refresh terminology.

0afeea9

Also update interpreter.

a187ad4

Add failing test.

a21e7ff

Fix typo in comment.

85ecf8a

Handle with_vjp in vector mode.

21260c5

Nomenclature fixes.

36ca87a

Fix markup.

4c0a484

Better reference.

eb74e14

Improve documentation.

349e5bc

Further elaboration.

788b443

Clarify.

04f3bb9

More.

94cb3ab

athas requested a review from zfnmxt June 13, 2026 07:01

athas added 4 commits June 13, 2026 09:55

Better naming.

3834719

More docs.

f1df3dc

Minor fices.

100de3d

Merge branch 'master' into ad-vec

9c46df2

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Vectorised automatic differentiation#2471

Vectorised automatic differentiation#2471
athas wants to merge 99 commits into
masterfrom
ad-vec

athas commented May 29, 2026

Uh oh!

athas commented Jun 10, 2026

Uh oh!

athas commented Jun 11, 2026 •

edited

Loading

Uh oh!

athas commented Jun 12, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

athas commented May 29, 2026

Uh oh!

athas commented Jun 10, 2026

Uh oh!

athas commented Jun 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

athas commented Jun 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

athas commented Jun 11, 2026 •

edited

Loading

athas commented Jun 12, 2026 •

edited

Loading