Commit 68b61d9
committed
fix: correct output buffer sizing — use per-thread cap, not total
output_cap was n*64 (total entries) but the kernel indexes as
tid*output_cap, making the buffer n*n*64*5 uint32s (~128GB at n=10000).
Fixed to 64 per-thread entries = 12.8MB at n=10000.
Co-Authored-By: Claude Opus 4.6 <[email protected]>1 parent 24ea722 commit 68b61d9
1 file changed
+3
-5
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
431 | 431 | | |
432 | 432 | | |
433 | 433 | | |
434 | | - | |
| 434 | + | |
435 | 435 | | |
436 | | - | |
| 436 | + | |
437 | 437 | | |
438 | | - | |
439 | | - | |
440 | | - | |
| 438 | + | |
441 | 439 | | |
442 | 440 | | |
443 | 441 | | |
| |||
0 commit comments