gemma4_31b (MLX): don't allocate a second KV-cache copy for single-session runs #9305

Triggered via pull request June 27, 2026 00:49
Status Success
Total duration 24m 37s
Artifacts 3

test-backend-xnnpack.yml

on: pull_request
Matrix: test-xnnpack / test-backend-linux
Matrix: test-xnnpack / test-backend-macos
test-xnnpack / package-golden-artifacts: 3m 45s
Annotations

3 warnings
test-xnnpack / test-backend-linux (xnnpack, operators) / linux-job
Node.js 20 is deprecated. The following actions target Node.js 20 but are being forced to run on Node.js 24: ./test-infra/.github/actions/setup-ssh, actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-xnnpack / test-backend-linux (xnnpack, models) / linux-job
Node.js 20 is deprecated. The following actions target Node.js 20 but are being forced to run on Node.js 24: ./test-infra/.github/actions/setup-ssh, actions/checkout@11bd71901bbe5b1630ceea73d27597364c9af683, actions/upload-artifact@ea165f8d65b6e75b540449e92b4886f43607fa02, pmeier/pytest-results-action@a2c1430e2bddadbad9f49a6f9b879f062c6b19b1. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/
test-xnnpack / package-golden-artifacts
Node.js 20 is deprecated. The following actions target Node.js 20 but are being forced to run on Node.js 24: actions/download-artifact@v4, actions/upload-artifact@v4, seemethere/upload-artifact-s3@v5. For more information see: https://github.blog/changelog/2025-09-19-deprecation-of-node-20-on-github-actions-runners/

Artifacts

Produced during runtime
Name                           Size     Digest
golden-artifacts-xnnpack       1.57 GB  sha256:14c69cc5367f11eb367a98df6ae18581eb54d23293b4207be01f08f361751e7e
test-report-xnnpack-models     1.57 GB  sha256:b89ee9124dfad4ec9001055ce346a8e254c8ed0532beb41552824ba7d3e7668d
test-report-xnnpack-operators  8.24 MB  sha256:9bc7d3c277fae6e392cb65d52f93a637db20f01dc5ca27feeb5027e3272d8ac1