[ET-VK][testing] Add per-shader timing breakdown to benchmark output #8126

Triggered via pull request February 5, 2026 23:28
Status Failure
Total duration 1h 19m 39s
Artifacts 14

cuda.yml

on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job: 24m 38s
Matrix: test-models-cuda
Matrix: test-model-cuda-e2e
check-all-cuda-builds: 4s

Annotations

1 error

Artifacts

Produced during runtime
Name | Size | Digest
google-gemma-3-4b-it-cuda-non-quantized | 7.22 GB | sha256:a1e4883982e6233719115655072c803a448ac2b94d9171a49b14883b861d9e1b
google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | 3.36 GB | sha256:df895e571bfe69ad35e6608bdfe8e0618018119f834cfc841ee359e53da736b8
mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | 6.82 GB | sha256:e87dbcfb7400ced78897c5823d597eb45f514190d4d10aa28922d5f2f27a9040
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | 2.8 GB | sha256:41cf09a5678959889a3b89563c891c130f172153a4d7cd3b42b51497106dda8e
mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | 6.14 GB | sha256:7498b4b9e5d8be96f616685274fcbd5a279798fe75b6db8122b1339f7d9cd30e
nvidia-parakeet-tdt-cuda-non-quantized | 952 MB | sha256:0e351f636f31650dd4a90ea07cca7cbe504acf3aecfa44b2228d77d5f2a8bbe3
nvidia-parakeet-tdt-cuda-quantized-int4-tile-packed | 443 MB | sha256:c75f59bbf80175d04cacd7ced8513dcf413d36f8ad84d973d9da84de0148c71b
nvidia-parakeet-tdt-cuda-quantized-int4-weight-only | 430 MB | sha256:341cb6c5801a698ad4dba936dca0ac4fac2ab9260815c978a673268e735f222e
openai-whisper-large-v3-turbo-cuda-non-quantized | 1.18 GB | sha256:3daa2d425ac2fe5d8affc51b4c830144d631c23970a7338f7268477c4e024ff9
openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | 491 MB | sha256:e5d67ef029968dc84813c8b5b06a15cee4bf50dfc5c03f9bc645115e11e28b0c
openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | 485 MB | sha256:9f62f9c30c02c69c8e3a8e0e752ad94010548575de9c5861a29d134a7c35fd9c
openai-whisper-small-cuda-non-quantized | 361 MB | sha256:3e042aab7cb13b6df6425d922f815c3cb969f8319b51baf8455811282d1d572b
openai-whisper-small-cuda-quantized-int4-tile-packed | 172 MB | sha256:93e2945c45f47200e24ad72a0e9df7613076b372bbccc8315151236ecb187983
openai-whisper-small-cuda-quantized-int4-weight-only | 270 MB | sha256:0d58d58093048c0a0805181ef9cd77e477ab5a9075ab0ea933a353b305f089f0