[slimtensor] Introduce Device and ScalarType headers for SlimTensor minimal support #5590
cuda.yml
on: pull_request
Matrix: export-model-cuda-artifact
Matrix: test-cuda-builds
unittest-cuda / linux-job (25m 1s)
Matrix: test-models-cuda
Matrix: test-model-cuda-e2e
check-all-cuda-builds (3s)
Artifacts
Produced during runtime
| Name | Status | Size | Digest |
|---|---|---|---|
| google-gemma-3-4b-it-cuda-non-quantized | Expired | 7.22 GB | sha256:830fbf22b01f3adb55c088adf198ddf5c004b5f0892e7cf1a46332694c6c851a |
| google-gemma-3-4b-it-cuda-quantized-int4-tile-packed | Expired | 3.36 GB | sha256:4bfb5a8135c75a7de06856dfd5c7c98f882d900936fb48930ec8b71b198d19f3 |
| mistralai-Voxtral-Mini-3B-2507-cuda-non-quantized | Expired | 6.82 GB | sha256:4321a14075f5139e90e4a891ab681ff0ce2fa68eba32381e732ced293d97d788 |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-tile-packed | Expired | 2.8 GB | sha256:d9fb79cce7e327d1a1fd3ed442d6c468227ba7c3762a93858d13b2d17f0bdc23 |
| mistralai-Voxtral-Mini-3B-2507-cuda-quantized-int4-weight-only | Expired | 6.14 GB | sha256:d07c055cf2c0dfe0d50afa8a6b13dc51830ef2c5847e7a248518412c7e6b645c |
| openai-whisper-large-v3-turbo-cuda-non-quantized | Expired | 1.18 GB | sha256:db52918a1734dfb4691ee891be3df510c0e57ce431a0f10bad185773d16dd58d |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-tile-packed | Expired | 491 MB | sha256:39f70ff334610cbca440a8a41fc0eb32e8b1a5e1b720e1db632b50443c7735e1 |
| openai-whisper-large-v3-turbo-cuda-quantized-int4-weight-only | Expired | 485 MB | sha256:b44025998371d2c7f935e29739b3c974160f9d9346ee950f42bce18119ea86e6 |
| openai-whisper-small-cuda-non-quantized | Expired | 361 MB | sha256:0a58ff134510cdd63b5b3f0907dfe604c2967374961c3bf1612568d8ef3066f0 |
| openai-whisper-small-cuda-quantized-int4-tile-packed | Expired | 172 MB | sha256:694cdc17da4ddb0d19de9b324f5da3ee721a962d72c53762bed26b40c19a2581 |
| openai-whisper-small-cuda-quantized-int4-weight-only | Expired | 270 MB | sha256:25a15587b7ffe4b01cdc4a2e077e04f883592c36fece1870ddc036f55358df7d |