[ET-VK][qconv] Add layout-flexible impl of quantized depthwise conv2d #1123
cuda-windows.yml
on: pull_request
Matrix: export-model-cuda-windows-artifact
Artifacts
Produced during runtime
| Name | Size | Digest | |
|---|---|---|---|
|
mistralai-Voxtral-Mini-3B-2507-cuda-windows-non-quantized
|
6.82 GB |
sha256:62b239e7b4211ee4d3d68cd7f85efb9670856119331bd61e69be09ea42913c63
|
|
|
mistralai-Voxtral-Mini-3B-2507-cuda-windows-quantized-int4-weight-only
|
6.15 GB |
sha256:e27ae430ea77ace0837d79dc787e2b6c7c0c2052d1acff5c2df46c9ba7621d0b
|
|