Skip to content

Conversation

ggerganov
Copy link
Member

@ggerganov ggerganov commented Sep 23, 2025

  • Remove V100 runner and workflows
  • Allocate 2x T4 and 1x A10 runners for NVIDIA workflows
  • Disable AMD workflows until runner availability is resolved

@ggerganov ggerganov requested a review from CISC as a code owner September 23, 2025 14:12
@github-actions github-actions bot added the devops improvements to build systems and github actions label Sep 23, 2025
@ggerganov ggerganov merged commit f505bd8 into master Sep 23, 2025
92 of 100 checks passed
@ggerganov ggerganov deleted the gg/ci-new-nvidia-workflows branch September 23, 2025 17:41
gabe-l-hart added a commit to gabe-l-hart/llama.cpp that referenced this pull request Sep 23, 2025
* origin/master: (39 commits)
ci : disable AMD workflows + update NVIDIA workflows (ggml-org#16200)
ci : enable Vulkan workflow on Mac (ggml-org#16194)
ggml-cpu: Respect cpumask settings (ggml-org#16164)
ggml : fix uninitialized is_on_grid in quantize_row_iq3_xxs_impl (ggml-org#15928)
zdnn: refactor codebase + add docs (ggml-org#16178)
codeowners : add @danbev to model-conversion example [no ci] (ggml-org#16190)
devops: add s390x containers (ggml-org#15915)
ggml-cpu : fix typo in gemm comments [no ci] (ggml-org#16189)
feat: Add conversion support in GraniteHybrid for non-hybrid (all attn) (ggml-org#16177)
clang-tidy : disable warning about performance enum size (ggml-org#16127)
ggml : implement set_rows with i32 index (ggml-org#16159)
codeowners : update + cleanup (ggml-org#16174)
common : enable `--offline` mode without curl support (ggml-org#16137)
webui : fix handling incomplete chunks (ggml-org#16107)
embedding : fix typos in README (ggml-org#16171)
common : remove unused local variables (ggml-org#16140)
ggml : extend ggml_can_fuse to work with non-sequential nodes (ggml-org#16123)
ggml : add ggml_op_is_empty (ggml-org#16122)
codeowners : update ownership for @ngxson and @allozuar (ggml-org#16128)
Vulkan: add conv_transpose_2d operation (ggml-org#16022)
...
pwilkin pushed a commit to pwilkin/llama.cpp that referenced this pull request Sep 25, 2025
* ci : disable AMD workflows + update NVIDIA workflows

* cont : fixes

* cont : update nvidia vulkan workflows
struct pushed a commit to struct/llama.cpp that referenced this pull request Sep 26, 2025
* ci : disable AMD workflows + update NVIDIA workflows

* cont : fixes

* cont : update nvidia vulkan workflows
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
devops improvements to build systems and github actions
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant