-
Notifications
You must be signed in to change notification settings - Fork 13.6k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
batched-bench : add "separate text gen" mode
examples
#17103
opened Nov 8, 2025 by
ggerganov
Loading…
metal : retain src and dst buffers during async ops
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17101
opened Nov 8, 2025 by
ggerganov
Loading…
webui : add keyboard shortcut to toggle sidebar
examples
server
#17099
opened Nov 8, 2025 by
danbev
Loading…
Add Metal-4 Tensor API test harness for iOS
examples
#17098
opened Nov 8, 2025 by
ArjunDivecha
Loading…
vulkan: fuse mul_mat_id + mul
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#17095
opened Nov 8, 2025 by
jeffbolznv
Loading…
CUDA: support F32 kernel type for changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
CONV_TRANSPOSE_2D
ggml
#17094
opened Nov 8, 2025 by
AgainstEntropy
Loading…
add version to all shared object files
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
Ascend NPU
issues specific to Ascend NPUs
examples
ggml
changes relating to the ggml tensor library for machine learning
IBM zDNN
issues specific to IBM zDNN Accelerator
Nvidia GPU
Issues specific to Nvidia GPUs
OpenCL
Issues specific to the OpenCL backend
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#17091
opened Nov 7, 2025 by
furrysalamander
Loading…
opencl: add fastdiv and use it in set_rows, ported from cuda
ggml
changes relating to the ggml tensor library for machine learning
OpenCL
Issues specific to the OpenCL backend
metal : enable tensor API for A19
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#17087
opened Nov 7, 2025 by
ggerganov
Loading…
HIP: RDNA4 tensor core support for MMF
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17077
opened Nov 7, 2025 by
zhang-hui-yulo
•
Draft
[RFC] ggml: new backend for API Remoting
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
#17072
opened Nov 7, 2025 by
kpouget
Loading…
convert : handle compressed-tensors quant method
enhancement
New feature or request
python
python script changes
#17069
opened Nov 7, 2025 by
compilade
Loading…
6 of 7 tasks
Add ops needed for new hybrid models: SOFTPLUS, EXPM1, TRI, SOLVE_TRI, CUMSUM
documentation
Improvements or additions to documentation
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#17063
opened Nov 6, 2025 by
pwilkin
Loading…
cmake: add option to build and link BoringSSL
build
Compilation issues
#17062
opened Nov 6, 2025 by
angt
Loading…
[WIP] s390x ci: debug build issue
devops
improvements to build systems and github actions
#17053
opened Nov 6, 2025 by
AlekseiNikiforovIBM
Loading…
# Add Megrez-MoE Architecture Support ggml-org#16724
model
Model specific
#17052
opened Nov 6, 2025 by
tamarPal
Loading…
cuda: extended MMF_ROWS_PER_BLOCK
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#17051
opened Nov 6, 2025 by
zhang-hui-yulo
Loading…
fix : Dangling pointer for non-empty trigger words in lazy grammar construction
#17048
opened Nov 6, 2025 by
marek-hradil
Loading…
Add MoE dynamic routing with expert caching
build
Compilation issues
documentation
Improvements or additions to documentation
examples
#17044
opened Nov 6, 2025 by
jmangold23
•
Draft
ggml-hexagon: fix changes relating to the ggml tensor library for machine learning
test-backend-ops failures on specific binary ops
ggml
Previous Next
ProTip!
Find all pull requests that aren't related to any open issues with -linked:issue.