-
Notifications
You must be signed in to change notification settings - Fork 11.5k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
convert : improve model arch handling
python
python script changes
#13122
opened Apr 26, 2025 by
ngxson
Loading…
sycl : Implemented reorder Q4_K mmvq
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13109
opened Apr 25, 2025 by
sgeor255
Loading…
1 task
ggml-backend : add load_tensor() to backend API
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Kompute
https://github.com/KomputeProject/kompute/
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
[CANN] Simplify the environment variable setting for GGML_CANN_MEM_POOL and GGML_CANN_ASYNC_MODE
ggml
changes relating to the ggml tensor library for machine learning
#13104
opened Apr 25, 2025 by
bachelor-dou
Loading…
fix wrong template in GLM4-0414
python
python script changes
#13099
opened Apr 24, 2025 by
matteoserva
Loading…
ggml: Implement yield barrier using futex for improved thread scheduling efficiency
ggml
changes relating to the ggml tensor library for machine learning
#13079
opened Apr 23, 2025 by
SongXiaoXi
Loading…
SYCL: Add all missing unary kernels
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#13074
opened Apr 23, 2025 by
qnixsynapse
Loading…
Reduce enum sizes some are used in structs, which allowed them to be optimized.
build
Compilation issues
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#13071
opened Apr 22, 2025 by
GermanAizek
Loading…
fix(rpc): Improve input validation and error handling
ggml
changes relating to the ggml tensor library for machine learning
#13069
opened Apr 22, 2025 by
thevilledev
Loading…
Fix ChatGLMModel for glm-4-9b cannot find tokenizer merges in model file
python
python script changes
#13058
opened Apr 22, 2025 by
glide-the
Loading…
Update README.md for tts example to use afplay on MacOS
examples
#13056
opened Apr 22, 2025 by
maxxam1221
Loading…
ggml-cpu: Integrate fp32=bf16xbf16 SME KleidiAI kernel
ggml
changes relating to the ggml tensor library for machine learning
#13053
opened Apr 21, 2025 by
eddnjjn
Loading…
[CANN]Support OP MUL_MAT_ID
ggml
changes relating to the ggml tensor library for machine learning
#13042
opened Apr 21, 2025 by
noemotiovon
Loading…
gguf-py : avoid requiring PySide6 for packaged scripts
bugfix
fixes an issue or bug
devops
improvements to build systems and github actions
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
python
python script changes
#13036
opened Apr 20, 2025 by
compilade
Loading…
quantize: improve pattern matching for allowed tensors
examples
#13033
opened Apr 20, 2025 by
EAddario
Loading…
Bitnet: directly use scale instead of inverting it twice
python
python script changes
#13026
opened Apr 19, 2025 by
viraatdas
Loading…
Nix portability improvements
devops
improvements to build systems and github actions
ggml
changes relating to the ggml tensor library for machine learning
nix
Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#13005
opened Apr 18, 2025 by
hacker1024
Loading…
make memset range dynamic
ggml
changes relating to the ggml tensor library for machine learning
#13002
opened Apr 18, 2025 by
pockers21
Loading…
threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling
examples
ggml
changes relating to the ggml tensor library for machine learning
#12995
opened Apr 17, 2025 by
max-krasnyansky
Loading…
Fix convert script for non-hf GLM4 checkpoints
python
python script changes
#12992
opened Apr 17, 2025 by
Tianyue-Zhao
Loading…
2 of 4 tasks
sycl: use DNN in the first part of ggml_sycl_mul_mat_batched_sycl
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
#12972
opened Apr 16, 2025 by
lslusarczyk
•
Draft
Previous Next
ProTip!
Add no:assignee to see everything that’s not assigned.