Releases: wangqi/llama.cpp
b3756
Add stdbool.h to examples/llava/clip.h
b3755
ggml : ggml_type_name returns "NONE" for invalid values (#9458)
When running on Windows, the quantization utility attempted to print type names for entries that are not set, which led to a crash.
b2886
script : sync ggml-rpc
b2830
CUDA: generalize FP16 fattn vec kernel (#7061)
* CUDA: generalize FP16 fattn vec kernel
* disable unsupported head sizes for AMD in test
* try AMD fix
* fix batch size 2-8
* partially revert changes