Dear ggml_hsa team,
We noticed that your project serves as a critical backend for accelerating llama.cpp on AMD GPUs and NPUs. We would like to use ggml-hsa to run llama.cpp on an AMD NPU. What steps do we need to take to get this working, and could you share any tips or guidance?
Thank you!