How can I use this ggml-hsa to enable llama.cpp running on AMD NPU？

Dear ggml_hsa team,

We noticed that your project serves as a critical backend for accelerating llama.cpp on AMD GPUs/NPUs. We'd like to ask: we want to use ggml-hsa to enable llama.cpp to run on AMD NPU. What efforts do I need to make, and could you provide some tips and assistance?

Thank you!