How do I convert the llama model to ONNX in segments? #26
Comments
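Same question! I couldn't find it either. I've hit a lot of pitfalls trying to convert llama to ONNX myself and still haven't succeeded, so I'm curious how it was done T_T
I figured it out: he modified the transformers source code himself and calls torch.onnx.export from inside the transformers source.
Where is the source code he modified?
Yeah, same question.
https://github.com/huggingface/transformers/blob/main/src/transformers/models/llama/modeling_llama.py Make the changes here. Everything ultimately calls this class.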
I looked through the README. Is the code that converts llama to ONNX in tool/export-onnx.py? I couldn't find it.