-
Notifications
You must be signed in to change notification settings - Fork 421
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
冻结lm_head出现问题 #124
Comments
这个是否是加了纯文本数据呢 |
感谢您的回复,我发现了问题 在解决 #120 的inplace操作时,之前采用的代码是:(加入
这可能导致vit是trianable的时候完全没有梯度,现在参照您的加入 |
@ShuaiBai623 |
@Coobiw 冻结其他,仅训练视觉部分,不能用lora参数吗 |
请问一下为什么Qwen-VL冻住最后一层lm_head,然后只训练visual部分(不加lora,通过修改requires_grad实现),会报一个RuntimeError:element 0 of tensors does not require grad and does not have a grad_fn呀
The text was updated successfully, but these errors were encountered: