Problem when freezing lm_head #124

Coobiw · 2023-10-17T02:31:24Z

Why does Qwen-VL raise an error when I freeze the final lm_head layer and train only the visual part (no LoRA; implemented by changing requires_grad)? The error is: RuntimeError: element 0 of tensors does not require grad and does not have a grad_fn
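For readers unfamiliar with the setup being described, here is a minimal sketch of freezing everything except a visual submodule through requires_grad. `ToyVLModel` and `train_only_visual` are hypothetical names used only for illustration; only the attribute names `visual` and `lm_head` echo the thread, and this is not Qwen-VL's actual module layout.

```python
import torch
from torch import nn


class ToyVLModel(nn.Module):
    """Hypothetical stand-in for a vision-language model (not Qwen-VL itself)."""

    def __init__(self):
        super().__init__()
        self.visual = nn.Linear(8, 8)                # plays the role of the ViT
        self.wte = nn.Embedding(16, 8)               # text token embeddings
        self.lm_head = nn.Linear(8, 16, bias=False)  # output projection


def train_only_visual(model):
    """Freeze every parameter, then re-enable gradients for the visual part."""
    model.requires_grad_(False)
    model.visual.requires_grad_(True)
    return [name for name, p in model.named_parameters() if p.requires_grad]


model = ToyVLModel()
trainable = train_only_visual(model)
print(trainable)  # ['visual.weight', 'visual.bias']
```

With a configuration like this, the loss only has a grad_fn if the forward pass actually routes through a trainable visual parameter. If that link is missing for any reason, `backward()` fails with the RuntimeError quoted above.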

ShuaiBai623 · 2023-10-17T03:46:02Z

Did you perhaps include text-only data?

Coobiw · 2023-10-18T08:22:02Z

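Thanks for your reply. I found the problem: when working around the in-place operation from #120, the code I had been using was the following (the .data approach):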

hidden_states = self.drop(hidden_states)
if images is not None:
    for idx, (i, a, b) in enumerate(img_pos):
        hidden_states.data[i][a + 1 : b] = images.data[idx]

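This probably leaves the ViT with no gradient at all when it is trainable. After switching to the .clone() approach you suggested, it seems to work. Thank you very much.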
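The difference between the two approaches can be reproduced in isolation. The sketch below is an assumption-laden toy, not the Qwen-VL code: `text_hidden`, `visual_weight`, `image_features`, and both `splice_*` helpers are hypothetical names. The text embeddings are frozen and only the "visual" weight is trainable, which mirrors the setup in this issue.

```python
import torch

# Hypothetical stand-ins: frozen text embeddings and one trainable "visual" weight.
text_hidden = torch.zeros(1, 6, 4)                    # requires_grad=False (frozen)
visual_weight = torch.nn.Parameter(torch.ones(4, 4))  # the only trainable tensor
img_pos = [(0, 1, 4)]                                 # (batch index, start, end)


def image_features():
    # Depends on the trainable parameter, so the result carries a grad_fn.
    return torch.ones(1, 2, 4) @ visual_weight


def splice_with_data(hidden, images):
    hidden = hidden.clone()
    for idx, (i, a, b) in enumerate(img_pos):
        # Writing through .data copies values only; autograd records nothing.
        hidden.data[i][a + 1 : b] = images.data[idx]
    return hidden


def splice_with_clone(hidden, images):
    hidden = hidden.clone()
    for idx, (i, a, b) in enumerate(img_pos):
        # A tracked in-place write: the result now depends on `images`.
        hidden[i, a + 1 : b] = images[idx]
    return hidden


out_data = splice_with_data(text_hidden, image_features())
data_error = None
try:
    out_data.sum().backward()
except RuntimeError as err:
    data_error = str(err)
print(out_data.requires_grad, data_error)

out_clone = splice_with_clone(text_hidden, image_features())
out_clone.sum().backward()
print(out_clone.requires_grad, visual_weight.grad is not None)
```

In the `.data` variant the spliced tensor has no link back to the visual weight, so with everything else frozen the loss has no grad_fn and `backward()` raises the error reported in this issue. In the `.clone()` variant the in-place write is tracked and the gradient reaches `visual_weight`.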

sunjunlishi · 2024-04-03T02:03:03Z

@ShuaiBai623
    if not training_args.use_lora:
        if training_args.fix_vit and hasattr(model, 'transformer') and hasattr(model.transformer, 'visual'):
            model.transformer.visual.requires_grad_(False)
        if hasattr(model.transformer.visual, 'attn_pool'):
            model.transformer.visual.attn_pool.requires_grad_(True)
Would it be OK to remove the line 'if not training_args.use_lora:'? I only want to train the visual part, and I still want to use QLoRA.

sunjunlishi · 2024-04-03T06:15:55Z

@Coobiw When freezing everything else and training only the visual part, is it not possible to use LoRA?
    if not training_args.use_lora:
        if training_args.fix_vit and hasattr(model, 'transformer') and hasattr(model.transformer, 'visual'):
            model.transformer.visual.requires_grad_(False)
        if hasattr(model.transformer.visual, 'attn_pool'):
            model.transformer.visual.attn_pool.requires_grad_(True)
Would it be OK to remove the line 'if not training_args.use_lora:'? I only want to train the visual part, and I still want to use QLoRA.

Coobiw closed this as completed Oct 19, 2023
