Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

cogvideo 2b full fine-tune model conversion #633

Open
2 tasks
linwenzhao1 opened this issue Jan 2, 2025 · 1 comment
Open
2 tasks

cogvideo 2b full fine-tune model conversion #633

linwenzhao1 opened this issue Jan 2, 2025 · 1 comment
Assignees

Comments

@linwenzhao1
Copy link

System Info / 系統信息

diffusers:0.32.dev0
python: 3.11
cuda: 12.0

Information / 问题信息

  • The official example scripts / 官方的示例脚本
  • My own modified scripts / 我自己修改的脚本和任务

Reproduction / 复现过程

使用convert_weight_sat2hf.py脚本转化全参微调后的模型,报错如下:
raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for CogVideoXTransformer3DModel:
Unexpected key(s) in state_dict: "0.transformer_blocks.shared.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.relative_attn1_bias.weight", "0.transformer_blocks.encoder.block.0.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.0.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.0.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.0.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.0.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.1.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.1.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.1.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.1.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.1.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.2.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.2.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.2.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.2.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.2.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.3.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.3.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.3.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.3.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.3.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.4.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.4.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.4.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.4.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.4.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.5.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.5.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.5.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.5.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.5.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.6.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.6.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.6.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.6.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.6.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.7.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.7.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.7.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.7.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.7.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.8.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.8.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.8.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.8.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.8.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.9.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.9.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.9.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.9.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.9.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.10.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.10.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.10.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.10.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.10.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.11.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.11.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.11.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.11.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.11.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.12.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.12.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.12.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.12.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.12.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.13.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.13.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.13.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.13.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.13.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.14.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.14.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.14.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.14.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.14.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.15.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.15.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.15.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.15.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.15.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.16.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.16.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.16.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.16.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.16.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.17.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.17.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.17.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.17.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.17.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.18.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.18.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.18.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.18.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.18.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.19.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.19.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.19.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.19.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.19.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.20.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.20.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.20.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.20.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.20.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.21.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.21.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.21.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.21.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.21.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.22.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.22.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.22.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.22.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.22.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.23.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.23.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.23.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.23.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.23.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.final_layer_norm.weight", "der.conv_in.conv.weight", "der.conv_in.conv.bias", "der.down.0.block.0.norm1.weight", "der.down.0.block.0.norm1.bias", "der.down.0.block.0.conv1.conv.weight", "der.down.0.block.0.conv1.conv.bias", "der.down.0.block.0.norm2.weight", "der.down.0.block.0.norm2.bias", "der.down.0.block.0.conv2.conv.weight", "der.down.0.block.0.conv2.conv.bias", "der.down.0.block.1.norm1.weight", "der.down.0.block.1.norm1.bias", "der.down.0.block.1.conv1.conv.weight", "der.down.0.block.1.conv1.conv.bias", "der.down.0.block.1.norm2.weight", "der.down.0.block.1.norm2.bias", "der.down.0.block.1.conv2.conv.weight", "der.down.0.block.1.conv2.conv.bias", "der.down.0.block.2.norm1.weight", "der.down.0.block.2.norm1.bias", "der.down.0.block.2.conv1.conv.weight", "der.down.0.block.2.conv1.conv.bias", "der.down.0.block.2.norm2.weight", "der.down.0.block.2.norm2.bias", "der.down.0.block.2.conv2.conv.weight", "der.down.0.block.2.conv2.conv.bias", "der.down.0.downsample.conv.weight", "der.down.0.downsample.conv.bias", "der.down.1.block.0.norm1.weight", "der.down.1.block.0.norm1.bias", "der.down.1.block.0.conv1.conv.weight", "der.down.1.block.0.conv1.conv.bias", "der.down.1.block.0.norm2.weight", "der.down.1.block.0.norm2.bias", "der.down.1.block.0.conv2.conv.weight", "der.down.1.block.0.conv2.conv.bias", "der.down.1.block.0.nin_shortcut.weight", "der.down.1.block.0.nin_shortcut.bias", "der.down.1.block.1.norm1.weight", "der.down.1.block.1.norm1.bias", "der.down.1.block.1.conv1.conv.weight", "der.down.1.block.1.conv1.conv.bias", "der.down.1.block.1.norm2.weight", "der.down.1.block.1.norm2.bias", "der.down.1.block.1.conv2.conv.weight", "der.down.1.block.1.conv2.conv.bias", "der.down.1.block.2.norm1.weight", "der.down.1.block.2.norm1.bias", "der.down.1.block.2.conv1.conv.weight", "der.down.1.block.2.conv1.conv.bias", "der.down.1.block.2.norm2.weight", "der.down.1.block.2.norm2.bias", "der.down.1.block.2.conv2.conv.weight", "der.down.1.block.2.conv2.conv.bias", "der.down.1.downsample.conv.weight", "der.down.1.downsample.conv.bias", "der.down.2.block.0.norm1.weight", "der.down.2.block.0.norm1.bias", "der.down.2.block.0.conv1.conv.weight", "der.down.2.block.0.conv1.conv.bias", "der.down.2.block.0.norm2.weight", "der.down.2.block.0.norm2.bias", "der.down.2.block.0.conv2.conv.weight", "der.down.2.block.0.conv2.conv.bias", "der.down.2.block.1.norm1.weight", "der.down.2.block.1.norm1.bias", "der.down.2.block.1.conv1.conv.weight", "der.down.2.block.1.conv1.conv.bias", "der.down.2.block.1.norm2.weight", "der.down.2.block.1.norm2.bias", "der.down.2.block.1.conv2.conv.weight", "der.down.2.block.1.conv2.conv.bias", "der.down.2.block.2.norm1.weight", "der.down.2.block.2.norm1.bias", "der.down.2.block.2.conv1.conv.weight", "der.down.2.block.2.conv1.conv.bias", "der.down.2.block.2.norm2.weight", "der.down.2.block.2.norm2.bias", "der.down.2.block.2.conv2.conv.weight", "der.down.2.block.2.conv2.conv.bias", "der.down.2.downsample.conv.weight", "der.down.2.downsample.conv.bias", "der.down.3.block.0.norm1.weight", "der.down.3.block.0.norm1.bias", "der.down.3.block.0.conv1.conv.weight", "der.down.3.block.0.conv1.conv.bias", "der.down.3.block.0.norm2.weight", "der.down.3.block.0.norm2.bias", "der.down.3.block.0.conv2.conv.weight", "der.down.3.block.0.conv2.conv.bias", "der.down.3.block.0.nin_shortcut.weight", "der.down.3.block.0.nin_shortcut.bias", "der.down.3.block.1.norm1.weight", "der.down.3.block.1.norm1.bias", "der.down.3.block.1.conv1.conv.weight", "der.down.3.block.1.conv1.conv.bias", "der.down.3.block.1.norm2.weight", "der.down.3.block.1.norm2.bias", "der.down.3.block.1.conv2.conv.weight", "der.down.3.block.1.conv2.conv.bias", "der.down.3.block.2.norm1.weight", "der.down.3.block.2.norm1.bias", "der.down.3.block.2.conv1.conv.weight", "der.down.3.block.2.conv1.conv.bias", "der.down.3.block.2.norm2.weight", "der.down.3.block.2.norm2.bias", "der.down.3.block.2.conv2.conv.weight", "der.down.3.block.2.conv2.conv.bias", "der.mid.block_1.norm1.weight", "der.mid.block_1.norm1.bias", "der.mid.block_1.conv1.conv.weight", "der.mid.block_1.conv1.conv.bias", "der.mid.block_1.norm2.weight", "der.mid.block_1.norm2.bias", "der.mid.block_1.conv2.conv.weight", "der.mid.block_1.conv2.conv.bias", "der.mid.block_2.norm1.weight", "der.mid.block_2.norm1.bias", "der.mid.block_2.conv1.conv.weight", "der.mid.block_2.conv1.conv.bias", "der.mid.block_2.norm2.weight", "der.mid.block_2.norm2.bias", "der.mid.block_2.conv2.conv.weight", "der.mid.block_2.conv2.conv.bias", "der.norm_out.weight", "der.norm_out.bias", "der.conv_out.conv.weight", "der.conv_out.conv.bias", "der.mid.block_1.norm1.norm_layer.weight", "der.mid.block_1.norm1.norm_layer.bias", "der.mid.block_1.norm1.conv_y.conv.weight", "der.mid.block_1.norm1.conv_y.conv.bias", "der.mid.block_1.norm1.conv_b.conv.weight", "der.mid.block_1.norm1.conv_b.conv.bias", "der.mid.block_1.norm2.norm_layer.weight", "der.mid.block_1.norm2.norm_layer.bias", "der.mid.block_1.norm2.conv_y.conv.weight", "der.mid.block_1.norm2.conv_y.conv.bias", "der.mid.block_1.norm2.conv_b.conv.weight", "der.mid.block_1.norm2.conv_b.conv.bias", "der.mid.block_2.norm1.norm_layer.weight", "der.mid.block_2.norm1.norm_layer.bias", "der.mid.block_2.norm1.conv_y.conv.weight", "der.mid.block_2.norm1.conv_y.conv.bias", "der.mid.block_2.norm1.conv_b.conv.weight", "der.mid.block_2.norm1.conv_b.conv.bias", "der.mid.block_2.norm2.norm_layer.weight", "der.mid.block_2.norm2.normpython-BaseException
_layer.bias", "der.mid.block_2.norm2.conv_y.conv.weight", "der.mid.block_2.norm2.conv_y.conv.bias", "der.mid.block_2.norm2.conv_b.conv.weight", "der.mid.block_2.norm2.conv_b.conv.bias", "der.up.0.block.0.norm1.norm_layer.weight", "der.up.0.block.0.norm1.norm_layer.bias", "der.up.0.block.0.norm1.conv_y.conv.weight", "der.up.0.block.0.norm1.conv_y.conv.bias", "der.up.0.block.0.norm1.conv_b.conv.weight", "der.up.0.block.0.norm1.conv_b.conv.bias", "der.up.0.block.0.conv1.conv.weight", "der.up.0.block.0.conv1.conv.bias", "der.up.0.block.0.norm2.norm_layer.weight", "der.up.0.block.0.norm2.norm_layer.bias", "der.up.0.block.0.norm2.conv_y.conv.weight", "der.up.0.block.0.norm2.conv_y.conv.bias", "der.up.0.block.0.norm2.conv_b.conv.weight", "der.up.0.block.0.norm2.conv_b.conv.bias", "der.up.0.block.0.conv2.conv.weight", "der.up.0.block.0.conv2.conv.bias", "der.up.0.block.0.nin_shortcut.weight", "der.up.0.block.0.nin_shortcut.bias", "der.up.0.block.1.norm1.norm_layer.weight", "der.up.0.block.1.norm1.norm_layer.bias", "der.up.0.block.1.norm1.conv_y.conv.weight", "der.up.0.block.1.norm1.conv_y.conv.bias", "der.up.0.block.1.norm1.conv_b.conv.weight", "der.up.0.block.1.norm1.conv_b.conv.bias", "der.up.0.block.1.conv1.conv.weight", "der.up.0.block.1.conv1.conv.bias", "der.up.0.block.1.norm2.norm_layer.weight", "der.up.0.block.1.norm2.norm_layer.bias", "der.up.0.block.1.norm2.conv_y.conv.weight", "der.up.0.block.1.norm2.conv_y.conv.bias", "der.up.0.block.1.norm2.conv_b.conv.weight", "der.up.0.block.1.norm2.conv_b.conv.bias", "der.up.0.block.1.conv2.conv.weight", "der.up.0.block.1.conv2.conv.bias", "der.up.0.block.2.norm1.norm_layer.weight", "der.up.0.block.2.norm1.norm_layer.bias", "der.up.0.block.2.norm1.conv_y.conv.weight", "der.up.0.block.2.norm1.conv_y.conv.bias", "der.up.0.block.2.norm1.conv_b.conv.weight", "der.up.0.block.2.norm1.conv_b.conv.bias", "der.up.0.block.2.conv1.conv.weight", "der.up.0.block.2.conv1.conv.bias", "der.up.0.block.2.norm2.norm_layer.weight", "der.up.0.block.2.norm2.norm_layer.bias", "der.up.0.block.2.norm2.conv_y.conv.weight", "der.up.0.block.2.norm2.conv_y.conv.bias", "der.up.0.block.2.norm2.conv_b.conv.weight", "der.up.0.block.2.norm2.conv_b.conv.bias", "der.up.0.block.2.conv2.conv.weight", "der.up.0.block.2.conv2.conv.bias", "der.up.0.block.3.norm1.norm_layer.weight", "der.up.0.block.3.norm1.norm_layer.bias", "der.up.0.block.3.norm1.conv_y.conv.weight", "der.up.0.block.3.norm1.conv_y.conv.bias", "der.up.0.block.3.norm1.conv_b.conv.weight", "der.up.0.block.3.norm1.conv_b.conv.bias", "der.up.0.block.3.conv1.conv.weight", "der.up.0.block.3.conv1.conv.bias", "der.up.0.block.3.norm2.norm_layer.weight", "der.up.0.block.3.norm2.norm_layer.bias", "der.up.0.block.3.norm2.conv_y.conv.weight", "der.up.0.block.3.norm2.conv_y.conv.bias", "der.up.0.block.3.norm2.conv_b.conv.weight", "der.up.0.block.3.norm2.conv_b.conv.bias", "der.up.0.block.3.conv2.conv.weight", "der.up.0.block.3.conv2.conv.bias", "der.up.1.block.0.norm1.norm_layer.weight", "der.up.1.block.0.norm1.norm_layer.bias", "der.up.1.block.0.norm1.conv_y.conv.weight", "der.up.1.block.0.norm1.conv_y.conv.bias", "der.up.1.block.0.norm1.conv_b.conv.weight", "der.up.1.block.0.norm1.conv_b.conv.bias", "der.up.1.block.0.conv1.conv.weight", "der.up.1.block.0.conv1.conv.bias", "der.up.1.block.0.norm2.norm_layer.weight", "der.up.1.block.0.norm2.norm_layer.bias", "der.up.1.block.0.norm2.conv_y.conv.weight", "der.up.1.block.0.norm2.conv_y.conv.bias", "der.up.1.block.0.norm2.conv_b.conv.weight", "der.up.1.block.0.norm2.conv_b.conv.bias", "der.up.1.block.0.conv2.conv.weight", "der.up.1.block.0.conv2.conv.bias", "der.up.1.block.1.norm1.norm_layer.weight", "der.up.1.block.1.norm1.norm_layer.bias", "der.up.1.block.1.norm1.conv_y.conv.weight", "der.up.1.block.1.norm1.conv_y.conv.bias", "der.up.1.block.1.norm1.conv_b.conv.weight", "der.up.1.block.1.norm1.conv_b.conv.bias", "der.up.1.block.1.conv1.conv.weight", "der.up.1.block.1.conv1.conv.bias", "der.up.1.block.1.norm2.norm_layer.weight", "der.up.1.block.1.norm2.norm_layer.bias", "der.up.1.block.1.norm2.conv_y.conv.weight", "der.up.1.block.1.norm2.conv_y.conv.bias", "der.up.1.block.1.norm2.conv_b.conv.weight", "der.up.1.block.1.norm2.conv_b.conv.bias", "der.up.1.block.1.conv2.conv.weight", "der.up.1.block.1.conv2.conv.bias", "der.up.1.block.2.norm1.norm_layer.weight", "der.up.1.block.2.norm1.norm_layer.bias", "der.up.1.block.2.norm1.conv_y.conv.weight", "der.up.1.block.2.norm1.conv_y.conv.bias", "der.up.1.block.2.norm1.conv_b.conv.weight", "der.up.1.block.2.norm1.conv_b.conv.bias", "der.up.1.block.2.conv1.conv.weight", "der.up.1.block.2.conv1.conv.bias", "der.up.1.block.2.norm2.norm_layer.weight", "der.up.1.block.2.norm2.norm_layer.bias", "der.up.1.block.2.norm2.conv_y.conv.weight", "der.up.1.block.2.norm2.conv_y.conv.bias", "der.up.1.block.2.norm2.conv_b.conv.weight", "der.up.1.block.2.norm2.conv_b.conv.bias", "der.up.1.block.2.conv2.conv.weight", "der.up.1.block.2.conv2.conv.bias", "der.up.1.block.3.norm1.norm_layer.weight", "der.up.1.block.3.norm1.norm_layer.bias", "der.up.1.block.3.norm1.conv_y.conv.weight", "der.up.1.block.3.norm1.conv_y.conv.bias", "der.up.1.block.3.norm1.conv_b.conv.weight", "der.up.1.block.3.norm1.conv_b.conv.bias", "der.up.1.block.3.conv1.conv.weight", "der.up.1.block.3.conv1.conv.bias", "der.up.1.block.3.norm2.norm_layer.weight", "der.up.1.block.3.norm2.norm_layer.bias", "der.up.1.block.3.norm2.conv_y.conv.weight", "der.up.1.block.3.norm2.conv_y.conv.bias", "der.up.1.block.3.norm2.conv_b.conv.weight", "der.up.1.block.3.norm2.conv_b.conv.bias", "der.up.1.block.3.conv2.conv.weight", "der.up.1.block.3.conv2.conv.bias", "der.up.1.upsample.conv.weight", "der.up.1.upsample.conv.bias", "der.up.2.block.0.norm1.norm_layer.weight", "der.up.2.block.0.norm1.norm_layer.bias", "der.up.2.block.0.norm1.conv_y.conv.weight", "der.up.2.block.0.norm1.conv_y.conv.bias", "der.up.2.block.0.norm1.conv_b.conv.weight", "der.up.2.block.0.norm1.conv_b.conv.bias", "der.up.2.block.0.conv1.conv.weight", "der.up.2.block.0.conv1.conv.bias", "der.up.2.block.0.norm2.norm_layer.weight", "der.up.2.block.0.norm2.norm_layer.bias", "der.up.2.block.0.norm2.conv_y.conv.weight", "der.up.2.block.0.norm2.conv_y.conv.bias", "der.up.2.block.0.norm2.conv_b.conv.weight", "der.up.2.block.0.norm2.conv_b.conv.bias", "der.up.2.block.0.conv2.conv.weight", "der.up.2.block.0.conv2.conv.bias", "der.up.2.block.0.nin_shortcut.weight", "der.up.2.block.0.nin_shortcut.bias", "der.up.2.block.1.norm1.norm_layer.weight", "der.up.2.block.1.norm1.norm_layer.bias", "der.up.2.block.1.norm1.conv_y.conv.weight", "der.up.2.block.1.norm1.conv_y.conv.bias", "der.up.2.block.1.norm1.conv_b.conv.weight", "der.up.2.block.1.norm1.conv_b.conv.bias", "der.up.2.block.1.conv1.conv.weight", "der.up.2.block.1.conv1.conv.bias", "der.up.2.block.1.norm2.norm_layer.weight", "der.up.2.block.1.norm2.norm_layer.bias", "der.up.2.block.1.norm2.conv_y.conv.weight", "der.up.2.block.1.norm2.conv_y.conv.bias", "der.up.2.block.1.norm2.conv_b.conv.weight", "der.up.2.block.1.norm2.conv_b.conv.bias", "der.up.2.block.1.conv2.conv.weight", "der.up.2.block.1.conv2.conv.bias", "der.up.2.block.2.norm1.norm_layer.weight", "der.up.2.block.2.norm1.norm_layer.bias", "der.up.2.block.2.norm1.conv_y.conv.weight", "der.up.2.block.2.norm1.conv_y.conv.bias", "der.up.2.block.2.norm1.conv_b.conv.weight", "der.up.2.block.2.norm1.conv_b.conv.bias", "der.up.2.block.2.conv1.conv.weight", "der.up.2.block.2.conv1.conv.bias", "der.up.2.block.2.norm2.norm_layer.weight", "der.up.2.block.2.norm2.norm_layer.bias", "der.up.2.block.2.norm2.conv_y.conv.weight", "der.up.2.block.2.norm2.conv_y.conv.bias", "der.up.2.block.2.norm2.conv_b.conv.weight", "der.up.2.block.2.norm2.conv_b.conv.bias", "der.up.2.block.2.conv2.conv.weight", "der.up.2.block.2.conv2.conv.bias", "der.up.2.block.3.norm1.norm_layer.weight", "der.up.2.block.3.norm1.norm_layer.bias", "der.up.2.block.3.norm1.conv_y.conv.weight", "der.up.2.block.3.norm1.conv_y.conv.bias", "der.up.2.block.3.norm1.conv_b.conv.weight", "der.up.2.block.3.norm1.conv_b.conv.bias", "der.up.2.block.3.conv1.conv.weight", "der.up.2.block.3.conv1.conv.bias", "der.up.2.block.3.norm2.norm_layer.weight", "der.up.2.block.3.norm2.norm_layer.bias", "der.up.2.block.3.norm2.conv_y.conv.weight", "der.up.2.block.3.norm2.conv_y.conv.bias", "der.up.2.block.3.norm2.conv_b.conv.weight", "der.up.2.block.3.norm2.conv_b.conv.bias", "der.up.2.block.3.conv2.conv.weight", "der.up.2.block.3.conv2.conv.bias", "der.up.2.upsample.conv.weight", "der.up.2.upsample.conv.bias", "der.up.3.block.0.norm1.norm_layer.weight", "der.up.3.block.0.norm1.norm_layer.bias", "der.up.3.block.0.norm1.conv_y.conv.weight", "der.up.3.block.0.norm1.conv_y.conv.bias", "der.up.3.block.0.norm1.conv_b.conv.weight", "der.up.3.block.0.norm1.conv_b.conv.bias", "der.up.3.block.0.conv1.conv.weight", "der.up.3.block.0.conv1.conv.bias", "der.up.3.block.0.norm2.norm_layer.weight", "der.up.3.block.0.norm2.norm_layer.bias", "der.up.3.block.0.norm2.conv_y.conv.weight", "der.up.3.block.0.norm2.conv_y.conv.bias", "der.up.3.block.0.norm2.conv_b.conv.weight", "der.up.3.block.0.norm2.conv_b.conv.bias", "der.up.3.block.0.conv2.conv.weight", "der.up.3.block.0.conv2.conv.bias", "der.up.3.block.1.norm1.norm_layer.weight", "der.up.3.block.1.norm1.norm_layer.bias", "der.up.3.block.1.norm1.conv_y.conv.weight", "der.up.3.block.1.norm1.conv_y.conv.bias", "der.up.3.block.1.norm1.conv_b.conv.weight", "der.up.3.block.1.norm1.conv_b.conv.bias", "der.up.3.block.1.conv1.conv.weight", "der.up.3.block.1.conv1.conv.bias", "der.up.3.block.1.norm2.norm_layer.weight", "der.up.3.block.1.norm2.norm_layer.bias", "der.up.3.block.1.norm2.conv_y.conv.weight", "der.up.3.block.1.norm2.conv_y.conv.bias", "der.up.3.block.1.norm2.conv_b.conv.weight", "der.up.3.block.1.norm2.conv_b.conv.bias", "der.up.3.block.1.conv2.conv.weight", "der.up.3.block.1.conv2.conv.bias", "der.up.3.block.2.norm1.norm_layer.weight", "der.up.3.block.2.norm1.norm_layer.bias", "der.up.3.block.2.norm1.conv_y.conv.weight", "der.up.3.block.2.norm1.conv_y.conv.bias", "der.up.3.block.2.norm1.conv_b.conv.weight", "der.up.3.block.2.norm1.conv_b.conv.bias", "der.up.3.block.2.conv1.conv.weight", "der.up.3.block.2.conv1.conv.bias", "der.up.3.block.2.norm2.norm_layer.weight", "der.up.3.block.2.norm2.norm_layer.bias", "der.up.3.block.2.norm2.conv_y.conv.weight", "der.up.3.block.2.norm2.conv_y.conv.bias", "der.up.3.block.2.norm2.conv_b.conv.weight", "der.up.3.block.2.norm2.conv_b.conv.bias", "der.up.3.block.2.conv2.conv.weight", "der.up.3.block.2.conv2.conv.bias", "der.up.3.block.3.norm1.norm_layer.weight", "der.up.3.block.3.norm1.norm_layer.bias", "der.up.3.block.3.norm1.conv_y.conv.weight", "der.up.3.block.3.norm1.conv_y.conv.bias", "der.up.3.block.3.norm1.conv_b.conv.weight", "der.up.3.block.3.norm1.conv_b.conv.bias", "der.up.3.block.3.conv1.conv.weight", "der.up.3.block.3.conv1.conv.bias", "der.up.3.block.3.norm2.norm_layer.weight", "der.up.3.block.3.norm2.norm_layer.bias", "der.up.3.block.3.norm2.conv_y.conv.weight", "der.up.3.block.3.norm2.conv_y.conv.bias", "der.up.3.block.3.norm2.conv_b.conv.weight", "der.up.3.block.3.norm2.conv_b.conv.bias", "der.up.3.block.3.conv2.conv.weight", "der.up.3.block.3.conv2.conv.bias", "der.up.3.upsample.conv.weight", "der.up.3.upsample.conv.bias", "der.norm_out.norm_layer.weight", "der.norm_out.norm_layer.bias", "der.norm_out.conv_y.conv.weight", "der.norm_out.conv_y.conv.bias", "der.norm_out.conv_b.conv.weight", "der.norm_out.conv_b.conv.bias", "patch_embed.pos_embedding".

Expected behavior / 期待表现

正常转模型

@zRzRzRzRzRzRzR zRzRzRzRzRzRzR self-assigned this Jan 3, 2025
@zRzRzRzRzRzRzR
Copy link
Member

请使用 release 包 CogVieoX1.0版本中的转换代码转换

@linwenzhao1 linwenzhao1 changed the title cogvideo 2b 全参微调模型转diffusers失败 cogvideo 2b full fine-tune model conversion Jan 3, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants