We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
diffusers:0.32.dev0 python: 3.11 cuda: 12.0
使用convert_weight_sat2hf.py脚本转化全参微调后的模型,报错如下: raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format( RuntimeError: Error(s) in loading state_dict for CogVideoXTransformer3DModel: Unexpected key(s) in state_dict: "0.transformer_blocks.shared.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.relative_attn1_bias.weight", "0.transformer_blocks.encoder.block.0.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.0.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.0.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.0.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.0.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.1.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.1.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.1.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.1.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.1.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.2.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.2.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.2.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.2.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.2.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.3.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.3.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.3.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.3.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.3.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.4.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.4.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.4.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.4.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.4.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.5.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.5.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.5.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.5.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.5.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.6.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.6.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.6.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.6.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.6.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.7.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.7.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.7.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.7.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.7.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.8.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.8.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.8.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.8.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.8.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.9.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.9.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.9.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.9.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.9.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.10.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.10.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.10.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.10.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.10.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.11.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.11.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.11.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.11.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.11.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.12.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.12.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.12.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.12.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.12.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.13.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.13.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.13.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.13.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.13.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.14.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.14.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.14.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.14.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.14.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.15.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.15.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.15.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.15.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.15.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.16.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.16.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.16.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.16.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.16.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.17.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.17.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.17.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.17.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.17.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.18.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.18.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.18.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.18.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.18.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.19.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.19.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.19.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.19.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.19.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.20.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.20.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.20.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.20.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.20.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.21.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.21.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.21.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.21.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.21.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.22.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.22.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.22.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.22.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.22.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.23.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.23.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.23.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.23.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.23.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.final_layer_norm.weight", "der.conv_in.conv.weight", "der.conv_in.conv.bias", "der.down.0.block.0.norm1.weight", "der.down.0.block.0.norm1.bias", "der.down.0.block.0.conv1.conv.weight", "der.down.0.block.0.conv1.conv.bias", "der.down.0.block.0.norm2.weight", "der.down.0.block.0.norm2.bias", "der.down.0.block.0.conv2.conv.weight", "der.down.0.block.0.conv2.conv.bias", "der.down.0.block.1.norm1.weight", "der.down.0.block.1.norm1.bias", "der.down.0.block.1.conv1.conv.weight", "der.down.0.block.1.conv1.conv.bias", "der.down.0.block.1.norm2.weight", "der.down.0.block.1.norm2.bias", "der.down.0.block.1.conv2.conv.weight", "der.down.0.block.1.conv2.conv.bias", "der.down.0.block.2.norm1.weight", "der.down.0.block.2.norm1.bias", "der.down.0.block.2.conv1.conv.weight", "der.down.0.block.2.conv1.conv.bias", "der.down.0.block.2.norm2.weight", "der.down.0.block.2.norm2.bias", "der.down.0.block.2.conv2.conv.weight", "der.down.0.block.2.conv2.conv.bias", "der.down.0.downsample.conv.weight", "der.down.0.downsample.conv.bias", "der.down.1.block.0.norm1.weight", "der.down.1.block.0.norm1.bias", "der.down.1.block.0.conv1.conv.weight", "der.down.1.block.0.conv1.conv.bias", "der.down.1.block.0.norm2.weight", "der.down.1.block.0.norm2.bias", "der.down.1.block.0.conv2.conv.weight", "der.down.1.block.0.conv2.conv.bias", "der.down.1.block.0.nin_shortcut.weight", "der.down.1.block.0.nin_shortcut.bias", "der.down.1.block.1.norm1.weight", "der.down.1.block.1.norm1.bias", "der.down.1.block.1.conv1.conv.weight", "der.down.1.block.1.conv1.conv.bias", "der.down.1.block.1.norm2.weight", "der.down.1.block.1.norm2.bias", "der.down.1.block.1.conv2.conv.weight", "der.down.1.block.1.conv2.conv.bias", "der.down.1.block.2.norm1.weight", "der.down.1.block.2.norm1.bias", "der.down.1.block.2.conv1.conv.weight", "der.down.1.block.2.conv1.conv.bias", "der.down.1.block.2.norm2.weight", "der.down.1.block.2.norm2.bias", "der.down.1.block.2.conv2.conv.weight", "der.down.1.block.2.conv2.conv.bias", "der.down.1.downsample.conv.weight", "der.down.1.downsample.conv.bias", "der.down.2.block.0.norm1.weight", "der.down.2.block.0.norm1.bias", "der.down.2.block.0.conv1.conv.weight", "der.down.2.block.0.conv1.conv.bias", "der.down.2.block.0.norm2.weight", "der.down.2.block.0.norm2.bias", "der.down.2.block.0.conv2.conv.weight", "der.down.2.block.0.conv2.conv.bias", "der.down.2.block.1.norm1.weight", "der.down.2.block.1.norm1.bias", "der.down.2.block.1.conv1.conv.weight", "der.down.2.block.1.conv1.conv.bias", "der.down.2.block.1.norm2.weight", "der.down.2.block.1.norm2.bias", "der.down.2.block.1.conv2.conv.weight", "der.down.2.block.1.conv2.conv.bias", "der.down.2.block.2.norm1.weight", "der.down.2.block.2.norm1.bias", "der.down.2.block.2.conv1.conv.weight", "der.down.2.block.2.conv1.conv.bias", "der.down.2.block.2.norm2.weight", "der.down.2.block.2.norm2.bias", "der.down.2.block.2.conv2.conv.weight", "der.down.2.block.2.conv2.conv.bias", "der.down.2.downsample.conv.weight", "der.down.2.downsample.conv.bias", "der.down.3.block.0.norm1.weight", "der.down.3.block.0.norm1.bias", "der.down.3.block.0.conv1.conv.weight", "der.down.3.block.0.conv1.conv.bias", "der.down.3.block.0.norm2.weight", "der.down.3.block.0.norm2.bias", "der.down.3.block.0.conv2.conv.weight", "der.down.3.block.0.conv2.conv.bias", "der.down.3.block.0.nin_shortcut.weight", "der.down.3.block.0.nin_shortcut.bias", "der.down.3.block.1.norm1.weight", "der.down.3.block.1.norm1.bias", "der.down.3.block.1.conv1.conv.weight", "der.down.3.block.1.conv1.conv.bias", "der.down.3.block.1.norm2.weight", "der.down.3.block.1.norm2.bias", "der.down.3.block.1.conv2.conv.weight", "der.down.3.block.1.conv2.conv.bias", "der.down.3.block.2.norm1.weight", "der.down.3.block.2.norm1.bias", "der.down.3.block.2.conv1.conv.weight", "der.down.3.block.2.conv1.conv.bias", "der.down.3.block.2.norm2.weight", "der.down.3.block.2.norm2.bias", "der.down.3.block.2.conv2.conv.weight", "der.down.3.block.2.conv2.conv.bias", "der.mid.block_1.norm1.weight", "der.mid.block_1.norm1.bias", "der.mid.block_1.conv1.conv.weight", "der.mid.block_1.conv1.conv.bias", "der.mid.block_1.norm2.weight", "der.mid.block_1.norm2.bias", "der.mid.block_1.conv2.conv.weight", "der.mid.block_1.conv2.conv.bias", "der.mid.block_2.norm1.weight", "der.mid.block_2.norm1.bias", "der.mid.block_2.conv1.conv.weight", "der.mid.block_2.conv1.conv.bias", "der.mid.block_2.norm2.weight", "der.mid.block_2.norm2.bias", "der.mid.block_2.conv2.conv.weight", "der.mid.block_2.conv2.conv.bias", "der.norm_out.weight", "der.norm_out.bias", "der.conv_out.conv.weight", "der.conv_out.conv.bias", "der.mid.block_1.norm1.norm_layer.weight", "der.mid.block_1.norm1.norm_layer.bias", "der.mid.block_1.norm1.conv_y.conv.weight", "der.mid.block_1.norm1.conv_y.conv.bias", "der.mid.block_1.norm1.conv_b.conv.weight", "der.mid.block_1.norm1.conv_b.conv.bias", "der.mid.block_1.norm2.norm_layer.weight", "der.mid.block_1.norm2.norm_layer.bias", "der.mid.block_1.norm2.conv_y.conv.weight", "der.mid.block_1.norm2.conv_y.conv.bias", "der.mid.block_1.norm2.conv_b.conv.weight", "der.mid.block_1.norm2.conv_b.conv.bias", "der.mid.block_2.norm1.norm_layer.weight", "der.mid.block_2.norm1.norm_layer.bias", "der.mid.block_2.norm1.conv_y.conv.weight", "der.mid.block_2.norm1.conv_y.conv.bias", "der.mid.block_2.norm1.conv_b.conv.weight", "der.mid.block_2.norm1.conv_b.conv.bias", "der.mid.block_2.norm2.norm_layer.weight", "der.mid.block_2.norm2.normpython-BaseException _layer.bias", "der.mid.block_2.norm2.conv_y.conv.weight", "der.mid.block_2.norm2.conv_y.conv.bias", "der.mid.block_2.norm2.conv_b.conv.weight", "der.mid.block_2.norm2.conv_b.conv.bias", "der.up.0.block.0.norm1.norm_layer.weight", "der.up.0.block.0.norm1.norm_layer.bias", "der.up.0.block.0.norm1.conv_y.conv.weight", "der.up.0.block.0.norm1.conv_y.conv.bias", "der.up.0.block.0.norm1.conv_b.conv.weight", "der.up.0.block.0.norm1.conv_b.conv.bias", "der.up.0.block.0.conv1.conv.weight", "der.up.0.block.0.conv1.conv.bias", "der.up.0.block.0.norm2.norm_layer.weight", "der.up.0.block.0.norm2.norm_layer.bias", "der.up.0.block.0.norm2.conv_y.conv.weight", "der.up.0.block.0.norm2.conv_y.conv.bias", "der.up.0.block.0.norm2.conv_b.conv.weight", "der.up.0.block.0.norm2.conv_b.conv.bias", "der.up.0.block.0.conv2.conv.weight", "der.up.0.block.0.conv2.conv.bias", "der.up.0.block.0.nin_shortcut.weight", "der.up.0.block.0.nin_shortcut.bias", "der.up.0.block.1.norm1.norm_layer.weight", "der.up.0.block.1.norm1.norm_layer.bias", "der.up.0.block.1.norm1.conv_y.conv.weight", "der.up.0.block.1.norm1.conv_y.conv.bias", "der.up.0.block.1.norm1.conv_b.conv.weight", "der.up.0.block.1.norm1.conv_b.conv.bias", "der.up.0.block.1.conv1.conv.weight", "der.up.0.block.1.conv1.conv.bias", "der.up.0.block.1.norm2.norm_layer.weight", "der.up.0.block.1.norm2.norm_layer.bias", "der.up.0.block.1.norm2.conv_y.conv.weight", "der.up.0.block.1.norm2.conv_y.conv.bias", "der.up.0.block.1.norm2.conv_b.conv.weight", "der.up.0.block.1.norm2.conv_b.conv.bias", "der.up.0.block.1.conv2.conv.weight", "der.up.0.block.1.conv2.conv.bias", "der.up.0.block.2.norm1.norm_layer.weight", "der.up.0.block.2.norm1.norm_layer.bias", "der.up.0.block.2.norm1.conv_y.conv.weight", "der.up.0.block.2.norm1.conv_y.conv.bias", "der.up.0.block.2.norm1.conv_b.conv.weight", "der.up.0.block.2.norm1.conv_b.conv.bias", "der.up.0.block.2.conv1.conv.weight", "der.up.0.block.2.conv1.conv.bias", "der.up.0.block.2.norm2.norm_layer.weight", "der.up.0.block.2.norm2.norm_layer.bias", "der.up.0.block.2.norm2.conv_y.conv.weight", "der.up.0.block.2.norm2.conv_y.conv.bias", "der.up.0.block.2.norm2.conv_b.conv.weight", "der.up.0.block.2.norm2.conv_b.conv.bias", "der.up.0.block.2.conv2.conv.weight", "der.up.0.block.2.conv2.conv.bias", "der.up.0.block.3.norm1.norm_layer.weight", "der.up.0.block.3.norm1.norm_layer.bias", "der.up.0.block.3.norm1.conv_y.conv.weight", "der.up.0.block.3.norm1.conv_y.conv.bias", "der.up.0.block.3.norm1.conv_b.conv.weight", "der.up.0.block.3.norm1.conv_b.conv.bias", "der.up.0.block.3.conv1.conv.weight", "der.up.0.block.3.conv1.conv.bias", "der.up.0.block.3.norm2.norm_layer.weight", "der.up.0.block.3.norm2.norm_layer.bias", "der.up.0.block.3.norm2.conv_y.conv.weight", "der.up.0.block.3.norm2.conv_y.conv.bias", "der.up.0.block.3.norm2.conv_b.conv.weight", "der.up.0.block.3.norm2.conv_b.conv.bias", "der.up.0.block.3.conv2.conv.weight", "der.up.0.block.3.conv2.conv.bias", "der.up.1.block.0.norm1.norm_layer.weight", "der.up.1.block.0.norm1.norm_layer.bias", "der.up.1.block.0.norm1.conv_y.conv.weight", "der.up.1.block.0.norm1.conv_y.conv.bias", "der.up.1.block.0.norm1.conv_b.conv.weight", "der.up.1.block.0.norm1.conv_b.conv.bias", "der.up.1.block.0.conv1.conv.weight", "der.up.1.block.0.conv1.conv.bias", "der.up.1.block.0.norm2.norm_layer.weight", "der.up.1.block.0.norm2.norm_layer.bias", "der.up.1.block.0.norm2.conv_y.conv.weight", "der.up.1.block.0.norm2.conv_y.conv.bias", "der.up.1.block.0.norm2.conv_b.conv.weight", "der.up.1.block.0.norm2.conv_b.conv.bias", "der.up.1.block.0.conv2.conv.weight", "der.up.1.block.0.conv2.conv.bias", "der.up.1.block.1.norm1.norm_layer.weight", "der.up.1.block.1.norm1.norm_layer.bias", "der.up.1.block.1.norm1.conv_y.conv.weight", "der.up.1.block.1.norm1.conv_y.conv.bias", "der.up.1.block.1.norm1.conv_b.conv.weight", "der.up.1.block.1.norm1.conv_b.conv.bias", "der.up.1.block.1.conv1.conv.weight", "der.up.1.block.1.conv1.conv.bias", "der.up.1.block.1.norm2.norm_layer.weight", "der.up.1.block.1.norm2.norm_layer.bias", "der.up.1.block.1.norm2.conv_y.conv.weight", "der.up.1.block.1.norm2.conv_y.conv.bias", "der.up.1.block.1.norm2.conv_b.conv.weight", "der.up.1.block.1.norm2.conv_b.conv.bias", "der.up.1.block.1.conv2.conv.weight", "der.up.1.block.1.conv2.conv.bias", "der.up.1.block.2.norm1.norm_layer.weight", "der.up.1.block.2.norm1.norm_layer.bias", "der.up.1.block.2.norm1.conv_y.conv.weight", "der.up.1.block.2.norm1.conv_y.conv.bias", "der.up.1.block.2.norm1.conv_b.conv.weight", "der.up.1.block.2.norm1.conv_b.conv.bias", "der.up.1.block.2.conv1.conv.weight", "der.up.1.block.2.conv1.conv.bias", "der.up.1.block.2.norm2.norm_layer.weight", "der.up.1.block.2.norm2.norm_layer.bias", "der.up.1.block.2.norm2.conv_y.conv.weight", "der.up.1.block.2.norm2.conv_y.conv.bias", "der.up.1.block.2.norm2.conv_b.conv.weight", "der.up.1.block.2.norm2.conv_b.conv.bias", "der.up.1.block.2.conv2.conv.weight", "der.up.1.block.2.conv2.conv.bias", "der.up.1.block.3.norm1.norm_layer.weight", "der.up.1.block.3.norm1.norm_layer.bias", "der.up.1.block.3.norm1.conv_y.conv.weight", "der.up.1.block.3.norm1.conv_y.conv.bias", "der.up.1.block.3.norm1.conv_b.conv.weight", "der.up.1.block.3.norm1.conv_b.conv.bias", "der.up.1.block.3.conv1.conv.weight", "der.up.1.block.3.conv1.conv.bias", "der.up.1.block.3.norm2.norm_layer.weight", "der.up.1.block.3.norm2.norm_layer.bias", "der.up.1.block.3.norm2.conv_y.conv.weight", "der.up.1.block.3.norm2.conv_y.conv.bias", "der.up.1.block.3.norm2.conv_b.conv.weight", "der.up.1.block.3.norm2.conv_b.conv.bias", "der.up.1.block.3.conv2.conv.weight", "der.up.1.block.3.conv2.conv.bias", "der.up.1.upsample.conv.weight", "der.up.1.upsample.conv.bias", "der.up.2.block.0.norm1.norm_layer.weight", "der.up.2.block.0.norm1.norm_layer.bias", "der.up.2.block.0.norm1.conv_y.conv.weight", "der.up.2.block.0.norm1.conv_y.conv.bias", "der.up.2.block.0.norm1.conv_b.conv.weight", "der.up.2.block.0.norm1.conv_b.conv.bias", "der.up.2.block.0.conv1.conv.weight", "der.up.2.block.0.conv1.conv.bias", "der.up.2.block.0.norm2.norm_layer.weight", "der.up.2.block.0.norm2.norm_layer.bias", "der.up.2.block.0.norm2.conv_y.conv.weight", "der.up.2.block.0.norm2.conv_y.conv.bias", "der.up.2.block.0.norm2.conv_b.conv.weight", "der.up.2.block.0.norm2.conv_b.conv.bias", "der.up.2.block.0.conv2.conv.weight", "der.up.2.block.0.conv2.conv.bias", "der.up.2.block.0.nin_shortcut.weight", "der.up.2.block.0.nin_shortcut.bias", "der.up.2.block.1.norm1.norm_layer.weight", "der.up.2.block.1.norm1.norm_layer.bias", "der.up.2.block.1.norm1.conv_y.conv.weight", "der.up.2.block.1.norm1.conv_y.conv.bias", "der.up.2.block.1.norm1.conv_b.conv.weight", "der.up.2.block.1.norm1.conv_b.conv.bias", "der.up.2.block.1.conv1.conv.weight", "der.up.2.block.1.conv1.conv.bias", "der.up.2.block.1.norm2.norm_layer.weight", "der.up.2.block.1.norm2.norm_layer.bias", "der.up.2.block.1.norm2.conv_y.conv.weight", "der.up.2.block.1.norm2.conv_y.conv.bias", "der.up.2.block.1.norm2.conv_b.conv.weight", "der.up.2.block.1.norm2.conv_b.conv.bias", "der.up.2.block.1.conv2.conv.weight", "der.up.2.block.1.conv2.conv.bias", "der.up.2.block.2.norm1.norm_layer.weight", "der.up.2.block.2.norm1.norm_layer.bias", "der.up.2.block.2.norm1.conv_y.conv.weight", "der.up.2.block.2.norm1.conv_y.conv.bias", "der.up.2.block.2.norm1.conv_b.conv.weight", "der.up.2.block.2.norm1.conv_b.conv.bias", "der.up.2.block.2.conv1.conv.weight", "der.up.2.block.2.conv1.conv.bias", "der.up.2.block.2.norm2.norm_layer.weight", "der.up.2.block.2.norm2.norm_layer.bias", "der.up.2.block.2.norm2.conv_y.conv.weight", "der.up.2.block.2.norm2.conv_y.conv.bias", "der.up.2.block.2.norm2.conv_b.conv.weight", "der.up.2.block.2.norm2.conv_b.conv.bias", "der.up.2.block.2.conv2.conv.weight", "der.up.2.block.2.conv2.conv.bias", "der.up.2.block.3.norm1.norm_layer.weight", "der.up.2.block.3.norm1.norm_layer.bias", "der.up.2.block.3.norm1.conv_y.conv.weight", "der.up.2.block.3.norm1.conv_y.conv.bias", "der.up.2.block.3.norm1.conv_b.conv.weight", "der.up.2.block.3.norm1.conv_b.conv.bias", "der.up.2.block.3.conv1.conv.weight", "der.up.2.block.3.conv1.conv.bias", "der.up.2.block.3.norm2.norm_layer.weight", "der.up.2.block.3.norm2.norm_layer.bias", "der.up.2.block.3.norm2.conv_y.conv.weight", "der.up.2.block.3.norm2.conv_y.conv.bias", "der.up.2.block.3.norm2.conv_b.conv.weight", "der.up.2.block.3.norm2.conv_b.conv.bias", "der.up.2.block.3.conv2.conv.weight", "der.up.2.block.3.conv2.conv.bias", "der.up.2.upsample.conv.weight", "der.up.2.upsample.conv.bias", "der.up.3.block.0.norm1.norm_layer.weight", "der.up.3.block.0.norm1.norm_layer.bias", "der.up.3.block.0.norm1.conv_y.conv.weight", "der.up.3.block.0.norm1.conv_y.conv.bias", "der.up.3.block.0.norm1.conv_b.conv.weight", "der.up.3.block.0.norm1.conv_b.conv.bias", "der.up.3.block.0.conv1.conv.weight", "der.up.3.block.0.conv1.conv.bias", "der.up.3.block.0.norm2.norm_layer.weight", "der.up.3.block.0.norm2.norm_layer.bias", "der.up.3.block.0.norm2.conv_y.conv.weight", "der.up.3.block.0.norm2.conv_y.conv.bias", "der.up.3.block.0.norm2.conv_b.conv.weight", "der.up.3.block.0.norm2.conv_b.conv.bias", "der.up.3.block.0.conv2.conv.weight", "der.up.3.block.0.conv2.conv.bias", "der.up.3.block.1.norm1.norm_layer.weight", "der.up.3.block.1.norm1.norm_layer.bias", "der.up.3.block.1.norm1.conv_y.conv.weight", "der.up.3.block.1.norm1.conv_y.conv.bias", "der.up.3.block.1.norm1.conv_b.conv.weight", "der.up.3.block.1.norm1.conv_b.conv.bias", "der.up.3.block.1.conv1.conv.weight", "der.up.3.block.1.conv1.conv.bias", "der.up.3.block.1.norm2.norm_layer.weight", "der.up.3.block.1.norm2.norm_layer.bias", "der.up.3.block.1.norm2.conv_y.conv.weight", "der.up.3.block.1.norm2.conv_y.conv.bias", "der.up.3.block.1.norm2.conv_b.conv.weight", "der.up.3.block.1.norm2.conv_b.conv.bias", "der.up.3.block.1.conv2.conv.weight", "der.up.3.block.1.conv2.conv.bias", "der.up.3.block.2.norm1.norm_layer.weight", "der.up.3.block.2.norm1.norm_layer.bias", "der.up.3.block.2.norm1.conv_y.conv.weight", "der.up.3.block.2.norm1.conv_y.conv.bias", "der.up.3.block.2.norm1.conv_b.conv.weight", "der.up.3.block.2.norm1.conv_b.conv.bias", "der.up.3.block.2.conv1.conv.weight", "der.up.3.block.2.conv1.conv.bias", "der.up.3.block.2.norm2.norm_layer.weight", "der.up.3.block.2.norm2.norm_layer.bias", "der.up.3.block.2.norm2.conv_y.conv.weight", "der.up.3.block.2.norm2.conv_y.conv.bias", "der.up.3.block.2.norm2.conv_b.conv.weight", "der.up.3.block.2.norm2.conv_b.conv.bias", "der.up.3.block.2.conv2.conv.weight", "der.up.3.block.2.conv2.conv.bias", "der.up.3.block.3.norm1.norm_layer.weight", "der.up.3.block.3.norm1.norm_layer.bias", "der.up.3.block.3.norm1.conv_y.conv.weight", "der.up.3.block.3.norm1.conv_y.conv.bias", "der.up.3.block.3.norm1.conv_b.conv.weight", "der.up.3.block.3.norm1.conv_b.conv.bias", "der.up.3.block.3.conv1.conv.weight", "der.up.3.block.3.conv1.conv.bias", "der.up.3.block.3.norm2.norm_layer.weight", "der.up.3.block.3.norm2.norm_layer.bias", "der.up.3.block.3.norm2.conv_y.conv.weight", "der.up.3.block.3.norm2.conv_y.conv.bias", "der.up.3.block.3.norm2.conv_b.conv.weight", "der.up.3.block.3.norm2.conv_b.conv.bias", "der.up.3.block.3.conv2.conv.weight", "der.up.3.block.3.conv2.conv.bias", "der.up.3.upsample.conv.weight", "der.up.3.upsample.conv.bias", "der.norm_out.norm_layer.weight", "der.norm_out.norm_layer.bias", "der.norm_out.conv_y.conv.weight", "der.norm_out.conv_y.conv.bias", "der.norm_out.conv_b.conv.weight", "der.norm_out.conv_b.conv.bias", "patch_embed.pos_embedding".
正常转模型
The text was updated successfully, but these errors were encountered:
请使用 release 包 CogVieoX1.0版本中的转换代码转换
Sorry, something went wrong.
zRzRzRzRzRzRzR
No branches or pull requests
System Info / 系統信息
diffusers:0.32.dev0
python: 3.11
cuda: 12.0
Information / 问题信息
Reproduction / 复现过程
使用convert_weight_sat2hf.py脚本转化全参微调后的模型,报错如下:
raise RuntimeError('Error(s) in loading state_dict for {}:\n\t{}'.format(
RuntimeError: Error(s) in loading state_dict for CogVideoXTransformer3DModel:
Unexpected key(s) in state_dict: "0.transformer_blocks.shared.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.0.layer.0.SelfAttention.relative_attn1_bias.weight", "0.transformer_blocks.encoder.block.0.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.0.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.0.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.0.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.0.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.1.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.1.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.1.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.1.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.1.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.1.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.2.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.2.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.2.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.2.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.2.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.2.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.3.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.3.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.3.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.3.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.3.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.3.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.4.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.4.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.4.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.4.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.4.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.4.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.5.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.5.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.5.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.5.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.5.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.5.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.6.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.6.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.6.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.6.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.6.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.6.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.7.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.7.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.7.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.7.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.7.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.7.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.8.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.8.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.8.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.8.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.8.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.8.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.9.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.9.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.9.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.9.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.9.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.9.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.10.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.10.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.10.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.10.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.10.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.10.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.11.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.11.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.11.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.11.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.11.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.11.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.12.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.12.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.12.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.12.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.12.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.12.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.13.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.13.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.13.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.13.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.13.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.13.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.14.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.14.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.14.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.14.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.14.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.14.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.15.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.15.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.15.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.15.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.15.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.15.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.16.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.16.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.16.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.16.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.16.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.16.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.17.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.17.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.17.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.17.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.17.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.17.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.18.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.18.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.18.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.18.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.18.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.18.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.19.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.19.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.19.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.19.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.19.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.19.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.20.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.20.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.20.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.20.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.20.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.20.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.21.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.21.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.21.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.21.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.21.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.21.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.22.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.22.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.22.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.22.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.22.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.22.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.q.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.k.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.v.weight", "0.transformer_blocks.encoder.block.23.layer.0.SelfAttention.o.weight", "0.transformer_blocks.encoder.block.23.layer.0.layer_norm.weight", "0.transformer_blocks.encoder.block.23.layer.1.DenseReluDense.wi_0.weight", "0.transformer_blocks.encoder.block.23.layer.1.DenseReluDense.wi_1.weight", "0.transformer_blocks.encoder.block.23.layer.1.DenseReluDense.wo.weight", "0.transformer_blocks.encoder.block.23.layer.1.layer_norm.weight", "0.transformer_blocks.encoder.final_layer_norm.weight", "der.conv_in.conv.weight", "der.conv_in.conv.bias", "der.down.0.block.0.norm1.weight", "der.down.0.block.0.norm1.bias", "der.down.0.block.0.conv1.conv.weight", "der.down.0.block.0.conv1.conv.bias", "der.down.0.block.0.norm2.weight", "der.down.0.block.0.norm2.bias", "der.down.0.block.0.conv2.conv.weight", "der.down.0.block.0.conv2.conv.bias", "der.down.0.block.1.norm1.weight", "der.down.0.block.1.norm1.bias", "der.down.0.block.1.conv1.conv.weight", "der.down.0.block.1.conv1.conv.bias", "der.down.0.block.1.norm2.weight", "der.down.0.block.1.norm2.bias", "der.down.0.block.1.conv2.conv.weight", "der.down.0.block.1.conv2.conv.bias", "der.down.0.block.2.norm1.weight", "der.down.0.block.2.norm1.bias", "der.down.0.block.2.conv1.conv.weight", "der.down.0.block.2.conv1.conv.bias", "der.down.0.block.2.norm2.weight", "der.down.0.block.2.norm2.bias", "der.down.0.block.2.conv2.conv.weight", "der.down.0.block.2.conv2.conv.bias", "der.down.0.downsample.conv.weight", "der.down.0.downsample.conv.bias", "der.down.1.block.0.norm1.weight", "der.down.1.block.0.norm1.bias", "der.down.1.block.0.conv1.conv.weight", "der.down.1.block.0.conv1.conv.bias", "der.down.1.block.0.norm2.weight", "der.down.1.block.0.norm2.bias", "der.down.1.block.0.conv2.conv.weight", "der.down.1.block.0.conv2.conv.bias", "der.down.1.block.0.nin_shortcut.weight", "der.down.1.block.0.nin_shortcut.bias", "der.down.1.block.1.norm1.weight", "der.down.1.block.1.norm1.bias", "der.down.1.block.1.conv1.conv.weight", "der.down.1.block.1.conv1.conv.bias", "der.down.1.block.1.norm2.weight", "der.down.1.block.1.norm2.bias", "der.down.1.block.1.conv2.conv.weight", "der.down.1.block.1.conv2.conv.bias", "der.down.1.block.2.norm1.weight", "der.down.1.block.2.norm1.bias", "der.down.1.block.2.conv1.conv.weight", "der.down.1.block.2.conv1.conv.bias", "der.down.1.block.2.norm2.weight", "der.down.1.block.2.norm2.bias", "der.down.1.block.2.conv2.conv.weight", "der.down.1.block.2.conv2.conv.bias", "der.down.1.downsample.conv.weight", "der.down.1.downsample.conv.bias", "der.down.2.block.0.norm1.weight", "der.down.2.block.0.norm1.bias", "der.down.2.block.0.conv1.conv.weight", "der.down.2.block.0.conv1.conv.bias", "der.down.2.block.0.norm2.weight", "der.down.2.block.0.norm2.bias", "der.down.2.block.0.conv2.conv.weight", "der.down.2.block.0.conv2.conv.bias", "der.down.2.block.1.norm1.weight", "der.down.2.block.1.norm1.bias", "der.down.2.block.1.conv1.conv.weight", "der.down.2.block.1.conv1.conv.bias", "der.down.2.block.1.norm2.weight", "der.down.2.block.1.norm2.bias", "der.down.2.block.1.conv2.conv.weight", "der.down.2.block.1.conv2.conv.bias", "der.down.2.block.2.norm1.weight", "der.down.2.block.2.norm1.bias", "der.down.2.block.2.conv1.conv.weight", "der.down.2.block.2.conv1.conv.bias", "der.down.2.block.2.norm2.weight", "der.down.2.block.2.norm2.bias", "der.down.2.block.2.conv2.conv.weight", "der.down.2.block.2.conv2.conv.bias", "der.down.2.downsample.conv.weight", "der.down.2.downsample.conv.bias", "der.down.3.block.0.norm1.weight", "der.down.3.block.0.norm1.bias", "der.down.3.block.0.conv1.conv.weight", "der.down.3.block.0.conv1.conv.bias", "der.down.3.block.0.norm2.weight", "der.down.3.block.0.norm2.bias", "der.down.3.block.0.conv2.conv.weight", "der.down.3.block.0.conv2.conv.bias", "der.down.3.block.0.nin_shortcut.weight", "der.down.3.block.0.nin_shortcut.bias", "der.down.3.block.1.norm1.weight", "der.down.3.block.1.norm1.bias", "der.down.3.block.1.conv1.conv.weight", "der.down.3.block.1.conv1.conv.bias", "der.down.3.block.1.norm2.weight", "der.down.3.block.1.norm2.bias", "der.down.3.block.1.conv2.conv.weight", "der.down.3.block.1.conv2.conv.bias", "der.down.3.block.2.norm1.weight", "der.down.3.block.2.norm1.bias", "der.down.3.block.2.conv1.conv.weight", "der.down.3.block.2.conv1.conv.bias", "der.down.3.block.2.norm2.weight", "der.down.3.block.2.norm2.bias", "der.down.3.block.2.conv2.conv.weight", "der.down.3.block.2.conv2.conv.bias", "der.mid.block_1.norm1.weight", "der.mid.block_1.norm1.bias", "der.mid.block_1.conv1.conv.weight", "der.mid.block_1.conv1.conv.bias", "der.mid.block_1.norm2.weight", "der.mid.block_1.norm2.bias", "der.mid.block_1.conv2.conv.weight", "der.mid.block_1.conv2.conv.bias", "der.mid.block_2.norm1.weight", "der.mid.block_2.norm1.bias", "der.mid.block_2.conv1.conv.weight", "der.mid.block_2.conv1.conv.bias", "der.mid.block_2.norm2.weight", "der.mid.block_2.norm2.bias", "der.mid.block_2.conv2.conv.weight", "der.mid.block_2.conv2.conv.bias", "der.norm_out.weight", "der.norm_out.bias", "der.conv_out.conv.weight", "der.conv_out.conv.bias", "der.mid.block_1.norm1.norm_layer.weight", "der.mid.block_1.norm1.norm_layer.bias", "der.mid.block_1.norm1.conv_y.conv.weight", "der.mid.block_1.norm1.conv_y.conv.bias", "der.mid.block_1.norm1.conv_b.conv.weight", "der.mid.block_1.norm1.conv_b.conv.bias", "der.mid.block_1.norm2.norm_layer.weight", "der.mid.block_1.norm2.norm_layer.bias", "der.mid.block_1.norm2.conv_y.conv.weight", "der.mid.block_1.norm2.conv_y.conv.bias", "der.mid.block_1.norm2.conv_b.conv.weight", "der.mid.block_1.norm2.conv_b.conv.bias", "der.mid.block_2.norm1.norm_layer.weight", "der.mid.block_2.norm1.norm_layer.bias", "der.mid.block_2.norm1.conv_y.conv.weight", "der.mid.block_2.norm1.conv_y.conv.bias", "der.mid.block_2.norm1.conv_b.conv.weight", "der.mid.block_2.norm1.conv_b.conv.bias", "der.mid.block_2.norm2.norm_layer.weight", "der.mid.block_2.norm2.normpython-BaseException
_layer.bias", "der.mid.block_2.norm2.conv_y.conv.weight", "der.mid.block_2.norm2.conv_y.conv.bias", "der.mid.block_2.norm2.conv_b.conv.weight", "der.mid.block_2.norm2.conv_b.conv.bias", "der.up.0.block.0.norm1.norm_layer.weight", "der.up.0.block.0.norm1.norm_layer.bias", "der.up.0.block.0.norm1.conv_y.conv.weight", "der.up.0.block.0.norm1.conv_y.conv.bias", "der.up.0.block.0.norm1.conv_b.conv.weight", "der.up.0.block.0.norm1.conv_b.conv.bias", "der.up.0.block.0.conv1.conv.weight", "der.up.0.block.0.conv1.conv.bias", "der.up.0.block.0.norm2.norm_layer.weight", "der.up.0.block.0.norm2.norm_layer.bias", "der.up.0.block.0.norm2.conv_y.conv.weight", "der.up.0.block.0.norm2.conv_y.conv.bias", "der.up.0.block.0.norm2.conv_b.conv.weight", "der.up.0.block.0.norm2.conv_b.conv.bias", "der.up.0.block.0.conv2.conv.weight", "der.up.0.block.0.conv2.conv.bias", "der.up.0.block.0.nin_shortcut.weight", "der.up.0.block.0.nin_shortcut.bias", "der.up.0.block.1.norm1.norm_layer.weight", "der.up.0.block.1.norm1.norm_layer.bias", "der.up.0.block.1.norm1.conv_y.conv.weight", "der.up.0.block.1.norm1.conv_y.conv.bias", "der.up.0.block.1.norm1.conv_b.conv.weight", "der.up.0.block.1.norm1.conv_b.conv.bias", "der.up.0.block.1.conv1.conv.weight", "der.up.0.block.1.conv1.conv.bias", "der.up.0.block.1.norm2.norm_layer.weight", "der.up.0.block.1.norm2.norm_layer.bias", "der.up.0.block.1.norm2.conv_y.conv.weight", "der.up.0.block.1.norm2.conv_y.conv.bias", "der.up.0.block.1.norm2.conv_b.conv.weight", "der.up.0.block.1.norm2.conv_b.conv.bias", "der.up.0.block.1.conv2.conv.weight", "der.up.0.block.1.conv2.conv.bias", "der.up.0.block.2.norm1.norm_layer.weight", "der.up.0.block.2.norm1.norm_layer.bias", "der.up.0.block.2.norm1.conv_y.conv.weight", "der.up.0.block.2.norm1.conv_y.conv.bias", "der.up.0.block.2.norm1.conv_b.conv.weight", "der.up.0.block.2.norm1.conv_b.conv.bias", "der.up.0.block.2.conv1.conv.weight", "der.up.0.block.2.conv1.conv.bias", "der.up.0.block.2.norm2.norm_layer.weight", "der.up.0.block.2.norm2.norm_layer.bias", "der.up.0.block.2.norm2.conv_y.conv.weight", "der.up.0.block.2.norm2.conv_y.conv.bias", "der.up.0.block.2.norm2.conv_b.conv.weight", "der.up.0.block.2.norm2.conv_b.conv.bias", "der.up.0.block.2.conv2.conv.weight", "der.up.0.block.2.conv2.conv.bias", "der.up.0.block.3.norm1.norm_layer.weight", "der.up.0.block.3.norm1.norm_layer.bias", "der.up.0.block.3.norm1.conv_y.conv.weight", "der.up.0.block.3.norm1.conv_y.conv.bias", "der.up.0.block.3.norm1.conv_b.conv.weight", "der.up.0.block.3.norm1.conv_b.conv.bias", "der.up.0.block.3.conv1.conv.weight", "der.up.0.block.3.conv1.conv.bias", "der.up.0.block.3.norm2.norm_layer.weight", "der.up.0.block.3.norm2.norm_layer.bias", "der.up.0.block.3.norm2.conv_y.conv.weight", "der.up.0.block.3.norm2.conv_y.conv.bias", "der.up.0.block.3.norm2.conv_b.conv.weight", "der.up.0.block.3.norm2.conv_b.conv.bias", "der.up.0.block.3.conv2.conv.weight", "der.up.0.block.3.conv2.conv.bias", "der.up.1.block.0.norm1.norm_layer.weight", "der.up.1.block.0.norm1.norm_layer.bias", "der.up.1.block.0.norm1.conv_y.conv.weight", "der.up.1.block.0.norm1.conv_y.conv.bias", "der.up.1.block.0.norm1.conv_b.conv.weight", "der.up.1.block.0.norm1.conv_b.conv.bias", "der.up.1.block.0.conv1.conv.weight", "der.up.1.block.0.conv1.conv.bias", "der.up.1.block.0.norm2.norm_layer.weight", "der.up.1.block.0.norm2.norm_layer.bias", "der.up.1.block.0.norm2.conv_y.conv.weight", "der.up.1.block.0.norm2.conv_y.conv.bias", "der.up.1.block.0.norm2.conv_b.conv.weight", "der.up.1.block.0.norm2.conv_b.conv.bias", "der.up.1.block.0.conv2.conv.weight", "der.up.1.block.0.conv2.conv.bias", "der.up.1.block.1.norm1.norm_layer.weight", "der.up.1.block.1.norm1.norm_layer.bias", "der.up.1.block.1.norm1.conv_y.conv.weight", "der.up.1.block.1.norm1.conv_y.conv.bias", "der.up.1.block.1.norm1.conv_b.conv.weight", "der.up.1.block.1.norm1.conv_b.conv.bias", "der.up.1.block.1.conv1.conv.weight", "der.up.1.block.1.conv1.conv.bias", "der.up.1.block.1.norm2.norm_layer.weight", "der.up.1.block.1.norm2.norm_layer.bias", "der.up.1.block.1.norm2.conv_y.conv.weight", "der.up.1.block.1.norm2.conv_y.conv.bias", "der.up.1.block.1.norm2.conv_b.conv.weight", "der.up.1.block.1.norm2.conv_b.conv.bias", "der.up.1.block.1.conv2.conv.weight", "der.up.1.block.1.conv2.conv.bias", "der.up.1.block.2.norm1.norm_layer.weight", "der.up.1.block.2.norm1.norm_layer.bias", "der.up.1.block.2.norm1.conv_y.conv.weight", "der.up.1.block.2.norm1.conv_y.conv.bias", "der.up.1.block.2.norm1.conv_b.conv.weight", "der.up.1.block.2.norm1.conv_b.conv.bias", "der.up.1.block.2.conv1.conv.weight", "der.up.1.block.2.conv1.conv.bias", "der.up.1.block.2.norm2.norm_layer.weight", "der.up.1.block.2.norm2.norm_layer.bias", "der.up.1.block.2.norm2.conv_y.conv.weight", "der.up.1.block.2.norm2.conv_y.conv.bias", "der.up.1.block.2.norm2.conv_b.conv.weight", "der.up.1.block.2.norm2.conv_b.conv.bias", "der.up.1.block.2.conv2.conv.weight", "der.up.1.block.2.conv2.conv.bias", "der.up.1.block.3.norm1.norm_layer.weight", "der.up.1.block.3.norm1.norm_layer.bias", "der.up.1.block.3.norm1.conv_y.conv.weight", "der.up.1.block.3.norm1.conv_y.conv.bias", "der.up.1.block.3.norm1.conv_b.conv.weight", "der.up.1.block.3.norm1.conv_b.conv.bias", "der.up.1.block.3.conv1.conv.weight", "der.up.1.block.3.conv1.conv.bias", "der.up.1.block.3.norm2.norm_layer.weight", "der.up.1.block.3.norm2.norm_layer.bias", "der.up.1.block.3.norm2.conv_y.conv.weight", "der.up.1.block.3.norm2.conv_y.conv.bias", "der.up.1.block.3.norm2.conv_b.conv.weight", "der.up.1.block.3.norm2.conv_b.conv.bias", "der.up.1.block.3.conv2.conv.weight", "der.up.1.block.3.conv2.conv.bias", "der.up.1.upsample.conv.weight", "der.up.1.upsample.conv.bias", "der.up.2.block.0.norm1.norm_layer.weight", "der.up.2.block.0.norm1.norm_layer.bias", "der.up.2.block.0.norm1.conv_y.conv.weight", "der.up.2.block.0.norm1.conv_y.conv.bias", "der.up.2.block.0.norm1.conv_b.conv.weight", "der.up.2.block.0.norm1.conv_b.conv.bias", "der.up.2.block.0.conv1.conv.weight", "der.up.2.block.0.conv1.conv.bias", "der.up.2.block.0.norm2.norm_layer.weight", "der.up.2.block.0.norm2.norm_layer.bias", "der.up.2.block.0.norm2.conv_y.conv.weight", "der.up.2.block.0.norm2.conv_y.conv.bias", "der.up.2.block.0.norm2.conv_b.conv.weight", "der.up.2.block.0.norm2.conv_b.conv.bias", "der.up.2.block.0.conv2.conv.weight", "der.up.2.block.0.conv2.conv.bias", "der.up.2.block.0.nin_shortcut.weight", "der.up.2.block.0.nin_shortcut.bias", "der.up.2.block.1.norm1.norm_layer.weight", "der.up.2.block.1.norm1.norm_layer.bias", "der.up.2.block.1.norm1.conv_y.conv.weight", "der.up.2.block.1.norm1.conv_y.conv.bias", "der.up.2.block.1.norm1.conv_b.conv.weight", "der.up.2.block.1.norm1.conv_b.conv.bias", "der.up.2.block.1.conv1.conv.weight", "der.up.2.block.1.conv1.conv.bias", "der.up.2.block.1.norm2.norm_layer.weight", "der.up.2.block.1.norm2.norm_layer.bias", "der.up.2.block.1.norm2.conv_y.conv.weight", "der.up.2.block.1.norm2.conv_y.conv.bias", "der.up.2.block.1.norm2.conv_b.conv.weight", "der.up.2.block.1.norm2.conv_b.conv.bias", "der.up.2.block.1.conv2.conv.weight", "der.up.2.block.1.conv2.conv.bias", "der.up.2.block.2.norm1.norm_layer.weight", "der.up.2.block.2.norm1.norm_layer.bias", "der.up.2.block.2.norm1.conv_y.conv.weight", "der.up.2.block.2.norm1.conv_y.conv.bias", "der.up.2.block.2.norm1.conv_b.conv.weight", "der.up.2.block.2.norm1.conv_b.conv.bias", "der.up.2.block.2.conv1.conv.weight", "der.up.2.block.2.conv1.conv.bias", "der.up.2.block.2.norm2.norm_layer.weight", "der.up.2.block.2.norm2.norm_layer.bias", "der.up.2.block.2.norm2.conv_y.conv.weight", "der.up.2.block.2.norm2.conv_y.conv.bias", "der.up.2.block.2.norm2.conv_b.conv.weight", "der.up.2.block.2.norm2.conv_b.conv.bias", "der.up.2.block.2.conv2.conv.weight", "der.up.2.block.2.conv2.conv.bias", "der.up.2.block.3.norm1.norm_layer.weight", "der.up.2.block.3.norm1.norm_layer.bias", "der.up.2.block.3.norm1.conv_y.conv.weight", "der.up.2.block.3.norm1.conv_y.conv.bias", "der.up.2.block.3.norm1.conv_b.conv.weight", "der.up.2.block.3.norm1.conv_b.conv.bias", "der.up.2.block.3.conv1.conv.weight", "der.up.2.block.3.conv1.conv.bias", "der.up.2.block.3.norm2.norm_layer.weight", "der.up.2.block.3.norm2.norm_layer.bias", "der.up.2.block.3.norm2.conv_y.conv.weight", "der.up.2.block.3.norm2.conv_y.conv.bias", "der.up.2.block.3.norm2.conv_b.conv.weight", "der.up.2.block.3.norm2.conv_b.conv.bias", "der.up.2.block.3.conv2.conv.weight", "der.up.2.block.3.conv2.conv.bias", "der.up.2.upsample.conv.weight", "der.up.2.upsample.conv.bias", "der.up.3.block.0.norm1.norm_layer.weight", "der.up.3.block.0.norm1.norm_layer.bias", "der.up.3.block.0.norm1.conv_y.conv.weight", "der.up.3.block.0.norm1.conv_y.conv.bias", "der.up.3.block.0.norm1.conv_b.conv.weight", "der.up.3.block.0.norm1.conv_b.conv.bias", "der.up.3.block.0.conv1.conv.weight", "der.up.3.block.0.conv1.conv.bias", "der.up.3.block.0.norm2.norm_layer.weight", "der.up.3.block.0.norm2.norm_layer.bias", "der.up.3.block.0.norm2.conv_y.conv.weight", "der.up.3.block.0.norm2.conv_y.conv.bias", "der.up.3.block.0.norm2.conv_b.conv.weight", "der.up.3.block.0.norm2.conv_b.conv.bias", "der.up.3.block.0.conv2.conv.weight", "der.up.3.block.0.conv2.conv.bias", "der.up.3.block.1.norm1.norm_layer.weight", "der.up.3.block.1.norm1.norm_layer.bias", "der.up.3.block.1.norm1.conv_y.conv.weight", "der.up.3.block.1.norm1.conv_y.conv.bias", "der.up.3.block.1.norm1.conv_b.conv.weight", "der.up.3.block.1.norm1.conv_b.conv.bias", "der.up.3.block.1.conv1.conv.weight", "der.up.3.block.1.conv1.conv.bias", "der.up.3.block.1.norm2.norm_layer.weight", "der.up.3.block.1.norm2.norm_layer.bias", "der.up.3.block.1.norm2.conv_y.conv.weight", "der.up.3.block.1.norm2.conv_y.conv.bias", "der.up.3.block.1.norm2.conv_b.conv.weight", "der.up.3.block.1.norm2.conv_b.conv.bias", "der.up.3.block.1.conv2.conv.weight", "der.up.3.block.1.conv2.conv.bias", "der.up.3.block.2.norm1.norm_layer.weight", "der.up.3.block.2.norm1.norm_layer.bias", "der.up.3.block.2.norm1.conv_y.conv.weight", "der.up.3.block.2.norm1.conv_y.conv.bias", "der.up.3.block.2.norm1.conv_b.conv.weight", "der.up.3.block.2.norm1.conv_b.conv.bias", "der.up.3.block.2.conv1.conv.weight", "der.up.3.block.2.conv1.conv.bias", "der.up.3.block.2.norm2.norm_layer.weight", "der.up.3.block.2.norm2.norm_layer.bias", "der.up.3.block.2.norm2.conv_y.conv.weight", "der.up.3.block.2.norm2.conv_y.conv.bias", "der.up.3.block.2.norm2.conv_b.conv.weight", "der.up.3.block.2.norm2.conv_b.conv.bias", "der.up.3.block.2.conv2.conv.weight", "der.up.3.block.2.conv2.conv.bias", "der.up.3.block.3.norm1.norm_layer.weight", "der.up.3.block.3.norm1.norm_layer.bias", "der.up.3.block.3.norm1.conv_y.conv.weight", "der.up.3.block.3.norm1.conv_y.conv.bias", "der.up.3.block.3.norm1.conv_b.conv.weight", "der.up.3.block.3.norm1.conv_b.conv.bias", "der.up.3.block.3.conv1.conv.weight", "der.up.3.block.3.conv1.conv.bias", "der.up.3.block.3.norm2.norm_layer.weight", "der.up.3.block.3.norm2.norm_layer.bias", "der.up.3.block.3.norm2.conv_y.conv.weight", "der.up.3.block.3.norm2.conv_y.conv.bias", "der.up.3.block.3.norm2.conv_b.conv.weight", "der.up.3.block.3.norm2.conv_b.conv.bias", "der.up.3.block.3.conv2.conv.weight", "der.up.3.block.3.conv2.conv.bias", "der.up.3.upsample.conv.weight", "der.up.3.upsample.conv.bias", "der.norm_out.norm_layer.weight", "der.norm_out.norm_layer.bias", "der.norm_out.conv_y.conv.weight", "der.norm_out.conv_y.conv.bias", "der.norm_out.conv_b.conv.weight", "der.norm_out.conv_b.conv.bias", "patch_embed.pos_embedding".
Expected behavior / 期待表现
正常转模型
The text was updated successfully, but these errors were encountered: