修复BCEWithLogitsLoss类型不匹配及save_image API兼容性问题 by Dingchunru · Pull Request #40 · ando-khachatryan/HiDDeN

Dingchunru · 2025-06-19T06:29:40Z

问题描述：

BCEWithLogitsLoss类型错误：

使用torch.nn.BCEWithLogitsLoss时，目标张量默认创建为torch.long类型，但该损失函数要求torch.float32

错误信息：RuntimeError: result type Float can't be cast to the desired output type Long

问题影响：导致训练过程中断，无法计算二元交叉熵损失

save_image API兼容性问题：

新版torchvision的save_image函数参数解析方式变化

错误信息：AttributeError: 'int' object has no attribute 'upper'

问题影响：无法正确保存生成的图像结果

修改方案：

BCEWithLogitsLoss类型修复：

在所有torch.full()创建目标张量处显式指定dtype=torch.float32

确保目标张量与模型输出类型一致，符合概率计算的数学要求

save_image API修复：

将位置参数改为命名参数nrow=original_images.shape[0]

显式添加format='JPEG'参数确保输出格式明确

代码变更示例：

diff

BCEWithLogitsLoss 相关修改

d_target_label_cover = torch.full((images_cover.size(0),), self.cover_label)

d_target_label_cover = torch.full((images_cover.size(0),), self.cover_label, dtype=torch.float32)

g_target_label_encoded = torch.full((images_cover.size(0),), self.encoded_label)

g_target_label_encoded = torch.full((images_cover.size(0),), self.encoded_label, dtype=torch.float32)

save_image 相关修改

save_image(original_images, "original.jpg", original_images.shape[0])

save_image(original_images, "original.jpg", nrow=original_images.shape[0], format='JPEG')
影响范围：

正向影响：修复后可以正常进行模型训练和图像保存

兼容性：修改后的代码兼容PyTorch 1.7+和torchvision 0.8+版本

性能：无负面影响，类型匹配反而可能提升GPU计算效率

测试建议：

运行训练脚本验证BCEWithLogitsLoss计算是否正常

检查生成的图像文件是否符合预期格式和布局

在不同PyTorch版本(1.7-2.0)环境下验证兼容性

问题描述： save_image API兼容性问题：新版torchvision的save_image函数参数解析方式变化错误信息：AttributeError: 'int' object has no attribute 'upper' 问题影响：无法正确保存生成的图像结果修改方案：将位置参数改为命名参数nrow=original_images.shape[0] 显式添加format='JPEG'参数确保输出格式明确代码变更示例： # save_image 相关修改 - save_image(original_images, "original.jpg", original_images.shape[0]) + save_image(original_images, "original.jpg", nrow=original_images.shape[0], format='JPEG') 影响范围：正向影响：修复后可以正常进行模型训练和图像保存兼容性：修改后的代码兼容PyTorch 1.7+和torchvision 0.8+版本性能：无负面影响，类型匹配反而可能提升GPU计算效率测试建议：运行训练脚本验证BCEWithLogitsLoss计算是否正常检查生成的图像文件是否符合预期格式和布局在不同PyTorch版本(1.7-2.0)环境下验证兼容性

问题描述： BCEWithLogitsLoss类型错误：使用torch.nn.BCEWithLogitsLoss时，目标张量默认创建为torch.long类型，但该损失函数要求torch.float32 错误信息：RuntimeError: result type Float can't be cast to the desired output type Long 问题影响：导致训练过程中断，无法计算二元交叉熵损失修改方案： BCEWithLogitsLoss类型修复：在所有torch.full()创建目标张量处显式指定dtype=torch.float32 确保目标张量与模型输出类型一致，符合概率计算的数学要求代码变更示例： diff # BCEWithLogitsLoss 相关修改 - d_target_label_cover = torch.full((images_cover.size(0),), self.cover_label) + d_target_label_cover = torch.full((images_cover.size(0),), self.cover_label, dtype=torch.float32) - g_target_label_encoded = torch.full((images_cover.size(0),), self.encoded_label) + g_target_label_encoded = torch.full((images_cover.size(0),), self.encoded_label, dtype=torch.float32) 兼容性：修改后的代码兼容PyTorch 1.7+和torchvision 0.8+版本性能：无负面影响，类型匹配反而可能提升GPU计算效率测试建议：运行训练脚本验证BCEWithLogitsLoss计算是否正常检查生成的图像文件是否符合预期格式和布局在不同PyTorch版本(1.7-2.0)环境下验证兼容性

Dingchunru added 2 commits June 19, 2025 14:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

修复BCEWithLogitsLoss类型不匹配及save_image API兼容性问题#40

修复BCEWithLogitsLoss类型不匹配及save_image API兼容性问题#40
Dingchunru wants to merge 2 commits intoando-khachatryan:masterfrom
Dingchunru:master

Dingchunru commented Jun 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

Dingchunru commented Jun 19, 2025

BCEWithLogitsLoss 相关修改

save_image 相关修改

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant