Can the batch size be 1? #19

@JuncFang-git

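Great work, and thanks for open-sourcing it! The code contains the line "assert bsz >= 4 # we need minimal batch=4 to assign different target time, see L132". However, when training a large model such as Qwen Image Edit, GPU memory can barely support bsz=4. Is there any way to train with bsz=1?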
