Skip to content

Puzzletron OOM during step 6 (one-block scoring) for Qwen 3.5-2B VLM #1779

Description

@danielkorzekwa

branch: dkorzekwa/claude_qwen35

Environment: 2× H100 80GB, Puzzletron full pipeline, nproc_per_node=2

Model: Qwen3.5-2B (model_type: qwen3_5, a VLM with nested text_config)

Failure: torch.OutOfMemoryError on GPU 1 during step 6 ("calculating one block scores"), with ~74 GiB consumed out of 80 GiB.

Metadata

Metadata

Labels

bugSomething isn't working

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions