Branch: dkorzekwa/claude_qwen35
Environment: 2× H100 80GB, Puzzletron full pipeline, nproc_per_node=2
Model: Qwen3.5-2B (model_type: qwen3_5, a VLM with nested text_config)
Failure: torch.OutOfMemoryError on GPU 1 during step 6 ("calculating one block scores"), with ~74 GiB consumed out of 80 GiB.