This repository was archived by the owner on Jul 22, 2024. It is now read-only.

Conversation

@ROGERDJQ

There may be an issue when multiple GPUs are used.

Since the default is self.batch_size=8 at line 452, when multiple GPUs are used and the data has more than 8 examples, domain_batched at line 452 actually has fewer elements than it should, which leads to the zip error at line 460. (For example, with 3 GPUs and train batch size 4, across all domains line 452 returns only one element, while the other three fields at line 456 each return 2 elements: one with shape (8,) and the other with shape (4,).) Note that domain_batched is not used afterwards, so the most straightforward fix may be to simply delete it.
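A minimal sketch of the mismatch described above (the names `batch`, `domain_batched`, and `other_batched` are hypothetical stand-ins for the real code around lines 452-460): one field ends up as a single batch while the others are split with the default batch_size=8, so zipping the results drops or misaligns the trailing batch.

```python
def batch(seq, size):
    """Split seq into consecutive chunks of at most `size` items."""
    return [seq[i:i + size] for i in range(0, len(seq), size)]

num_gpus, per_gpu_batch = 3, 4
examples = list(range(num_gpus * per_gpu_batch))  # 12 items in total

# Analogous to line 452: the field comes back as a single element ...
domain_batched = [examples]                # 1 element
# ... while the fields at line 456 are split with the default batch_size=8.
other_batched = batch(examples, 8)         # 2 elements, shapes (8,) and (4,)

# zip() stops at the shorter iterable, so the trailing (4,)-shaped batch
# is silently dropped; downstream code that assumes equal lengths breaks.
pairs = list(zip(domain_batched, other_batched))
print(len(domain_batched), len(other_batched), len(pairs))  # 1 2 1
```

With `zip(..., strict=True)` (Python 3.10+) the same length mismatch raises a ValueError instead of truncating silently.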

@songfeng
Contributor

@sivasankalpp could you take a look?

