Problem
Use of private memory fails for some valid parameter configurations for the mm nvidia version if private opencl address space is used. Used to work in older versions. The performance is slightly worse if local address space is used.
// FIXME: there seems to be a bug in AdjustArraySizesForAllocations
// oclReduceSeq(AddressSpace.Private)(fun((p13, p14) =>
oclReduceSeq(AddressSpace.Local)(fun((p13, p14) =>
Reproduce
Change lines in expression. Then, execute this test. The first parameter configuration will fail if the private version is used.
Output and Performance (RTX 2070)
Local
1. 6.081472 ms
2. 5.368928 ms
Private
1. CODE_GENERATION_ERROR - assertion failed
2. 4.872192 ms
Todo
Find out reason for assertion failed - error.
Problem
Use of private memory fails for some valid parameter configurations for the mm nvidia version if private opencl address space is used. Used to work in older versions. The performance is slightly worse if local address space is used.
Reproduce
Change lines in expression. Then, execute this test. The first parameter configuration will fail if the private version is used.
Output and Performance (RTX 2070)
Local
Private
Todo
Find out reason for
assertion failed- error.