Skip to content

Use of opencl local address space instead of private in nvidia mm version #222

@johanneslenfers

Description

@johanneslenfers

Problem

Use of private memory fails for some valid parameter configurations for the mm nvidia version if private opencl address space is used. Used to work in older versions. The performance is slightly worse if local address space is used.

                // FIXME: there seems to be a bug in AdjustArraySizesForAllocations
//                oclReduceSeq(AddressSpace.Private)(fun((p13, p14) =>
                  oclReduceSeq(AddressSpace.Local)(fun((p13, p14) =>

Reproduce

Change lines in expression. Then, execute this test. The first parameter configuration will fail if the private version is used.

Output and Performance (RTX 2070)

Local

 1. 6.081472 ms
 2. 5.368928 ms

Private

1. CODE_GENERATION_ERROR - assertion failed
2. 4.872192 ms

Todo

Find out reason for assertion failed - error.

Metadata

Metadata

Assignees

No one assigned

    Labels

    bugSomething isn't workingprio:low

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions