-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
gpu: nvidia: ip: respect acc_mode for sum post-op #2479
Conversation
0f0897a
to
383b1a2
Compare
383b1a2
to
441a27d
Compare
441a27d
to
7841ac5
Compare
7841ac5
to
d1255da
Compare
I think this PR still needs a review from @oneapi-src/onednn-doc |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Edits suggested, please incorporate as you see fit! Thanks!
d1255da
to
a8d99a7
Compare
Thanks @ranukund , I added the suggested changes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the addressing the feedback. One relatively minor comment and it's good to go.
a8d99a7
to
90190e7
Compare
It looks like the precommit check failures are in files unrelated to this PR. |
Description
Currently the inner product error threshold in benchdnn is set to 0. In some cases for large shapes on nvidia backend there are some precision issues (e.g. the cases reported in MFDNN-12610). This PR adjusts the error threshold so that such cases are not reported as failures.
Fixes MFDNN-12610.