Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

gpu: nvidia: ip: respect acc_mode for sum post-op #2479

Merged
merged 1 commit into from
Feb 25, 2025

Conversation

sgeor255
Copy link
Contributor

Description

Currently the inner product error threshold in benchdnn is set to 0. In some cases for large shapes on nvidia backend there are some precision issues (e.g. the cases reported in MFDNN-12610). This PR adjusts the error threshold so that such cases are not reported as failures.

Fixes MFDNN-12610.

@sgeor255 sgeor255 requested a review from a team as a code owner January 22, 2025 10:46
@github-actions github-actions bot added the component:tests Codeowner: @oneapi-src/onednn-arch label Jan 22, 2025
@sgeor255 sgeor255 force-pushed the svet/nvdia-ip-precision branch from 0f0897a to 383b1a2 Compare January 22, 2025 11:59
@sgeor255 sgeor255 force-pushed the svet/nvdia-ip-precision branch from 383b1a2 to 441a27d Compare January 31, 2025 16:37
@sgeor255 sgeor255 requested review from a team as code owners January 31, 2025 16:37
@github-actions github-actions bot added documentation A request to change/fix/improve the documentation. Codeowner: @oneapi-src/onednn-doc platform:gpu-nvidia Codeowner: @oneapi-src/onednn-gpu-nvidia labels Jan 31, 2025
@sgeor255 sgeor255 force-pushed the svet/nvdia-ip-precision branch from 441a27d to 7841ac5 Compare February 4, 2025 15:04
@sgeor255 sgeor255 force-pushed the svet/nvdia-ip-precision branch from 7841ac5 to d1255da Compare February 6, 2025 08:37
@sgeor255
Copy link
Contributor Author

sgeor255 commented Feb 6, 2025

I think this PR still needs a review from @oneapi-src/onednn-doc

Copy link
Contributor

@ranukund ranukund left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Edits suggested, please incorporate as you see fit! Thanks!

@sgeor255 sgeor255 force-pushed the svet/nvdia-ip-precision branch from d1255da to a8d99a7 Compare February 6, 2025 15:37
@sgeor255
Copy link
Contributor Author

sgeor255 commented Feb 6, 2025

Thanks @ranukund , I added the suggested changes.

@sgeor255 sgeor255 requested a review from ranukund February 6, 2025 15:39
Copy link
Contributor

@ranukund ranukund left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank you!

@sgeor255 sgeor255 changed the title gpu: nvidia: ip: adjust benchdnn error threshold gpu: nvidia: ip: respect fp_math_mode for sum post-op Feb 19, 2025
@sgeor255 sgeor255 changed the title gpu: nvidia: ip: respect fp_math_mode for sum post-op gpu: nvidia: ip: respect acc_mode for sum post-op Feb 19, 2025
Copy link
Contributor

@dzarukin dzarukin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the addressing the feedback. One relatively minor comment and it's good to go.

@sgeor255 sgeor255 force-pushed the svet/nvdia-ip-precision branch from a8d99a7 to 90190e7 Compare February 24, 2025 12:52
@sgeor255 sgeor255 requested a review from dzarukin February 24, 2025 12:53
@sgeor255
Copy link
Contributor Author

It looks like the precommit check failures are in files unrelated to this PR.

@sgeor255 sgeor255 merged commit bbf8399 into uxlfoundation:main Feb 25, 2025
21 of 22 checks passed
manaalmj pushed a commit to manaalmj/oneDNN that referenced this pull request Mar 4, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component:tests Codeowner: @oneapi-src/onednn-arch documentation A request to change/fix/improve the documentation. Codeowner: @oneapi-src/onednn-doc platform:gpu-nvidia Codeowner: @oneapi-src/onednn-gpu-nvidia
Projects
None yet
Development

Successfully merging this pull request may close these issues.

7 participants