Skip to content

AWQ Support for ONNX Backend #3571

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
wants to merge 5 commits into
base: develop
Choose a base branch
from

Conversation

andrey-churkin
Copy link
Contributor

@andrey-churkin andrey-churkin commented Jul 2, 2025

Changes

AWQ Support for ONNX Backend

Reason for changes

Ref: 168332

Related tickets

Ref: 168332

Tests

  • tinyllama_data_free_awq_backend_ONNX

@andrey-churkin andrey-churkin requested a review from a team as a code owner July 2, 2025 12:25
@github-actions github-actions bot added the NNCF PTQ Pull requests that updates NNCF PTQ label Jul 2, 2025
@andrey-churkin
Copy link
Contributor Author

@andrey-churkin
Copy link
Contributor Author

@openvinotoolkit/nncf-maintainers Please review

Copy link
Contributor

@ljaljushkin ljaljushkin left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Comment on lines 118 to +124
metric_value: 0.85466
num_int4: 94
num_int8: 124
tinyllama_data_free_awq_backend_ONNX:
metric_value: 0.82562
num_int4: 264
num_int8: 84
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd also double check why similarity and number of int4 and int8 are different for the same compression configuration

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
NNCF PTQ Pull requests that updates NNCF PTQ
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants