Updated reference with torch.compile by nikita-malininn · Pull Request #3381 · openvinotoolkit/nncf

nikita-malininn · 2025-03-26T12:11:45Z

Changes

Added torch.compile for forward & backward in reference implementation.

Reason for changes

Training speed-up from 7 minutes to 5 minutes for 1 epoch of phi3.5 qat-lora tuning

Related tickets

163973

Tests

https://github.com/openvinotoolkit/nncf/actions/runs/14083174698 - passed
Windows precommit run - windows/precommit_torch_cpu/639/ - passed

examples/llm_compression/torch/qat_with_lora example times:

Epoch	Branch	Time
Epoch 0	nm/ref_compile	4m 10s
Epoch 0	develop	4m 30s

tests times:

Test	Branch	Time
tests/torch/quantization/test_strip.py	develop	36.99s
tests/torch/quantization/test_strip.py	nm/ref_compile	49.80s
tests/torch/ptq/test_fq_lora.py	develop	19.51s
tests/torch/ptq/test_fq_lora.py	nm/ref_compile	23.45s

Reopened #3343

nikita-malininn · 2025-03-26T13:17:32Z

https://github.com/openvinotoolkit/nncf/actions/runs/14083174698/job/39440621184#step:8:4268

WARNING:nncf:Could not use torch.compile with reference functions. Falling back on not compiled versions - Reason: Windows not yet supported for torch.compile

alexsu52 · 2025-04-11T08:59:27Z

        orig_shape = grad_output.shape
        grad_output = grad_output.reshape(input_shape)
-
+        # TODO:(nlyalyus) should be implemented via torch extensions, but some optimizations are required: ticket-161670


@ljaljushkin, @nikita-malininn what do you think about covering this TODO in this PR, since it didn't make this into the release and we probably have time to not leave technical debt?

Agree, we have time now.
But, as we discussed, to make a decision about implementing CUDA or Triton kernel for group-wise fake quantize, we first need to perform benchmarking (task 165734). torch.compile could be a good option

nikita-malininn · 2025-04-17T10:08:46Z

@alexsu52, @ljaljushkin, @AlexanderDokuchaev, review, please.

ljaljushkin

No major comments from my side.
only rebase is needed

…mpile

AlexanderDokuchaev · 2025-04-24T20:26:43Z


+torch_executor = ReferenceQuantize(backend_type=ReferenceBackendType.TORCH)
+torch_forward = CompilationWrapper(torch_executor.forward)
+torch_backward = CompilationWrapper(torch_executor.backward)


Wrapped function lose annotation

If write decorator like function, correctly it looks like

nncf/nncf/common/utils/caching.py

Line 61 in dd99ed5

def cache_results(cache: ResultsCache) -> Callable[[TFunc], TFunc]:

(used functools.wrap and TypeVar to keep signature and docstring of function)
It breaks suggestion in editors and check arguments by mypy, but i dont know how do it for class.

Co-authored-by: Alexander Dokuchaev <alexander.dokuchaev@intel.com>

github-actions Bot added the NNCF PT Pull requests that updates NNCF PyTorch label Mar 26, 2025

nikita-malininn requested a review from ljaljushkin March 26, 2025 13:15

nikita-malininn marked this pull request as ready for review March 26, 2025 13:15

nikita-malininn requested a review from a team as a code owner March 26, 2025 13:15

ljaljushkin suggested changes Mar 26, 2025

View reviewed changes

Comment thread nncf/torch/quantization/reference.py Outdated

nikita-malininn requested a review from ljaljushkin March 26, 2025 15:56

nikita-malininn marked this pull request as draft March 27, 2025 07:13

nikita-malininn marked this pull request as ready for review March 27, 2025 16:01

nikita-malininn assigned ljaljushkin Mar 27, 2025

ljaljushkin requested a review from AlexanderDokuchaev March 28, 2025 14:49

ljaljushkin approved these changes Mar 28, 2025

View reviewed changes

ljaljushkin assigned AlexanderDokuchaev and unassigned ljaljushkin Mar 31, 2025

AlexanderDokuchaev requested changes Apr 7, 2025

View reviewed changes

Comment thread nncf/torch/quantization/reference.py Outdated

nikita-malininn requested a review from AlexanderDokuchaev April 9, 2025 06:55

alexsu52 reviewed Apr 11, 2025

View reviewed changes

nikita-malininn marked this pull request as draft April 11, 2025 09:32

nikita-malininn force-pushed the nm/ref_compile branch from 03689f0 to be7e640 Compare April 11, 2025 14:22

nikita-malininn added 11 commits April 11, 2025 16:34

Updated ref wrap

25daf42

Apply comment

7f3f3a5

Added wrapper

088406a

Update approach

e5d15da

Small fixes

2214036

Update quantize_functions.py

a9d6f90

Update

8241082

Remove comment

7dd097c

Replace module

29c55bc

Add test

80952e3

Rollback

be7e640

nikita-malininn added 2 commits April 17, 2025 11:13

Update test

f6dc0b5

Change ref

29981bd

nikita-malininn marked this pull request as ready for review April 17, 2025 10:08

nikita-malininn requested review from alexsu52 and ljaljushkin April 17, 2025 10:08

ljaljushkin suggested changes Apr 17, 2025

View reviewed changes

Comment thread tests/torch/test_utils.py Outdated

Comment thread nncf/torch/quantization/reference.py

nikita-malininn requested a review from ljaljushkin April 17, 2025 15:25

nikita-malininn added 2 commits April 17, 2025 17:37

Update test for windows

3be6181

Apply comment

8b567ed

ljaljushkin suggested changes Apr 23, 2025

View reviewed changes

Merge remote-tracking branch 'openvinotoolkit/develop' into nm/ref_co…

063dacb

…mpile

nikita-malininn requested a review from ljaljushkin April 23, 2025 11:09

ljaljushkin approved these changes Apr 23, 2025

View reviewed changes

AlexanderDokuchaev approved these changes Apr 24, 2025

View reviewed changes

Update nncf/torch/utils.py

1036480

Co-authored-by: Alexander Dokuchaev <alexander.dokuchaev@intel.com>

alexsu52 merged commit 3f7d9e4 into openvinotoolkit:develop Apr 28, 2025
20 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Updated reference with torch.compile#3381

Updated reference with torch.compile#3381
alexsu52 merged 17 commits intoopenvinotoolkit:developfrom
nikita-malininn:nm/ref_compile

nikita-malininn commented Mar 26, 2025 •

edited

Loading

Uh oh!

nikita-malininn commented Mar 26, 2025

Uh oh!

Uh oh!

Uh oh!

alexsu52 Apr 11, 2025

Uh oh!

ljaljushkin Apr 11, 2025

Uh oh!

Uh oh!

nikita-malininn commented Apr 17, 2025

Uh oh!

Uh oh!

Uh oh!

ljaljushkin left a comment

Uh oh!

Uh oh!

AlexanderDokuchaev Apr 24, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

nikita-malininn commented Mar 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Reason for changes

Related tickets

Tests

Uh oh!

nikita-malininn commented Mar 26, 2025

Uh oh!

Uh oh!

Uh oh!

alexsu52 Apr 11, 2025

Choose a reason for hiding this comment

Uh oh!

ljaljushkin Apr 11, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

nikita-malininn commented Apr 17, 2025

Uh oh!

Uh oh!

Uh oh!

ljaljushkin left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

AlexanderDokuchaev Apr 24, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

nikita-malininn commented Mar 26, 2025 •

edited

Loading