Migrate INCConfig for HPU
#779
base: main
Conversation
Signed-off-by: yiliu30 <[email protected]>
Pull request overview
This PR migrates the HPU-specific INCConfig implementation so the plugin stays compatible as the main vLLM repository repurposes its INC configuration for GPU/CPU use. A placeholder `_FakeINCConfig` is introduced to handle the "inc" quantization method in the HPU plugin while delegating actual quantization to unquantized methods.
Key Changes:
- Introduces a new `_FakeINCConfig` class that acts as a stub for Intel Neural Compressor quantization (see the sketch after this list)
- Monkey-patches vLLM's `get_quantization_config` function to intercept "inc" requests and return the fake config
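For reference, here is a minimal sketch of what such a stub might look like, assuming vLLM's `QuantizationConfig` interface (method names and signatures vary across vLLM versions, and the actual class in `vllm_gaudi/extension/quant.py` may differ):

```python
# Illustrative sketch only; the real _FakeINCConfig in
# vllm_gaudi/extension/quant.py may differ in detail.
from typing import Any, Optional

import torch
from vllm.model_executor.layers.linear import (LinearBase,
                                               UnquantizedLinearMethod)
from vllm.model_executor.layers.quantization.base_config import (
    QuantizationConfig, QuantizeMethodBase)


class _FakeINCConfig(QuantizationConfig):
    """Stub standing in for Intel Neural Compressor quantization on HPU.

    Satisfies vLLM's QuantizationConfig interface but hands every layer
    back to the unquantized code path; actual INC quantization happens
    elsewhere in the HPU stack.
    """

    @classmethod
    def get_name(cls) -> str:
        return "inc"

    @classmethod
    def get_supported_act_dtypes(cls) -> list[torch.dtype]:
        return [torch.bfloat16]

    @classmethod
    def get_min_capability(cls) -> int:
        # Capability checks target CUDA GPUs and are irrelevant on HPU.
        return -1

    @classmethod
    def get_config_filenames(cls) -> list[str]:
        return []

    @classmethod
    def from_config(cls, config: dict[str, Any]) -> "_FakeINCConfig":
        return cls()

    def get_quant_method(self, layer: torch.nn.Module,
                         prefix: str) -> Optional[QuantizeMethodBase]:
        # Delegate linear layers to the unquantized method. Per the PR
        # description, the real stub also covers MoE layers.
        if isinstance(layer, LinearBase):
            return UnquantizedLinearMethod()
        return None
```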
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| `vllm_gaudi/extension/quant.py` | New file defining the `_FakeINCConfig` stub that returns unquantized methods for linear and MoE layers |
| `vllm_gaudi/extension/ops.py` | Adds monkey-patch function `oot_get_quantization_config` to override vLLM's quantization config retrieval for the "inc" method |
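A hedged sketch of the monkey-patch side, assuming vLLM exposes `get_quantization_config` in `vllm.model_executor.layers.quantization` (the actual `oot_get_quantization_config` in `vllm_gaudi/extension/ops.py` may wire things differently):

```python
# Illustrative sketch only; the real oot_get_quantization_config in
# vllm_gaudi/extension/ops.py may differ in naming and wiring.
import vllm.model_executor.layers.quantization as vllm_quant

from vllm_gaudi.extension.quant import _FakeINCConfig

# Keep a handle to the original lookup so non-"inc" methods still work.
_original_get_quantization_config = vllm_quant.get_quantization_config


def oot_get_quantization_config(quantization: str):
    # Intercept the HPU "inc" method and return the stub config class;
    # every other method falls through to vLLM's own registry.
    if quantization == "inc":
        return _FakeINCConfig
    return _original_get_quantization_config(quantization)


# Install the override so vLLM resolves "inc" to the stub.
vllm_quant.get_quantization_config = oot_get_quantization_config
```

One caveat with this style of patch: modules that imported `get_quantization_config` by name before the override is installed would keep the original reference, so the patch has to run early in plugin initialization.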
✅ CI Passed: all checks passed successfully against the following vllm commit:
As part of vllm-project/vllm#31716, the `INCConfig` in vllm-project/vllm will be used for GPU/CPU. However, we still need a placeholder for the INC path in the plugin.

cc @hshen14 @thuang6 @kzawora-intel @xuechendi