Skip to content

Conversation

@dudilester
Copy link
Contributor

No description provided.

Copy link
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

This PR introduces dynamic scaling for the V-Cache (value cache) on the hidden dimension, extending the existing dynamic scaling support beyond just the sequence length (T) dimension. The change modifies the value cache scaling mechanism to use a tuple of two scale tensors instead of a single tensor.

Key Changes:

  • Extended value cache scaling to support two dimensions: sequence length and hidden dimension
  • Modified value_scales from a single tensor to a tuple of two tensors (value_scales_on_T, value_scales_on_hidden)
  • Updated all related cache operations and type signatures to handle the new tuple structure

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.

Show a summary per file
File Description
vllm_gaudi/v1/worker/hpu_model_runner.py Initializes two separate scale tensors for V-Cache and combines them into a tuple
vllm_gaudi/extension/ops.py Updates unflatten operation to handle tuple structure for v_scales
vllm_gaudi/extension/cache_ops.py Modifies copy_blocks to access first element of v_scales tuple
vllm_gaudi/attention/ops/hpu_paged_attn.py Updates type hints to reflect tuple structure for value scales
vllm_gaudi/attention/backends/hpu_attn.py Updates kv_cache type signature to reflect new tuple structure

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch 4 times, most recently from a38f37c to 348722d Compare December 30, 2025 06:39
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from 316fef7 to 500c8ba Compare December 30, 2025 06:55
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

Signed-off-by: Dudi Lester <[email protected]>
@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from b695d45 to 86ee21a Compare January 1, 2026 08:54
@github-actions
Copy link

github-actions bot commented Jan 1, 2026

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

github-actions bot commented Jan 1, 2026

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

1 similar comment
@github-actions
Copy link

github-actions bot commented Jan 5, 2026

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

@github-actions
Copy link

github-actions bot commented Jan 6, 2026

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

1 similar comment
@github-actions
Copy link

github-actions bot commented Jan 7, 2026

✅ CI Passed

All checks passed successfully against the following vllm commit:
b3a2bdf1ac90748d58bf8c05f8d0095ede5c7eca

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from 2f61c6e to 686dba2 Compare January 12, 2026 15:41
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
aa125ecf0edb9cd67656553d11d643aeb444ff9e

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from 88e44f5 to b02f2cc Compare January 15, 2026 15:08
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
4c1c501a7ee1d5efbad945ea62a702ce5cefb799

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from 1d2ad8e to 9875c13 Compare January 18, 2026 10:47
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
6218034dd7f9a56596e4fd8c8c8fc1d8011ed9c2

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from fc74a67 to 1c95dd4 Compare January 18, 2026 14:52
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from 1b0ca62 to 9f5d89d Compare January 18, 2026 15:31
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from 394011c to 89f051a Compare January 18, 2026 17:54
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
6218034dd7f9a56596e4fd8c8c8fc1d8011ed9c2

@dudilester dudilester force-pushed the dev/dudilester/dynamic_kv_on_h_dim branch from bbc87a0 to 89f051a Compare January 19, 2026 13:21
@github-actions
Copy link

🚧 CI Blocked

The main CI workflow was not started for the following reason:

Your branch is behind the base branch. Please merge or rebase to get the latest changes.

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
6218034dd7f9a56596e4fd8c8c8fc1d8011ed9c2

@github-actions
Copy link

✅ CI Passed

All checks passed successfully against the following vllm commit:
6218034dd7f9a56596e4fd8c8c8fc1d8011ed9c2

@adobrzyn adobrzyn merged commit 472ed88 into vllm-project:main Jan 20, 2026
52 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants