Add vLLM NIXL PD smoke compatibility by AAbouzeid · Pull Request #313 · ovg-project/kvcached

AAbouzeid · 2026-04-21T23:24:54Z

Summary

add an eager vLLM NixlConnector compatibility patch for kvcached PD disaggregation
align NIXL registration with kvcached KV tensor block counts and keep kvcached NIXL on the NHD path
add a hard guard for KVCACHED_CONTIGUOUS_LAYOUT=true with NIXL to avoid silent KV corruption
add a GPU smoke script that compares plain vLLM+NIXL against kvcached+NIXL

Validation

python -m pytest tests/test_vllm_nixl_compat.py -q
bash -n tools/run_vllm_nixl_pd_smoke.sh
pod smoke: INSTALL_VLLM=0 ./tools/run_vllm_nixl_pd_smoke.sh
pod semantic smoke prompts: arithmetic retrieval, ticket lookup, checksum extraction; each passed with and without kvcached

cui36 · 2026-04-25T01:52:01Z

/gemini review

gemini-code-assist

Code Review

This pull request implements monkey-patches for vLLM's NixlConnector to ensure compatibility with kvcached during PD disaggregation. The changes force an NHD layout and synchronize the physical block count in NixlConnectorWorker with kvcached's internal allocation. A review comment identifies that the current block count synchronization only handles over-allocation and suggests updating the logic to handle any mismatch to avoid assertion failures.

AAbouzeid · 2026-05-05T20:55:56Z

/gemini review

gemini-code-assist

Code Review

This pull request introduces patches for vLLM's NixlConnector to ensure compatibility with kvcached PD disaggregation. It addresses layout incompatibilities by forcing NHD and resolves block count assertion failures by dynamically updating the connector's block count using a new tracking variable. Review feedback suggests that using a module-level global variable for this state is brittle and may cause issues in multi-engine or multi-tenant scenarios, recommending a more explicit configuration management approach instead.

Patch vLLM's NixlConnector path for kvcached by using the NHD layout, reconciling registered KV block counts with kvcached tensors, and rejecting kvcached's contiguous layout for NIXL because vLLM registers per-layer K/V regions as block-contiguous memory. Add focused unit coverage and an end-to-end smoke script that compares plain vLLM+NIXL against kvcached+NIXL.

… attrs

gemini-code-assist Bot reviewed Apr 25, 2026

View reviewed changes

Comment thread kvcached/integration/vllm/autopatch.py Outdated

cui36 mentioned this pull request May 5, 2026

Roadmap/Feature requested/TODOs---Start Here for New Contributors #273

Open

28 tasks

gemini-code-assist Bot reviewed May 5, 2026

View reviewed changes

Comment thread kvcached/integration/vllm/interfaces.py Outdated

cui36 mentioned this pull request May 9, 2026

[Do Not Merge] P2pNcclConnector PD disaggregation #327

Open

cui36 requested a review from qinganrice May 22, 2026 21:43

qinganrice reviewed May 23, 2026

View reviewed changes

Comment thread kvcached/integration/vllm/autopatch.py Outdated

Comment thread kvcached/integration/vllm/autopatch.py Outdated

AAbouzeid force-pushed the fix/pd-disagg-nixl-connector-minimal branch from 53ba7ba to eae4a9f Compare May 24, 2026 19:53

AAbouzeid changed the title ~~Patch NixlConnector for kvcached PD disaggregation (closes #302)~~ Add vLLM NIXL PD smoke compatibility May 24, 2026

fix: harden NixlConnector kvcached patch for vLLM version and backend…

c5845ba

… attrs

cui36 merged commit 74887c9 into ovg-project:main Jun 1, 2026
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add vLLM NIXL PD smoke compatibility#313

Add vLLM NIXL PD smoke compatibility#313
cui36 merged 2 commits into
ovg-project:mainfrom
AAbouzeid:fix/pd-disagg-nixl-connector-minimal

AAbouzeid commented Apr 21, 2026 •

edited

Loading

Uh oh!

cui36 commented Apr 25, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

AAbouzeid commented May 5, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

AAbouzeid commented Apr 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Validation

Uh oh!

cui36 commented Apr 25, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

AAbouzeid commented May 5, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

AAbouzeid commented Apr 21, 2026 •

edited

Loading