@pi314ever pi314ever commented Dec 13, 2025

Add docs for the #711 feature. I'm not sure where the best place for the docs is, so the placement is flexible.

This relies on the following upstream PRs being merged:

Prefill (CUDA) -> Decode (Gaudi): vllm-project/vllm#30275
Prefill (Gaudi) -> Decode (CUDA): vllm-project/vllm#30448

Copilot AI review requested due to automatic review settings December 13, 2025 00:53
@github-actions

🚧 CI Blocked

The main CI workflow was not started for the following reason:

This is a Draft PR. Please mark it as 'Ready for Review' to trigger the CI.

Copilot AI left a comment

Pull request overview

This PR adds documentation for the heterogeneous PD (Prefill-Decode) disaggregation feature, which allows splitting model execution across CUDA prefill nodes and Gaudi decode nodes. The documentation covers setup requirements, installation procedures, service configuration, and verification steps.

Key Changes:

  • Added comprehensive setup guide for CUDA+Gaudi multi-node systems
  • Documented installation of NIXL with UCX support
  • Provided launch configurations for prefill, decode, and proxy services

Signed-off-by: Daniel Huang <[email protected]>
@pi314ever pi314ever force-pushed the ucx-nixl-hetero-docs branch from c5511f6 to 6ff44c9 on January 6, 2026 00:17