Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support non p2p configuration when initializing the comms #4543

Merged
merged 11 commits into from
Jul 31, 2024

Conversation

jnke2016
Copy link
Contributor

@jnke2016 jnke2016 commented Jul 18, 2024

closes #4490

@jnke2016 jnke2016 requested a review from a team as a code owner July 18, 2024 13:02
@ChuckHastings ChuckHastings added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change labels Jul 18, 2024
@@ -52,6 +52,21 @@ def dask_client():
stop_dask_client(dask_client, dask_cluster)


# FIXME: Add tests leveraging this fixture
@pytest.fixture(scope="module")
def dask_client_non_p2p():
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Do we know if we'd like to run tests that don't create UCX endpoints by hardcoding a different fixture, or would it be better to pass a CLI option, or something else? I think it's okay to add this fixture now if we're going to use it, but I wonder if we should think of an alternative if something like a CLI option to pytest would be better.

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we should eliminate the p2p feature entirely. IIRC, the only reason we required the P2P to be configured was because at the time we initially implemented it was required to properly set up the NCCL communication that the C++ library uses. NCCL has been improved so that the P2P feature is no longer required to make this configuration. Joseph's changes here and his testing have validated that belief.

I'd be inclined to merge this PR and create a new issue to address dropping the P2P configuration option and convert all of the tests to configure with the non-P2P configuration.

@ChuckHastings ChuckHastings added this to the 24.08 milestone Jul 30, 2024
@rlratzel
Copy link
Contributor

/merge

@rapids-bot rapids-bot bot merged commit c941748 into rapidsai:branch-24.08 Jul 31, 2024
131 of 137 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
improvement Improvement / enhancement to an existing function non-breaking Non-breaking change python
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[FEA]: Support Cugraph dask algorithms without creating ucx endpoints.
4 participants