Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Re-introduce cuda streams #550

Closed
wants to merge 5 commits into from
Closed

Re-introduce cuda streams #550

wants to merge 5 commits into from

Conversation

ngc92
Copy link
Contributor

@ngc92 ngc92 commented Jun 4, 2024

For now, this "just" adds a single stream, and moves as many operations there as possible
(everything except device->host memcpy, if I haven't missed anything)
to ensure that "local reasoning" is enough to understand any stream parallism and host-device asynchronicity, for now, every "main" function ends with a full device synchronization.
We may want to lift that restriction in the future, but we should be very careful about that.

@ngc92
Copy link
Contributor Author

ngc92 commented Jun 7, 2024

closing this in favour of #552

@ngc92 ngc92 closed this Jun 7, 2024
@ngc92 ngc92 deleted the stream-2 branch July 11, 2024 12:42
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant