Refactors trace operations to be more self-contained, separates front… #486

Merged
merged 1 commit into from
Feb 3, 2025


pranavm-nvidia
Collaborator

…end/trace tensors more cleanly

  • Refactors Trace operation so that it reports how many outputs it generates instead of requiring the caller to know.

    The trace op is now also responsible for creating its own output trace
    tensors. Additionally, `build`/`build_internal` have been removed, meaning
    the trace does *not* create frontend tensors anymore.

    Frontend tensors no longer create trace tensors directly but instead only
    interface with ops and wrap their outputs as needed.

  • Consolidates and renames some frontend Tensor constructors to better reflect their purpose. For example, `create_directly` -> `fast_init`.

  • Temporarily removes the "how to add ops" guide. A new version of this will be written once we have switched to the TRT dialect, which will significantly affect how ops are added.
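
For readers unfamiliar with the split, here is a minimal sketch of the ownership model described in the first bullet. It is not the project's actual code: `TraceTensor`, `TraceOp`, `Split`, `get_num_outputs`, and `create_op` are all hypothetical names chosen for illustration, and the real classes almost certainly carry more state (shapes, dtypes, devices). It only shows the direction of the dependency: the op reports its output count and builds its own trace tensors, and the frontend merely wraps them.

```python
class TraceTensor:
    """Hypothetical trace-level tensor: a name plus the op that produced it."""

    def __init__(self, name, producer):
        self.name = name
        self.producer = producer


class TraceOp:
    """Hypothetical base op that creates its own output trace tensors."""

    def __init__(self, inputs):
        self.inputs = list(inputs)
        # The op, not the caller, decides how many outputs exist.
        prefix = type(self).__name__.lower()
        self.outputs = [
            TraceTensor(f"{prefix}_out{i}", producer=self)
            for i in range(self.get_num_outputs())
        ]

    def get_num_outputs(self):
        # Assumed default for this sketch: a single output.
        return 1


class Split(TraceOp):
    """Hypothetical multi-output op; it only overrides the output count."""

    def get_num_outputs(self):
        return 2


class Tensor:
    """Hypothetical frontend tensor: a thin wrapper around a trace tensor."""

    def __init__(self, trace_tensor):
        self.trace_tensor = trace_tensor


def create_op(op_cls, inputs):
    """Build an op from frontend inputs and wrap each of its outputs.

    The frontend never constructs a TraceTensor here; it only talks to the op.
    """
    op = op_cls([t.trace_tensor for t in inputs])
    return [Tensor(out) for out in op.outputs]


if __name__ == "__main__":
    x = Tensor(TraceTensor("x", producer=None))
    a, b = create_op(Split, [x])
    print(a.trace_tensor.name, b.trace_tensor.name)
```

In this arrangement the caller unpacks however many frontend tensors the op returns, so adding a multi-output op does not require touching call sites that build it.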

@pranavm-nvidia pranavm-nvidia merged commit ecb8238 into main Feb 3, 2025
1 check passed
@pranavm-nvidia pranavm-nvidia deleted the trace-ops-refactor branch February 3, 2025 18:55
2 participants