Skip to content

fix: disable compile fast path on H20 runtimes#84

Open
txhno wants to merge 1 commit intoNVIDIA:mainfrom
txhno:fix/h20-disable-compile-cudagraph
Open

fix: disable compile fast path on H20 runtimes#84
txhno wants to merge 1 commit intoNVIDIA:mainfrom
txhno:fix/h20-disable-compile-cudagraph

Conversation

@txhno
Copy link
Copy Markdown

@txhno txhno commented Apr 9, 2026

Summary

  • detect NVIDIA H20 runtimes on older Torch versions before PersonaPlex initializes the generation fast path
  • disable torch.compile and CUDA graphs for that runtime combination to avoid the hard crash path
  • apply the guard in both moshi.server and moshi.offline

Testing

  • python3 -m py_compile moshi/moshi/utils/compile.py moshi/moshi/utils/runtime_compat.py moshi/moshi/server.py moshi/moshi/offline.py

Closes #51

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Floating Point 0 Error With H20

2 participants