v0.20.0 #5351
Pinned
nv-guomingz
announced in
Announcements
v0.20.0
#5351
Replies: 2 comments 6 replies
-
Hi All, Can we deploy llms in production using tensorrt without additional licenses? tensorrt-llm has tensorrt as a dependency. Kindly clarify |
Beta Was this translation helpful? Give feedback.
5 replies
-
TnesorRT-LLM can be used for production deployment without additional license constraints and it has already been used by lots of production customers. June |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
TensorRT-LLM Release 0.20.0
Key Features and Enhancements
examples/models/core/qwen/README.md
.examples/models/contrib/hyperclovax/README.md
examples/scaffolding/contrib/Dynasor/README.md
Infrastructure Changes
API Changes
Fixed Issues
Known Issues
What's Changed
enable_overlap_scheduler
by @kaiyux in fix: wrong argument nameenable_overlap_scheduler
#4433Llama-3_3-Nemotron-Super-49B-v1
integration-perf-tests (TRT flow, trtllm-bench) by @venkywonka in test(perf): Add someLlama-3_3-Nemotron-Super-49B-v1
integration-perf-tests (TRT flow, trtllm-bench) #4128Phi-4-mini-instruct
perf tests (test(perf): Add remainingPhi-4-mini-instruct
perf tests #4443) by @venkywonka in [cherry-pick] test(perf): Add remainingPhi-4-mini-instruct
perf tests (#4443) #4589Llama-3_1-Nemotron-Ultra-253B-v1
perf tests (cpp) #4446) by @venkywonka in [cherry-pick] test(perf): Add Llama-3_1-Nemotron-Ultra-253B-v1 perf tests (cpp) (#4446) #4590Llama-3_3-Nemotron-Super-49B-v1
integration-perf-tests (cpp) #4499) by @venkywonka in [cherry-pick] test(perf): Pt.2 Add Llama-3_3-Nemotron-Super-49B-v1 integration-perf-tests (cpp) (#4499) #4588New Contributors
Full Changelog: v0.20.0rc3...v0.20.0
This discussion was created from the release v0.20.0.
Beta Was this translation helpful? Give feedback.
All reactions