Does anyone have FP8 vs. FP16 benchmark results on H100? #7385
Unanswered
ajtejankar
asked this question in
Q&A
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I am benchmarking FP8 vs. FP16 on H100 and I don't see much of an improvement. The repro script (uses vllm's benchmarking code) and results are below. I am worried that I may be doing something weird so just want to confirm.
The results are:
Beta Was this translation helpful? Give feedback.
All reactions