https://github.com/NVIDIA/TransformerEngine/pull/2731
NVIDIA/TransformerEngine#2731