You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
On the H100 there is a fully fused layer for an entire transformer block called TransformerLayer provided for PyTorch via TransformerEngine. How can I get this layer into TensorRt? Can it be exported via ONNX?
On the H100 there is a fully fused layer for an entire transformer block called TransformerLayer provided for PyTorch via TransformerEngine. How can I get this layer into TensorRt? Can it be exported via ONNX?