Add rmsnorm kernel for T5 #89
Labels
benchmark
Measure, measure, measure
performance
make things faster, always
triton
triton kernel / implementation
Milestone
T5 uses
rmsnorm
instead oflayernorm
.They are both very similar, just there is no centering with the mean.
We may implement both normalization in the same kernel.
Paper: https://arxiv.org/pdf/1910.07467.pdf
HF implementation:
https://github.com/huggingface/transformers/blob/main/src/transformers/models/t5/modeling_t5.py#L239
Perf discussion:
pytorch/pytorch#72643
huggingface/transformers#14656
The text was updated successfully, but these errors were encountered: