This can happen because GEMM takes almost all of the time under FP32.
In that case, small noise in the GEMM timing can visibly affect the measured latency. In your case, the relative difference in latency is about 2%, which may just be noise. For such cases, FT and PyTorch should have similar latency.
We don't suggest using FP32 for transformer models because FP16 brings a large speedup without an accuracy drop.
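To separate measurement noise from the real FP32/FP16 gap, a minimal PyTorch timing sketch is shown below: it times a single large GEMM in FP32 and FP16 with warmup and CUDA synchronization. The matrix shapes and iteration counts are arbitrary placeholders, not the configuration from this issue.

```python
# Minimal sketch: time one GEMM in FP32 vs FP16 on the GPU.
# Shapes and iteration counts are placeholders, not the ones from this issue.
import torch

def time_gemm(dtype, m=4096, n=4096, k=4096, iters=50):
    a = torch.randn(m, k, device="cuda", dtype=dtype)
    b = torch.randn(k, n, device="cuda", dtype=dtype)
    # Warmup so cuBLAS setup and CUDA context creation don't skew the timing.
    for _ in range(10):
        torch.matmul(a, b)
    torch.cuda.synchronize()
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        torch.matmul(a, b)
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters  # milliseconds per GEMM

if __name__ == "__main__":
    fp32_ms = time_gemm(torch.float32)
    fp16_ms = time_gemm(torch.float16)
    print(f"FP32: {fp32_ms:.3f} ms  FP16: {fp16_ms:.3f} ms  "
          f"speedup: {fp32_ms / fp16_ms:.2f}x")
```

On a GPU with Tensor Cores, the FP16 GEMM is typically several times faster than FP32, while run-to-run variation on the order of a couple of percent is normal, which is why a ~2% FP32 difference between FT and PyTorch is hard to distinguish from noise.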
@byshiue I found that I hadn't run the GEMM search, so the default GEMM algorithm was used. Would searching for the best algorithm give a further speedup? And can the GEMM info file be reused across different PCs with the same GPU model?
Sorry for the delayed reply. Searching for the best algorithm may improve the speed; it is case by case.
In general, the GEMM info file can be used on different devices with the same GPU model.
I am running ViT and got an unexpected result:
it's even slower than PyTorch.
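For comparisons like this, it helps to first confirm that the PyTorch baseline itself is measured with warmup and CUDA synchronization. The sketch below does that for a stand-in ViT from torchvision; vit_b_16, batch size 8, and 224x224 inputs are placeholder assumptions, since the actual model and batch size in this issue are not stated.

```python
# Sketch of a PyTorch ViT latency measurement with warmup and CUDA sync.
# vit_b_16, batch size 8, and 224x224 inputs are placeholder assumptions,
# not the configuration used in this issue.
import torch
from torchvision.models import vit_b_16

def bench(model, x, iters=50):
    with torch.no_grad():
        for _ in range(10):  # warmup iterations
            model(x)
        torch.cuda.synchronize()
        start = torch.cuda.Event(enable_timing=True)
        end = torch.cuda.Event(enable_timing=True)
        start.record()
        for _ in range(iters):
            model(x)
        end.record()
        torch.cuda.synchronize()
    return start.elapsed_time(end) / iters  # ms per forward pass

if __name__ == "__main__":
    model = vit_b_16(weights=None).cuda().eval()
    x = torch.randn(8, 3, 224, 224, device="cuda")
    print(f"FP32: {bench(model, x):.2f} ms")
    print(f"FP16: {bench(model.half(), x.half()):.2f} ms")
```

If the FP32 numbers for FT and PyTorch measured this way are within a few percent of each other, that matches the earlier comment that the difference is likely noise; the FP16 path is where FT is expected to show a clear gain.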