Optimization of attention layers for efficient inference on the CPU and GPU. Covers optimizations for AVX and CUDA, as well as efficient memory-access techniques.
deep-learning
hpc
transformers
avx
avx2
cuda-kernels
attention-is-all-you-need
cuda-programming
openmp
Updated Dec 18, 2023 - C++
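The memory-access side of the description can be illustrated with a small sketch. This is not the repository's actual code; the function name and the tile size are assumptions. It computes the attention score matrix S = Q·Kᵀ with loop tiling, so a block of K rows stays resident in cache while every row of Q reuses it, rather than streaming all of K through cache once per Q row.

```cpp
// Illustrative sketch only (hypothetical helper, not from the repo):
// tiled computation of attention scores S = Q * K^T.
// Q and K are row-major [n x d]; S is row-major [n x n].
#include <algorithm>
#include <cstddef>

void attention_scores_tiled(const float* Q, const float* K, float* S,
                            std::size_t n, std::size_t d) {
    const std::size_t TILE = 32;  // tile edge; assumed value, tune to L1/L2 size

    // Iterate over tiles of K rows; each tile is reused by every Q row
    // before the next tile is loaded, improving cache locality.
    for (std::size_t jt = 0; jt < n; jt += TILE) {
        const std::size_t jend = std::min(jt + TILE, n);
        for (std::size_t i = 0; i < n; ++i) {
            for (std::size_t j = jt; j < jend; ++j) {
                float acc = 0.0f;
                // Dot product of Q row i with K row j (rows of K, since
                // Q * K^T pairs row i of Q with row j of K).
                for (std::size_t k = 0; k < d; ++k)
                    acc += Q[i * d + k] * K[j * d + k];
                S[i * n + j] = acc;
            }
        }
    }
}
```

The same loop structure is the usual starting point for the AVX and OpenMP variants the topics mention: the inner dot product vectorizes with AVX2 FMA intrinsics, and the outer row loop parallelizes with `#pragma omp parallel for`.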