Example of FP32 -> BF16 conversion in epilogue of GEMM #506

sanchitintel · 2025-09-12T00:42:04Z

Counterpart of examples/00_bmg_gemm/00_bmg_gemm.cpp with BF16 output instead of FP32.
A is BF16, B is BF16, C is FP32, and D is BF16.
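
For readers unfamiliar with what the epilogue conversion does numerically, here is a minimal host-side sketch in plain C++. It is not the CUTLASS code from this PR: the helper names `fp32_to_bf16`, `bf16_to_fp32`, and `gemm_bf16_out` are hypothetical, BF16 values are stored as raw `uint16_t` bit patterns, the row-major layout is an assumption, and round-to-nearest-even is assumed as the rounding mode. It accumulates in FP32, applies `alpha * acc + beta * C`, and only then narrows the result to BF16.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper: narrow an FP32 value to a BF16 bit pattern
// using round-to-nearest-even (assumed rounding mode).
inline std::uint16_t fp32_to_bf16(float x) {
  std::uint32_t bits;
  std::memcpy(&bits, &x, sizeof(bits));
  if ((bits & 0x7FFFFFFFu) > 0x7F800000u) {
    // NaN: truncate and force a quiet-NaN mantissa bit so it stays a NaN.
    return static_cast<std::uint16_t>((bits >> 16) | 0x0040u);
  }
  const std::uint32_t lsb = (bits >> 16) & 1u;  // lowest bit that survives
  bits += 0x7FFFu + lsb;                        // ties go to the even value
  return static_cast<std::uint16_t>(bits >> 16);
}

// Hypothetical helper: widen a BF16 bit pattern to FP32 (exact).
inline float bf16_to_fp32(std::uint16_t h) {
  const std::uint32_t bits = static_cast<std::uint32_t>(h) << 16;
  float x;
  std::memcpy(&x, &bits, sizeof(x));
  return x;
}

// Reference GEMM with a converting epilogue (assumed row-major layouts):
//   A: M x K (BF16), B: K x N (BF16), C: M x N (FP32), D: M x N (BF16)
//   D = bf16(alpha * (A * B) + beta * C), with FP32 accumulation.
inline std::vector<std::uint16_t> gemm_bf16_out(
    const std::vector<std::uint16_t>& A, const std::vector<std::uint16_t>& B,
    const std::vector<float>& C, int M, int N, int K, float alpha, float beta) {
  assert(A.size() == static_cast<std::size_t>(M) * K);
  assert(B.size() == static_cast<std::size_t>(K) * N);
  assert(C.size() == static_cast<std::size_t>(M) * N);
  std::vector<std::uint16_t> D(static_cast<std::size_t>(M) * N);
  for (int m = 0; m < M; ++m) {
    for (int n = 0; n < N; ++n) {
      float acc = 0.0f;  // mainloop: accumulate in FP32
      for (int k = 0; k < K; ++k) {
        acc += bf16_to_fp32(A[m * K + k]) * bf16_to_fp32(B[k * N + n]);
      }
      // Epilogue: linear combination in FP32, then narrow to BF16.
      D[m * N + n] = fp32_to_bf16(alpha * acc + beta * C[m * N + n]);
    }
  }
  return D;
}
```

Keeping the accumulator and the linear combination in FP32 and narrowing only at the final store means the result is rounded to BF16 once per output element rather than at every step.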

Apart from a few lines (some of which were adapted or copied from https://github.com/intel/cutlass-sycl/blob/e83f147263dd8ca3589b34d76ce6fbec58bac048/test/unit/gemm/device/default_gemm_group_configuration.hpp), the code is almost identical to examples/00_bmg_gemm/00_bmg_gemm.cpp, so ideally the two files should be combined. Please use a diff tool such as BeyondCompare to see the differences between them.

This PR adds no new functionality; it is only an example.

Add this example to the existing example file

Thanks!

Example of dtype conversion in GEMM epilogue

198c45b

sanchitintel mentioned this pull request Sep 12, 2025

Support fp32 accumulation for bf16 gemm and grouped gemm #482

Open

sanchitintel changed the title ~~Example of simple GEMM dtype conversion in GEMM epilogue~~ Example of dtype conversion in epilogue of GEMM Sep 12, 2025

sanchitintel changed the title ~~Example of dtype conversion in epilogue of GEMM~~ Example of FP32 -> BF16 conversion in epilogue of GEMM Sep 12, 2025
