Skip to content

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer

License

Notifications You must be signed in to change notification settings

tlc-pack/cutlass_fpA_intB_gemm

About

A standalone GEMM kernel for fp16 activation and quantized weight, extracted from FasterTransformer

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published