Add quantization support for PagedAttention TPU Pallas kernel. #21152

copybara-service · 2024-05-09T21:46:55Z

Add quantization support for PagedAttention TPU Pallas kernel.

PiperOrigin-RevId: 634914369

copybara-service bot force-pushed the test_623966992 branch 2 times, most recently from dd1ca1f to e568161 on May 10, 2024 16:43

copybara-service bot force-pushed the test_623966992 branch 6 times, most recently from d402d34 to b4c1efd on May 17, 2024 23:01

Commit 1043e24: Add quantization support for PagedAttention TPU Pallas kernel.

copybara-service bot force-pushed the test_623966992 branch from b4c1efd to 1043e24 on May 17, 2024 23:17

copybara-service bot merged commit 1043e24 into main on May 17, 2024
1 of 2 checks passed

copybara-service bot deleted the test_623966992 branch on May 17, 2024 23:17
