Skip to content

Conversation

@copybara-service
Copy link

[ragged-paged-attn] Implement static kv cache quantization. (The scale of kv cache is a scalar float value)

@copybara-service copybara-service bot force-pushed the test_748443665 branch 2 times, most recently from 6f68f4e to c284a2d Compare May 9, 2025 23:23
@copybara-service copybara-service bot force-pushed the test_748443665 branch 11 times, most recently from efd0260 to e642828 Compare May 23, 2025 21:33
…e of kv cache is a scalar float value)

PiperOrigin-RevId: 762576286
@copybara-service copybara-service bot merged commit 966bcb9 into main May 23, 2025
@copybara-service copybara-service bot deleted the test_748443665 branch May 23, 2025 21:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant