lightning-indexer

Here is 1 public repository matching this topic...

RightNow-AI / StreamIndex

Memory-bounded compressed sparse attention via streaming top-k. Triton kernels for the DeepSeek-V4 lightning indexer. 32x regime extension on a single H200 | by RightNow https://www.rightnowai.co/

cuda triton attention sparse-attention long-context deepseek deepseek-v4 compressed-sparse-attention lightning-indexer

Updated May 5, 2026
Python

Improve this page

Add a description, image, and links to the lightning-indexer topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the lightning-indexer topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lightning-indexer

Here is 1 public repository matching this topic...

RightNow-AI / StreamIndex

Improve this page

Add this topic to your repo