xlite-dev

Develop ML/AI toolkits and ML/AI/CUDA learning resources.

Pinned

  1. LeetCUDA Public

    📚LeetCUDA: 200+ CUDA/Tensor Cores Kernels, HGEMM, FA-2 MMA.

    Cuda, 4.8k stars, 519 forks

  2. lite.ai.toolkit Public

    🛠 A lite C++ AI toolkit: 100+🎉 models with MNN, ORT and TRT.

    C++, 4.1k stars, 743 forks

  3. Awesome-LLM-Inference Public

    📚A curated list of Awesome LLM Inference Papers with Codes.

    Python, 4.1k stars, 287 forks

  4. torchlm Public

    💎An easy-to-use PyTorch library for face landmarks detection.

    Python, 260 stars, 24 forks

  5. Awesome-DiT-Inference Public

    📚A curated list of Awesome DiT Inference Papers with Codes.

    261 stars, 16 forks

  6. ffpa-attn Public

    📚FFPA: Extend FA-2 with Split-D for large headdim, 2x↑ vs SDPA.

    Cuda, 186 stars, 8 forks

Repositories

Showing 10 of 25 repositories