Skip to content

Pinned Loading

  1. flashinfer Public

    FlashInfer: Kernel Library for LLM Serving

    Cuda 2.4k 247

  2. whl Public

    Pre-built wheels for flashinfer python package.

    HTML

Repositories

Showing 10 of 10 repositories
  • flashinfer Public

    FlashInfer: Kernel Library for LLM Serving

    Cuda 2,379 Apache-2.0 247 91 9 Updated Mar 14, 2025
  • flashinfer-nightly Public

    FlashInfer Nightly

    6 MIT 1 0 0 Updated Mar 13, 2025
  • flashinfer-ai.github.io Public

    Project website of FlashInfer project

    SCSS 0 4 1 0 Updated Mar 11, 2025
  • whl Public

    Pre-built wheels for flashinfer python package.

    HTML 0 0 0 0 Updated Mar 11, 2025
  • web-data Public
    0 Apache-2.0 0 0 0 Updated Mar 4, 2025
  • Jupyter Notebook 2 0 0 0 Updated Jan 10, 2025
  • debug-print Public

    Debug print operator for cudagraph debugging

    Cuda 10 0 0 0 Updated Aug 2, 2024
  • llvm-project Public Forked from llvm/llvm-project

    The LLVM Project is a collection of modular and reusable compiler and toolchain technologies.

    0 13,187 0 0 Updated Apr 21, 2024
  • candle Public Forked from huggingface/candle

    Minimalist ML framework for Rust

    Rust 0 Apache-2.0 1,075 0 0 Updated Mar 7, 2024
  • tg4perfetto Public Forked from ihavnoid/tg4perfetto

    Simple python library for generating your own perfetto traces for your application. Can be used for both app instrumentation and custom trace generation (for your own purposes)

    Python 0 Apache-2.0 4 0 0 Updated Aug 1, 2022