DD-DuDa/README.md
  • 👋 Hi, I’m @DD-DuDa
  • 👀 I’m interested in Efficient Deep Learning Systems and Hardware Accelerators.
  • 🌱 I’m currently a PhD student at the University of Edinburgh.
  • 📫 How to reach me: duda200054@gmail.com

Pinned

  1. BitDecoding Public

    A GPU-optimized system for efficient long-context LLM decoding with a low-bit KV cache.

    C++ 25

  2. BitDistiller Public

    [ACL 2024] A novel QAT framework with self-distillation to enhance ultra-low-bit LLMs.

    Python 108 16

  3. Cute-Learning Public

    Examples of CUDA implementations using CUTLASS CuTe.

    Makefile 149 17

  4. awesome-vit-quantization-acceleration Public

    List of papers on Vision Transformer quantization and hardware acceleration from recent AI conferences and journals.

    78 4