Skip to content
View DD-DuDa's full-sized avatar

Block or report DD-DuDa

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
DD-DuDa/README.md
  • 👋 Hi, I’m @DD-DuDa
  • 👀 I’m interested in Efficient Deep Learning Systems and Hardware Accelerators.
  • 🌱 I’m currently a PHD in the University of Edinburgh.
  • 📫 How to reach me: [email protected]

Pinned Loading

  1. BitDecoding BitDecoding Public

    A GPU-optimized system for efficient long-context LLMs decoding with low-bit KV cache.

    C++ 32

  2. BitDistiller BitDistiller Public

    [ACL 2024] A novel QAT with Self-Distillation framework to enhance ultra low-bit LLMs.

    Python 108 17

  3. Cute-Learning Cute-Learning Public

    Examples of CUDA implementations by Cutlass CuTe

    Makefile 153 18

  4. awesome-vit-quantization-acceleration awesome-vit-quantization-acceleration Public

    List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.

    78 4