Skip to content
@HKUNLP

HKU NLP Group

Pinned Loading

  1. efficient-attention Public

    [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling

    Python 82 3

  2. RSA Public

    Forked from chang-github-00/RSA

    Retrieved Sequence Augmentation for Protein Representation Learning

    Python 51 3

  3. reparam-discrete-diffusion Public

    Reparameterized Discrete Diffusion Models for Text Generation

    Python 97 3

  4. ChunkLlama Public

    [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

    Python 402 20

  5. diffusion-of-thoughts Public

    [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

    Python 142 10

  6. Dream Public

    Dream 7B, a large diffusion language model

    Python 522 20

Repositories

Showing 10 of 25 repositories
  • Dream Public

    Dream 7B, a large diffusion language model

    Python 522 20 14 2 Updated Apr 11, 2025
  • critic-rl Public

    Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

    Python 90 Apache-2.0 6 0 0 Updated Apr 10, 2025
  • hkunlp.github.io Public

    Website for HKU NLP group (under construction)

    JavaScript 13 MIT 8 0 0 Updated Apr 3, 2025
  • DiffuLLaMA Public

    [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

    Python 148 11 4 0 Updated Mar 18, 2025
  • diffusion-of-thoughts Public

    [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

    Python 142 10 1 0 Updated Mar 4, 2025
  • DiffuSearch Public

    [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"

    Python 24 Apache-2.0 1 0 0 Updated Mar 3, 2025
  • diffusion-vs-ar Public

    [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"

    Python 47 Apache-2.0 3 0 0 Updated Feb 14, 2025
  • STRING Public

    [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

    Python 72 MIT 3 3 1 Updated Nov 25, 2024
  • DensePolicy Public Forked from zhao-ht/DensePolicy
    SAS 0 2 0 0 Updated Oct 22, 2024
  • ChunkLlama Public

    [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

    Python 402 Apache-2.0 20 11 1 Updated Oct 16, 2024

Top languages

Loading…

Most used topics

Loading…