Skip to content
@HKUNLP

HKU NLP Group

Pinned Loading

  1. efficient-attention Public

    [EVA ICLR'23; LARA ICML'22] Efficient attention mechanisms via control variates, random features, and importance sampling

    Python 82 4

  2. RSA Public

    Forked from chang-github-00/RSA

    Retrieved Sequence Augmentation for Protein Representation Learning

    Python 50 3

  3. icl-ceil Public

    [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.

    Python 98 11

  4. reparam-discrete-diffusion Public

    Reparameterized Discrete Diffusion Models for Text Generation

    Python 96 3

  5. ChunkLlama Public

    [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

    Python 394 19

  6. diffusion-of-thoughts Public

    [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

    Python 136 9

Repositories

Showing 10 of 24 repositories
  • hkunlp.github.io Public

    Website for HKU NLP group (under construction)

    JavaScript 10 MIT 6 0 0 Updated Mar 20, 2025
  • DiffuLLaMA Public

    [ICLR2025] DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models

    Python 118 9 5 0 Updated Mar 18, 2025
  • diffusion-of-thoughts Public

    [NeurIPS 2024] Code for the paper "Diffusion of Thoughts: Chain-of-Thought Reasoning in Diffusion Language Models"

    Python 136 9 1 0 Updated Mar 4, 2025
  • DiffuSearch Public

    [ICLR 2025] Code for the paper "Implicit Search via Discrete Diffusion: A Study on Chess"

    Python 21 Apache-2.0 1 0 0 Updated Mar 3, 2025
  • critic-rl Public

    Code for Paper: Teaching Language Models to Critique via Reinforcement Learning

    Python 84 Apache-2.0 4 0 0 Updated Feb 17, 2025
  • diffusion-vs-ar Public

    [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning"

    Python 45 Apache-2.0 3 0 0 Updated Feb 14, 2025
  • STRING Public

    [ICLR'25] Data and code for our paper "Why Does the Effective Context Length of LLMs Fall Short?"

    Python 70 MIT 3 2 1 Updated Nov 25, 2024
  • DensePolicy Public Forked from zhao-ht/DensePolicy
    SAS 0 2 0 0 Updated Oct 22, 2024
  • ChunkLlama Public

    [ICML'24] Data and code for our paper "Training-Free Long-Context Scaling of Large Language Models"

    Python 394 Apache-2.0 19 10 1 Updated Oct 16, 2024
  • GSM-Plus Public Forked from qtli/GSM-Plus

    GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.

    Python 2 6 0 0 Updated Jul 8, 2024

Most used topics

Loading…