ARN - A lightweight attention structure for speeding up Transformer

Kaixin-Wu-for-Open-Source/ARN

Introduction

This repository contains the implementation of "Speeding Up Transformer Decoding via an Attention Refinement Network" (COLING 2022, long paper) by Kaixin Wu, Yue Zhang, Bojie Hu, and Tong Zhang.

Data Preparation

Download the WMT14 En->De data to the ARN/ directory:

sh runs/prepare-wmt14-en2de.sh

Model Training

  • Transformer Baseline
sh runs/run-wmt14-en2de-baseline.sh
  • ARN
sh runs/run-wmt14-en2de-arn.sh
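
The steps above can also be chained into one run. The following is a minimal sketch, not part of the repository: it assumes the current directory is the ARN/ repository root, and the helper names `run_step` and `run_pipeline` are hypothetical. Only the three script paths come from this README. Training both the baseline and ARN back to back is a choice made here for illustration; drop either line if only one model is needed.

```shell
#!/bin/sh
# Sketch: run data preparation and both training scripts in order.
# Assumption: invoked from the ARN/ repository root.

# Hypothetical helper: run one script, failing early if it is missing.
run_step() {
    if [ ! -f "$1" ]; then
        echo "missing script: $1" >&2
        return 1
    fi
    echo "running $1"
    sh "$1"
}

# Hypothetical helper: the && chain stops at the first step that fails,
# so training never starts on unprepared data.
run_pipeline() {
    run_step runs/prepare-wmt14-en2de.sh &&
    run_step runs/run-wmt14-en2de-baseline.sh &&
    run_step runs/run-wmt14-en2de-arn.sh
}
```

Calling `run_pipeline` after these definitions starts the full sequence. A nonzero exit status means one of the steps failed or its script was not found.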
