📄 DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression

Official repository for DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression
🔧 Enhancing prompt efficiency through dynamic attention-aware compression
📄 Accepted at ACL 2025
🌐 arXiv Paper: https://arxiv.org/abs/2507.11942 | 💾 Code on GitHub: https://github.com/QQQ-yi/DAC

✨ Overview

We introduce DAC, a dynamic attention-aware method for task-agnostic prompt compression. DAC jointly models attention weights and dynamic entropy changes to iteratively preserve the most informative tokens, effectively reducing input length and improving LLM efficiency and performance.

📄 Paper Link: https://arxiv.org/abs/2507.11942
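
To make the idea of fusing attention with entropy and pruning iteratively more concrete, here is a minimal, self-contained sketch. It is not the code in compressor.py: the scoring function `toy_scores` is a hypothetical stand-in (token length as "attention", first occurrence as "entropy"), and the helper names `min_max`, `additive_fusion`, and `iterative_compress` are assumptions made for illustration. In the real method these signals would come from a language model and be recomputed as tokens are removed.

```python
def min_max(values):
    """Scale values to [0, 1]; a constant list maps to all zeros."""
    lo, hi = min(values), max(values)
    if hi == lo:
        return [0.0] * len(values)
    return [(v - lo) / (hi - lo) for v in values]


def additive_fusion(attention, entropy, alpha=0.8):
    """Weighted sum of normalized attention and entropy scores."""
    attn, ent = min_max(attention), min_max(entropy)
    return [alpha * a + (1.0 - alpha) * e for a, e in zip(attn, ent)]


def toy_scores(tokens):
    """Hypothetical stand-in for model signals (NOT the real scoring).

    Token length plays the role of attention; a token scores 1.0 on
    "entropy" the first time it appears and 0.0 when it repeats.
    """
    seen = set()
    attention, entropy = [], []
    for tok in tokens:
        attention.append(float(len(tok)))
        entropy.append(0.0 if tok in seen else 1.0)
        seen.add(tok)
    return additive_fusion(attention, entropy)


def iterative_compress(tokens, compress_ratio, dyn_time, score_fn=toy_scores):
    """Remove tokens over `dyn_time` rounds, re-scoring survivors each round."""
    n = len(tokens)
    target = max(1, round(n * (1.0 - compress_ratio)))
    kept = list(range(n))  # indices into the original sequence
    for step in range(1, dyn_time + 1):
        # Shrink the budget linearly from n down to the target size.
        budget = n - round((n - target) * step / dyn_time)
        if budget >= len(kept):
            continue
        # Re-score only the surviving tokens, since their context has changed.
        scores = score_fn([tokens[i] for i in kept])
        top = sorted(range(len(kept)), key=lambda j: scores[j], reverse=True)[:budget]
        kept = [kept[j] for j in sorted(top)]  # restore original order
    return [tokens[i] for i in kept]


tokens = "the quick brown fox jumps over the lazy dog".split()
compressed = iterative_compress(tokens, compress_ratio=2 / 3, dyn_time=3)
print(compressed)
```

Re-scoring inside the loop is the point of the sketch: a token's score can change once its neighbours are gone, so a single ranking pass over the full input would not capture that.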

🖼️ Framework

🛠️ 2. Environment Setup

This project uses conda to manage the Python environment and pip to install dependencies. You can reproduce the environment with the provided requirements.txt.

# Clone the repository
git clone https://github.com/QQQ-yi/DAC.git
cd DAC

# Create and activate the conda environment
conda create -n dac python=3.10
conda activate dac
pip install -r requirements.txt

▶️ 3. How to Run

🚀 Quick Start

from compressor import PromptCompressor

# Initialize the compressor (supports Hugging Face models)
model_name = "Qwen/Qwen2-0.5B-Instruct"
compressor = PromptCompressor(model_name)

# Long input context (e.g., retrieved documents, conversation history)
context = """
Artificial intelligence is a branch of computer science aimed at creating systems capable of performing tasks that typically require human intelligence...
"""

# Perform compression
result = compressor.compress(
    context=context,
    compress_ratio=0.9,                     # Keep only 10% of tokens (10x compression)
    method="dynamic_attn_ppl",              # Compression method
    fusion="additive",                      # Fusion strategy
    alpha=0.8,                              # Attention weight in additive fusion
    dyn_time=10,                            # Number of dynamic iterations
    preserve_punct=False,                   # Whether to preserve punctuation and special tokens
    return_info=True                        # Return detailed info
)

# Output results
print("Compressed text:", result["compressed_text"])
print("Original tokens:", result["original_tokens"])
print("Compressed tokens:", result["compressed_tokens"])
print("Actual compression ratio:", result["actual_ratio"])
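
As a sanity check on the numbers printed above, the arithmetic behind `compress_ratio` can be worked out by hand. The sketch below assumes, following the inline comment in the Quick Start, that `compress_ratio` is the fraction of tokens removed. The helper names `kept_token_budget` and `compression_factor` are hypothetical and not part of compressor.py, and the library's own rounding and its definition of `actual_ratio` may differ.

```python
def kept_token_budget(original_tokens, compress_ratio):
    """Approximate number of tokens left after removing `compress_ratio` of them."""
    if not 0.0 <= compress_ratio < 1.0:
        raise ValueError("compress_ratio must be in [0, 1)")
    return max(1, round(original_tokens * (1.0 - compress_ratio)))


def compression_factor(original_tokens, compressed_tokens):
    """How many times shorter the compressed prompt is (e.g. 10.0 for 10x)."""
    return original_tokens / compressed_tokens


budget = kept_token_budget(2000, 0.9)
factor = compression_factor(2000, budget)
print(budget, factor)
```

If the reported token counts differ noticeably from this estimate, options such as `preserve_punct` are one plausible cause, since tokens that are always kept reduce the room for removal.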

📚 4. Citation

@misc{zhao2025dac,
      title   = {DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression}, 
      author  = {Yi Zhao and Zuchao Li and Hai Zhao and Baoyuan Qi and Guoming Liu},
      year    = {2025},
      eprint  = {2507.11942},
      archivePrefix = {arXiv},
      primaryClass  = {cs.CL},
      url     = {https://arxiv.org/abs/2507.11942}
}
