This repository contains the replication package of the paper "Do Not Treat Code as Natural Language: Implications for Repository-Level Code Generation and Beyond".
Large language models for code (CodeLLMs) have demonstrated remarkable success in standalone code completion and generation, yet their effectiveness diminishes in repository-level settings where cross-file dependencies and structural context are essential. Existing Retrieval-Augmented Generation (RAG) approaches often borrow strategies from NLP, relying on chunking-based indexing and similarity-based retrieval that overlook structural relationships and miss functionally relevant dependencies.
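For intuition, the sketch below illustrates the NLP-style baseline pattern described above: fixed-size chunking of source files followed by similarity-based retrieval. It is a toy bag-of-words illustration only (real pipelines use learned embedding models) and is not code from this repository.

```python
# Toy illustration of chunking-based indexing + similarity retrieval
# (the NLP-borrowed baseline). Not Hydra's method.
import math
from collections import Counter


def chunk(source: str, chunk_size: int = 30) -> list[str]:
    """Split a file into fixed-size line windows, ignoring code structure."""
    lines = source.splitlines()
    return ["\n".join(lines[i:i + chunk_size]) for i in range(0, len(lines), chunk_size)]


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two bag-of-words vectors."""
    dot = sum(a[t] * b[t] for t in set(a) & set(b))
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def retrieve(query: str, chunks: list[str], k: int = 3) -> list[str]:
    """Rank chunks by lexical similarity to the unfinished function."""
    q = Counter(query.split())
    return sorted(chunks, key=lambda c: cosine(q, Counter(c.split())), reverse=True)[:k]
```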
We present Hydra, a repository-level code generation framework that treats code as a structured artifact rather than as natural language. Our approach introduces: (i) structure-aware indexing that preserves code structure and dependencies, (ii) a lightweight Dependency-Aware Retriever (DAR) that identifies true dependencies, and (iii) hybrid retrieval that combines dependency-aware and similarity-based methods.
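As a rough illustration of the third component, the sketch below fuses dependency-aware scores (as would be produced by a trained DAR over the structure-aware index) with similarity scores through a simple weighted sum. The function name, score dictionaries, and the `alpha` weight are hypothetical and do not reflect Hydra's exact fusion rule; see the paper and the guides linked below for the actual procedure.

```python
# Hypothetical sketch of hybrid retrieval: interpolate DAR dependency scores
# with similarity scores. The fusion rule and weight `alpha` are illustrative.
def hybrid_retrieve(
    dep_scores: dict[str, float],   # candidate snippet -> dependency-aware score
    sim_scores: dict[str, float],   # candidate snippet -> similarity score
    alpha: float = 0.5,             # assumed interpolation weight
    k: int = 5,
) -> list[str]:
    candidates = set(dep_scores) | set(sim_scores)
    fused = {
        c: alpha * dep_scores.get(c, 0.0) + (1 - alpha) * sim_scores.get(c, 0.0)
        for c in candidates
    }
    return sorted(fused, key=fused.get, reverse=True)[:k]
```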
Extensive experiments on the DevEval and RepoExec benchmarks show that Hydra achieves state-of-the-art performance, surpassing the strongest baseline by over 5% in Pass@1 and enabling smaller models to match larger ones.
- RQ1: How effective is Structure-Aware Indexing compared to Chunking-Based Indexing?
- RQ2: How do different retrieval approaches affect repository-level code generation performance?
- RQ3: How effective is Hydra compared to existing state-of-the-art approaches for repository-level code generation?
- RQ4: How does the computational cost of running Hydra compare to state-of-the-art approaches for repository-level code generation?
Create a new conda environment and install dependencies:
```bash
# Create conda environment
conda create -n hydra python=3.10.0
conda activate hydra

# Install required packages
pip install -r requirements.txt
```

Important: You must complete the following setup steps before running any experiments.
- Extract benchmark data:

  ```bash
  cd data
  unzip temp.zip

  # Extract RepoExec benchmark
  cd ../benchmark/RepoExec
  unzip test-apps.zip

  # Extract DevEval benchmark
  cd ../DevEval
  tar -xzf data.tar.gz
  wget https://huggingface.co/datasets/LJ0815/DevEval/resolve/main/Source_Code.tar.gz
  tar -xvzf Source_Code.tar.gz
  ```
- Prepare structured context (required for experiments):

  ```bash
  # For RepoExec benchmark
  bash src/context_formulation/structured_indexer/run.sh --dataset RepoExec

  # For DevEval benchmark
  bash src/context_formulation/structured_indexer/run.sh --dataset DevEval
  ```
Important: Before reproducing the experiments, you must first train the DAR (Dependency-Aware Retriever) model.
For detailed instructions and comprehensive guides, please refer to:
- Training.md - DAR (Dependency-Aware Retriever) training guide, including:
  - Dataset construction methodology
  - Model architecture and training procedures
- Reproduce.md - Complete experimental reproduction guide, including:
  - Benchmark setup and data preparation
  - Research question reproduction (RQ1-RQ4)
  - Code generation pipeline
  - Evaluation and metrics calculation