Automatic summarization of source code using a neural network, based on the Universal Transformer architecture.
Check out the live demo!
The Transformer architecture for sequence-to-sequence modeling is composed of an Encoder and a Decoder. Each consists of a stack of layers, and each layer has a self-attention block and a feed-forward block. The Decoder layers additionally have an encoder-decoder attention block, which attends to the processed input as well as the currently generated output.
The Universal Transformer architecture shares a single layer's weights across the entire Encoder, and likewise across the Decoder. This reduces the size of the model and improves accuracy on many tasks, including algorithmic ones (e.g. interpreting source code).
I implemented this project using TensorFlow.
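As a rough illustration of the weight sharing, here is a minimal Keras sketch (not the code in this repository; the hyperparameters and layer structure are assumptions):

```python
import tensorflow as tf

class UniversalEncoder(tf.keras.layers.Layer):
    """Applies one shared encoder layer repeatedly, instead of a stack of distinct layers."""

    def __init__(self, num_steps=4, d_model=256, num_heads=4, dff=1024):
        super().__init__()
        self.num_steps = num_steps
        # A single self-attention block and feed-forward block, reused at every step.
        self.attention = tf.keras.layers.MultiHeadAttention(
            num_heads=num_heads, key_dim=d_model // num_heads)
        self.ffn = tf.keras.Sequential([
            tf.keras.layers.Dense(dff, activation="relu"),
            tf.keras.layers.Dense(d_model),
        ])
        self.norm1 = tf.keras.layers.LayerNormalization()
        self.norm2 = tf.keras.layers.LayerNormalization()

    def call(self, x, mask=None):
        # The same weights are applied num_steps times, unlike a vanilla Transformer,
        # which allocates fresh weights for every layer in the stack.
        for _ in range(self.num_steps):
            attn = self.attention(query=x, value=x, key=x, attention_mask=mask)
            x = self.norm1(x + attn)
            x = self.norm2(x + self.ffn(x))
        return x
```

Because the same block is applied at every step, the parameter count is independent of the recurrence depth `num_steps`.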
- Download the dataset and create the SentencePiece tokenizers
- Run the training script `train.py`, providing the parameters `num_epochs`, `model_path` (path to the model, which contains a `transformer_description.json` file with the necessary attributes), and `dataset_path` (ordinarily `data/leclair_java`); an example invocation is sketched after this list
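For example, a training run might look like the following; treat the exact flag syntax, epoch count, and model path as assumptions, since they depend on `train.py`'s argument parser:

```bash
python train.py --num_epochs 20 --model_path models/java_summ_ut_4 --dataset_path data/leclair_java
```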
If you trained a model yourself, you can run an interactive demo by running `translation_transformer.py` with the arguments `model_path` and `dataset_path`.
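For example (again, the exact argument syntax and paths are assumptions):

```bash
python translation_transformer.py --model_path models/java_summ_ut_4 --dataset_path data/leclair_java
```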
There is a pretrained model in the TFLite format at `models/java_summ_ut_4.tflite`, which you can try by running `server.py`. Alternatively, serve it with Gunicorn via `gunicorn server:code_summarization_server`.
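If you just want to poke at the pretrained TFLite model directly, the standard `tf.lite.Interpreter` API works; the exact input/output tensor shapes depend on how the model was exported, so the sketch below only loads the model and prints them:

```python
import tensorflow as tf

# Load the bundled pretrained model and inspect its signature.
interpreter = tf.lite.Interpreter(model_path="models/java_summ_ut_4.tflite")
interpreter.allocate_tensors()

print(interpreter.get_input_details())   # shapes/dtypes of the expected input tensors
print(interpreter.get_output_details())  # shapes/dtypes of the produced output tensors
```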