For the abstractive summarization task, I wanted to experiment with the transformer model. I recreated a transformer model (thanks to the TensorFlow transformer tutorial) and added a pointer module (have a look at this paper for more information on the pointer-generator network: https://arxiv.org/abs/1704.04368).
PS: I will soon add a section explaining the integration of the pointer module into the transformer.
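In the meantime, here is a minimal sketch of one way to graft a pointer module onto a transformer decoder, roughly following the paper above. All names (`PointerGenerator`, the argument names) and the choice to average the cross-attention weights over heads are my own illustration, not necessarily this repository's implementation:

```python
import tensorflow as tf

class PointerGenerator(tf.keras.layers.Layer):
    """Sketch of a pointer module for a transformer decoder (See et al., 2017).

    Mixes the decoder's vocabulary distribution with a copy distribution
    built from the decoder's cross-attention over the source tokens.
    """

    def __init__(self, vocab_size):
        super().__init__()
        self.vocab_size = vocab_size
        # p_gen in [0, 1]: probability of generating from the vocabulary
        # rather than copying from the source, computed per decoder step.
        self.p_gen_layer = tf.keras.layers.Dense(1, activation="sigmoid")

    def call(self, dec_output, context, attn_weights, vocab_dist, enc_input_ids):
        # dec_output:    (batch, dec_len, d_model) decoder hidden states
        # context:       (batch, dec_len, d_model) attention-weighted encoder outputs
        # attn_weights:  (batch, dec_len, enc_len) cross-attention of the last
        #                decoder layer, averaged over heads (an assumption)
        # vocab_dist:    (batch, dec_len, vocab_size) softmax over the vocabulary
        # enc_input_ids: (batch, enc_len) source token ids; for simplicity this
        #                sketch assumes they all fit in the fixed vocabulary
        #                (real OOV handling extends the distribution with
        #                temporary ids, as in the paper)
        p_gen = self.p_gen_layer(tf.concat([context, dec_output], axis=-1))

        # Scatter each source position's attention weight onto the vocabulary
        # id of its token, yielding a copy distribution over the vocabulary.
        one_hot_src = tf.one_hot(enc_input_ids, depth=self.vocab_size)
        copy_dist = tf.einsum("bde,bev->bdv", attn_weights, one_hot_src)

        # Final distribution: mixture of generating and copying.
        return p_gen * vocab_dist + (1.0 - p_gen) * copy_dist
```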
Please follow these steps to launch the project:
- Download the data (chunk files in TFRecord format): https://drive.google.com/open?id=1uHrMWd7Pbs_-DCl0eeMxePbxgmSce5LO
- Use this project to build the TFRecord chunks from the CNN/DailyMail bin files: https://github.com/steph1793/CNN-DailyMail-Bin-To-TFRecords (a sketch for reading these records appears after the notes below)
- Run the training script:

```
python main.py --max_enc_len=400 \
--max_dec_len=100 \
--batch_size=16 \
--vocab_size=50000 \
--num_layers=3 \
--model_depth=512 \
--num_heads=8 \
--dff=2048 \
--seed=123 \
--log_step_count_steps=1 \
--max_steps=230000 \
--mode=train \
--save_summary_steps=10000 \
--checkpoints_save_steps=10000 \
--model_dir=model_folder \
--data_dir=data_folder \
--vocab_path=vocab
```
PS: Feel free to change the hyperparameters.
Run `python main.py --help` for more details on the hyperparameters.
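For reference, here is a minimal sketch of reading the TFRecord chunks with `tf.data`. The feature keys (`article`, `abstract`) and the file layout are assumptions; check the conversion project linked above for the actual keys it writes:

```python
import glob
import tensorflow as tf

# Hypothetical feature keys; verify against the TFRecord conversion project.
feature_spec = {
    "article": tf.io.FixedLenFeature([], tf.string),
    "abstract": tf.io.FixedLenFeature([], tf.string),
}

def parse_example(serialized):
    # Decode one serialized tf.train.Example into its raw string features.
    return tf.io.parse_single_example(serialized, feature_spec)

files = glob.glob("data_folder/train/*.tfrecords")  # hypothetical layout
dataset = (tf.data.TFRecordDataset(files)
           .map(parse_example, num_parallel_calls=tf.data.experimental.AUTOTUNE)
           .batch(16))
```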
- python >= 3.6
- tensorflow 2.0.0
- argparse
- os
- glob
- numpy