Given a series of input text sequences and their corresponding output summaries, the task is to build, train, and evaluate a model that produces summaries.
The dataset is the open-source Amazon food reviews dataset, which is available on Kaggle. For convenience, we have also shared the data on Google Drive.
The code is available in Keras and PyTorch. The PyTorch code still needs improvement; our main code is in Keras.
In order to run the code, follow these steps:
- Clone the repository
- Download the data
- Create a free Google account (if you do not already have one)
- Upload the data and `attention.py` to your Google Drive (for the PyTorch code, only the data is needed)
- Open `summarization.py` (or any of the PyTorch files) and follow the Google Colab link at the top of the file
- Create a copy of the file (go to File, then Save a copy in Drive)
- Run the code
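
Once the notebook is open in Colab, it needs to find the data uploaded to Drive. The following is a minimal sketch of how that lookup might be done, not the code used in this repository. The helper name `resolve_data_path`, the file name `Reviews.csv`, and the assumption that the file sits at the top level of "My Drive" are all illustrative; adjust them to match your upload.

```python
from pathlib import Path


def resolve_data_path(filename, drive_root="/content/drive/MyDrive"):
    """Return the path to an uploaded data file (hypothetical helper).

    Inside Colab, this mounts Google Drive and points at the file under
    `drive_root`. Outside Colab, it falls back to the current directory.
    """
    try:
        # google.colab is only importable inside a Colab runtime
        from google.colab import drive
    except ImportError:
        return Path(filename)

    # Prompts for authorization the first time it runs in a session
    drive.mount("/content/drive")
    return Path(drive_root) / filename


# Assumed file name; replace with the name of the file you uploaded
data_path = resolve_data_path("Reviews.csv")
print(data_path)
```

The returned path can then be passed to a CSV reader such as `pandas.read_csv`.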