A text summarisation algorithm and interface
To run the code, python main.py
. When you find an article or document to summarize, copy the text, paste it into the white box at the bottom of the interface, and press enter.
Inside TextBox.py
, you can set values associated with the colour scheme and font.
Inside Summarizer.py
are the algorithms for the text summarization.
When using the program, the following operations are available:
- update (button): after changing the coherence or independence params (but not length), press update to recalculate the summary
- less/more (button): use this to toggle between showing only the summary or showing the entire document with the summary highlighted
- length (slider): this changes how many sentences are extracted for the summary. When set to 0, the entire document is shown.
- coherence (slider): this changes the weight of coherence (i.e. how important it is for consecutive sentences in the summary to be related)
- independence (slider): this changes the weight of independence (i.e. how important it is to include sentences in the summary that introduce new content/do not depend on previous sentences for comprehension)
This project is written in Python2.7 with Tkinter for the GUI. It requires the following non-default libraries:
- nltk
- numpy, scipy
- ScrolledText