Skip to content
This repository was archived by the owner on Jun 6, 2021. It is now read-only.

adamchristiansen/text-summarizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

An extractive text summarizer

This is a project for the University of Lethbridge graduate level CPSC 5310 course, Studies in Computation Intelligence: Advanced Data Processing. The strategies implemented are based on the work of Gong and Liu.

Yihong Gong and Xin Liu. Generic text summarization using relevance measure and latent semantic analysis. In Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pages 19–25. ACM, 2001.

Running

The simple way to run the program is to open a shell and run:

$ python3 run.py

which is equivalent to:

$ python3 runp.py --prep --summarize --eval

All of the generated data can be removed with:

$ python3 run.py --clean

Dependencies

  • Python 3
    • numpy
    • matplotlib

Programs

The following programs are included in the root directory:

  • eval.py: evaluate summarization results.
  • lsa.py: perform a summarization using latent semantic analysis.
  • prep.py: preprocess the corpus into usable data.
  • rel.py: perform a summarization using the relevance measure.
  • run.py: run all of the other scripts in the correct order.

About

An extractive text summarizer

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages