RAKE

A Python implementation of the Rapid Automatic Keyword Extraction (RAKE) algorithm as described in: Rose, S., Engel, D., Cramer, N., & Cowley, W. (2010). Automatic Keyword Extraction from Individual Documents.

This is forked from aneesha/RAKE to support more features :

support python 3.0
support command line call
rearrange the codes to increase the readability

Usage

You can easily run it in cmd.

python rake.py [-h] [-o OUTPUTFILEPATH] [-m MINPHRASECHAR] [-a MAXPHRASELENGTH] inputFilePath stopwordsFilePath

positional arguments:

inputFilePath The file path of input document(s). One line represents a document.

stopwordsFilePath The file path of stopwords, each line represents a word.

optional arguments:

-h, --help show this help message and exit

-o OUTPUTFILEPATH, --outputFilePath OUTPUTFILEPATH The file path of output. (default output.txt in current dir).

-m MINPHRASECHAR, --minPhraseChar MINPHRASECHAR The minimum number of characters of a phrase.(default 1)

-a MAXPHRASELENGTH, --maxPhraseLength MAXPHRASELENGTH The maximum length of a phrase.(default 3)

Example

python rake.py example/input.txt example/stopwords.txt -o example/output.txt

License

MIT License

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

RAKE

Usage

Example

License

Files

README.md

Latest commit

History

README.md

File metadata and controls

RAKE

Usage

Example

License