Implemented TextRank #54

ygorg · 2018-10-11T17:11:49Z

textrank.py is the method described in the paper, along with its post-processing. the paper does not describe a candidate selection happening before the TextRank algorithm.
pketextrank.py is an implementation of the core method described in the paper. The candidate selection is the one used in pke to allow comparison between the weighting of the candidates.

textrank.py is the method described in the paper, along with its post-processing. the paper does not describe a candidate selection happening before the TextRank algorithm. pketextrank.py is an implementation of the core method described in the paper. The candidate selection is the one used in PKE to allow comparison between the weighting of the candidates.

Keeping only the comparable version

The candidate generation detailed in the paper is used when `top` parameter is used in candidate_weighting The graph creation is more accurate according to the paper

ygorg · 2018-10-16T07:48:50Z

The warning was added because for some algorithm returning n best candidates according to an other parameter is an approach by itself.
For example in TextRank the parameter for candidate generation is the number of term to keep in the graph (T), but if there is less than n candidate generated, changing the T parameter can be either done by slowly increasing T to increase the number of generated candidate, or always using 1, but the algorithm won't be as described in the TextRank algorithm.

ygorg · 2018-10-16T07:52:15Z

The candidate generation described in the paper is implemented in the candidate_weighting function because if they are generated and weighted in the get_n_best the weighting of the candidate will happen in two function depending on the parameters which does not comply to the use of the package.

ygorg added 6 commits October 11, 2018 19:09

Removed paper implementation

d05511d

Keeping only the comparable version

Renamed "pketextrank.py" to "textrank.py"

27605bb

Checked parameters used in paper

c9807a3

Implementation is more accurate

bb74fc1

The candidate generation detailed in the paper is used when `top` parameter is used in candidate_weighting The graph creation is more accurate according to the paper

Added a warning if less than n best candidates are returned

31f8a42

boudinfl merged commit 129c44f into boudinfl:python3 Oct 17, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implemented TextRank #54

Implemented TextRank #54

ygorg commented Oct 11, 2018

ygorg commented Oct 16, 2018

ygorg commented Oct 16, 2018

Implemented TextRank #54

Implemented TextRank #54

Conversation

ygorg commented Oct 11, 2018

ygorg commented Oct 16, 2018

ygorg commented Oct 16, 2018