Skip to content

Implementation of the algorithm detailed in ElKishky, Ahmed, et al. "Scalable topical phrase mining from text corpora."

License

Notifications You must be signed in to change notification settings

anirudyd/topmine

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

This is an implementation of the algorithm detailed in: El-Kishky, Ahmed, et al. "Scalable topical phrase mining from text corpora." Proceedings of the VLDB Endowment 8.3 (2014): 305-316.APA

In order to run the code, simply follow these steps:

  • Put the file on which you want to run topmine in the folder named “input”
  • Open topmine.py and change the variable named “file_name” to point at the correct file in the “input” folder.
  • Change the “num_topic” variable to the desired value (default value is 4).
  • Other configuration changes can be made by changing the variable values in files “run_phrase_mining.py” and “run_phrase_lda.py”.
  • Run the command “python topmine.py”
  • The results should be available after the execution in the “output” folder.

About

Implementation of the algorithm detailed in ElKishky, Ahmed, et al. "Scalable topical phrase mining from text corpora."

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages