Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missing "pmi_ngram.py" file #1

Closed
VincentDENGP opened this issue Sep 17, 2021 · 2 comments
Closed

Missing "pmi_ngram.py" file #1

VincentDENGP opened this issue Sep 17, 2021 · 2 comments

Comments

@VincentDENGP
Copy link

Hi, thanks for sharing the code. I'm trying to run your code but seems missing one file "pmi_ngram.py"? Does this file be supposed to implement by myself?

@happygirlzt
Copy link

Hi, thank you very much for sharing the code. I would appreciate it if you can share this pmi_ngram.py. Thanks. @shizhediao

@shizhediao
Copy link
Owner

shizhediao commented Mar 13, 2022

Hi, I have uploaded the pmi_ngram.py. Hope it helps.
This is an example script for extracting domain-specific words/ngrams and you may find more effective strategies to extract useful ngrams.
In addition, you could tune the hyper-parameters of min_pmi, min_count, ngram_freq_threshold to build a better domain-specific vocabulary list.
Sorry for my late reply!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants