Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Option to evenly split labels in classifier #44

Closed
smassung opened this issue Nov 16, 2014 · 1 comment
Closed

Option to evenly split labels in classifier #44

smassung opened this issue Nov 16, 2014 · 1 comment

Comments

@smassung
Copy link
Member

Also in a config option, something under [classifier], like even-split = true (false by default). Finds the label with the lowest number of documents and randomly truncates the rest to be that amount.

It should be split during classifier runtime (still index the whole corpus).

@smassung
Copy link
Member Author

fixed in 3aa8128

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant