Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Can features be discarded by the classifier? #58

Closed
PKholyavin opened this issue Aug 23, 2020 · 1 comment
Closed

Can features be discarded by the classifier? #58

PKholyavin opened this issue Aug 23, 2020 · 1 comment

Comments

@PKholyavin
Copy link

Hello!
I am using crfsuite to train models on my own datasets, and I am testing different sets of features (I have a lot of them). However, some of those features seem to have no effect on classification results: e.g. first I use set of features A and get an F1 = X, and then I use set A + B and get the same results, and this repeats on every train and test set I have (if it is any help, my data is various acoustic features of speech in two languages). My question is: is this normal, or is there a possibility that some of these features are somehow discarded by the model?
Thank you in advance!

@PKholyavin
Copy link
Author

PKholyavin commented Aug 23, 2020

Update: I have checked attributes_, and these features are indeed not used. What could be the reason?
Update 2: found out I had NaN values in those sets. Apparently if there are any these features are discarded with no warning...

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant