Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some confusion about use dump.cc to dump a readable model #8

Closed
Lisz1024 opened this issue Aug 22, 2016 · 3 comments
Closed

Some confusion about use dump.cc to dump a readable model #8

Lisz1024 opened this issue Aug 22, 2016 · 3 comments

Comments

@Lisz1024
Copy link

I used the agaricus data to test dump.cc , and have dumped a readable model.
But the model's keys are very large such as 602879701896396800 , the train&test data's feature index only 0 to 120+ .
So I confusing about what key's mean in the readable model? How can I match it with feature'index.

@CNevd
Copy link
Owner

CNevd commented Aug 22, 2016

the original id is reversed here in src/base/localizer.h, try use --max_key=xxx in the command if you know the max key of your data where xxx is a little greater than max key
eg:
max key is 125
use:

../../dmlc-core/tracker/dmlc_local.py -n 1 -s 1 build/difacto.dmlc guide/demo.conf --max_key=126

then it'll keep the original ids as the key

@Lisz1024
Copy link
Author

@CNevd thank you very much.
it can work by set max_key use

../../dmlc-core/tracker/dmlc_local.py -n 1 -s 1 build/difacto.dmlc guide/demo.conf -max_key 126

In addition, I try to write a inverse function for ReverseBytes in dump.cc
It can work even if don't know max_key.

@CNevd
Copy link
Owner

CNevd commented Aug 26, 2016

@Lisz1024 great job!

This issue was closed.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants