Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

can not recurrent your experiment results with scripts/run_**.sh #2

Open
shaomai00 opened this issue Mar 13, 2018 · 0 comments
Open

Comments

@shaomai00
Copy link

I'm appreciate with your work, however, I can not recurrent your experiment results in your paper just with data downloaded in The Extreme Classification Repository and bash in scripts/run_**.sh.
For example:
I ran bash scripts/run_bibtex.sh and got this result:
...
Iter: 39 Gamma: 0.037723 Update Time: 13.024 seconds
Train score: 0.9502 0.6221 0.4287
Chunk # 0 1 2 3 Done
Test score w/ mu: 0.5642 0.3372 0.2422
Test score w/o mu: 0.5773 0.3408 0.2456 (0.134 seconds)
Epoch time=54.03
It seems that there is some overfitting.

I also run bash scripts/run_eurlex.sh and got this result:
...
Iter: 149 Gamma: 0.0233309 Update Time: 55.364 seconds
Train score: 0.7312 0.6302 0.5318
Test score w/ mu: 0.5832 0.4496 0.3606 (15.714 seconds)
Epoch time=1090.93
The test score also lower than scores which showed in you paper.
I'm wondering why should this happened? Is there something I forgot?

Just in case,I list my matlab code which I used to generate .mat data file here:
data = {};
[x, y] = read_data('dataset/RCV1-x/rcv1x_train.txt');
data.X_tr = x';
data.Y_tr = y';
[x1, y1] = read_data('dataset/RCV1-x/rcv1x_test.txt');
data.X_ts = x1';
data.Y_ts = y1';
save('dataset/RCV1-x/rcv.mat', 'data');

please help~ thank you very much^^

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant