Join GitHub today
GitHub is home to over 36 million developers working together to host and review code, manage projects, and build software together.
Sign upRReliefFix #919
Conversation
This comment has been minimized.
This comment has been minimized.
codecov-io
commented
Dec 11, 2015
Current coverage is
|
kernc
changed the title
RReliefFix
[wip] RReliefFix
Dec 11, 2015
kernc
changed the title
[wip] RReliefFix
RReliefFix
Dec 11, 2015
kernc
assigned
BlazZupan
Dec 11, 2015
kernc
force-pushed the
kernc:rrelieffix
branch
from
9f73aae
to
521fc0a
Dec 11, 2015
kernc
changed the title
RReliefFix
[WIP] RReliefFix
Dec 11, 2015
kernc
changed the title
[WIP] RReliefFix
RReliefFix
Dec 11, 2015
This comment has been minimized.
This comment has been minimized.
|
The same problem persist, nothing has changed (in terms of the ouput). Try xor.tab data set (a and b should be informative) and housing.tab (there, all the scores are still around 0). |
This comment has been minimized.
This comment has been minimized.
|
You need to |
This comment has been minimized.
This comment has been minimized.
|
while the scores for Orange3 with same parameters is too high for attributes other than a, b:
Also, the difference of scores is not large enough, considering that attributes c, d, e are all irrelevant. |
This comment has been minimized.
This comment has been minimized.
|
I checked the differences between this code and my old code, and I can't find anything that would explain the difference in results. My code follows Kononenko's "Strojno učenje" (the white book from 1997) and hence does not use weighting by distance. But removing this from your code (I removed I've now studied my old code enough that I guess I haven't missed any details. It takes care of a few subtleties (if n_iter is larger than data size, every row should be taken as the reference once ... etc), but none of them should play a role here. Could it be that Orange 2 was just lucky? Try increasing the number of reference examples (not neighbours) by one at a time and see how the results change. At a certain number, the scores suddenly multiply by ten (reaching values between 2 and 4 -- is that theoretically possible, given the normalizations?!) and then drop again. With such instability, it's not inconceivable that it's all just luck. Also, if you increase the number of examples in Orange 2 to, say, 50, results are comparable to what the new code gives, whereas in theory a larger number of reference examples should increase robustness. |
kernc
changed the title
RReliefFix
[WIP] RReliefFix
Dec 17, 2015
This comment has been minimized.
This comment has been minimized.
|
Jernej, I was just about to merge this request. Why is it in WIP now? |
This comment has been minimized.
This comment has been minimized.
|
Because it's not ready yet. Please postpone merging until tomorrow.
|
kernc
added some commits
Dec 17, 2015
kernc
force-pushed the
kernc:rrelieffix
branch
from
b8b28d7
to
9fd6257
Dec 17, 2015
kernc
added some commits
Dec 17, 2015
kernc
force-pushed the
kernc:rrelieffix
branch
from
9fd6257
to
b5bdbab
Dec 17, 2015
This comment has been minimized.
This comment has been minimized.
|
Yes, some features in the small XOR example get overvalued. But the order of features by importance matches that from Orange 2 reasonably well for all datasets I tried. Merge at convenience. For the brave journeymen, there is a block comment in the source. |
kernc commentedDec 11, 2015
No description provided.