AFAICT, mRMRe estimates mutual information from the correlation between features, using the formula I(x,y) = −0.5 log(1 − cor(x,y)^2) (*) (this is why it expects numerical input), while praznik uses a direct MLE formula, I(x,y) = Σ p_xy log(p_xy / (p_x p_y)), summed over all value pairs of categorical variables (and this is why it expects factors). So these are basically different algorithms or, let's say, versions of mRMR using different interfaces to the data.
EDIT: (*) is 2.8 from here, and it holds only for normal x, y and bivariate normal xy.
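To make the difference concrete, here is a minimal Python sketch of the two estimators described above. It is not the code of either package: the function names, the equal-width binning, and the simulated data are all assumptions made for illustration, and the real mRMRe and praznik implementations may differ in detail.

```python
import math
import random
from collections import Counter


def pearson(xs, ys):
    """Sample Pearson correlation of two equal-length numeric sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def mi_from_correlation(xs, ys):
    """Correlation-based estimate: I = -0.5 * log(1 - r^2), in nats.

    Exact only when (x, y) is bivariate normal.
    """
    r = pearson(xs, ys)
    return -0.5 * math.log(1.0 - r * r)


def mi_plugin(xs, ys):
    """Plug-in (MLE) estimate for categorical data, in nats:
    sum over observed pairs of p_xy * log(p_xy / (p_x * p_y)).
    """
    n = len(xs)
    cxy, cx, cy = Counter(zip(xs, ys)), Counter(xs), Counter(ys)
    return sum(
        (c / n) * math.log(c * n / (cx[x] * cy[y]))
        for (x, y), c in cxy.items()
    )


def discretize(values, bins=10):
    """Equal-width binning (an assumed, illustrative way to get categories)."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    return [min(int((v - lo) / width), bins - 1) for v in values]


def sample_bivariate_normal(n, rho, seed=0):
    """Draw n pairs from a standard bivariate normal with correlation rho."""
    rng = random.Random(seed)
    xs = [rng.gauss(0.0, 1.0) for _ in range(n)]
    ys = [rho * x + math.sqrt(1.0 - rho * rho) * rng.gauss(0.0, 1.0) for x in xs]
    return xs, ys


if __name__ == "__main__":
    xs, ys = sample_bivariate_normal(20000, rho=0.8)
    print("correlation-based:", mi_from_correlation(xs, ys))
    print("plug-in on bins:  ", mi_plugin(discretize(xs), discretize(ys)))
```

On bivariate normal data the correlation-based value should land close to the theoretical −0.5 log(1 − ρ²), while the plug-in value depends on how the data were binned, so the two numbers are not directly comparable even on the same input.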
cross-ref https://notabug.org/mbq/praznik/issues/2
I don't think we can do anything about this but it is important to know.
Note that the absolute values are not of interest here, only the ranking of the features.
Created on 2019-06-19 by the reprex package (v0.3.0)
In addition, here is a runtime comparison for a dataset with ~ 7k features: