You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The profilers seem not taking into account negative correlations to remove redundant features. Despite pandas-profiling does a good job detecting positive correlations on the dataset, negative correlations are not being detected. I do believe the reason lies the condition to remove a given variable assumes that the correlation is above a given positive threshold value.
To Reproduce
importnumpyasnpimportpandasaspdimportpandas_profiling# Let's generate a dataset where the first and second variables are negative correlated and the third variable is random.X=np.array([np.arange(100), np.arange(100)[::-1], np.random.randn(100)]).Tdf=pd.DataFrame(X, columns=["1", "2", "3"])
profile=df.profile_report()
profile.get_rejected_variables() # []
The text was updated successfully, but these errors were encountered:
Describe the bug
The profilers seem not taking into account negative correlations to remove redundant features. Despite pandas-profiling does a good job detecting positive correlations on the dataset, negative correlations are not being detected. I do believe the reason lies the condition to remove a given variable assumes that the correlation is above a given positive threshold value.
To Reproduce
The text was updated successfully, but these errors were encountered: