You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am using a very large dataset in which some float columns have low cardinality (but it is not an integer column, think of float values with 90% missing data). When rows from these columns are sampled, sometimes in the sample the standard deviation is either 0 or NaN and the following code generates an error:
xpl = SmartExplainer()
xpl.compile(
contributions = myContributions,
x = myData,
model = myEstimator,
)
xpl.run_app(settings={'rows' : 1000})
The workaround of increasing the number of sampled rows to 10.000 or 20.000 simply crashes the app. Removing those columns from the data is not an option.
Python version : 3.9.7
Shapash version : 1.6.1
Operating System : Windows
The text was updated successfully, but these errors were encountered:
I have the same issue and want to know the sampling technique behind it. I am scared about random sample not depicting the actual distribution of the plot(s).
I am using a very large dataset in which some float columns have low cardinality (but it is not an integer column, think of float values with 90% missing data). When rows from these columns are sampled, sometimes in the sample the standard deviation is either 0 or NaN and the following code generates an error:
The workaround of increasing the number of sampled rows to 10.000 or 20.000 simply crashes the app. Removing those columns from the data is not an option.
Python version : 3.9.7
Shapash version : 1.6.1
Operating System : Windows
The text was updated successfully, but these errors were encountered: