Group, init_score, and sample_weight cannot be type Dask DataFrame #4375
Comments
Thanks for writing this up! I'm adding the literal text of the error message here, so people will be able to find this issue from search engines if they encounter it.
This issue has been automatically locked since there has not been any recent activity since it was closed. To start a new related discussion, open a new issue at https://github.com/microsoft/LightGBM/issues including a reference to this.
Description
The `group`, `sample_weight`, and `init_score` Dask estimator parameters cannot be Dask DataFrames. They are currently described as "Dask Array, Dask DataFrame, Dask Series..." (e.g. here). Serial LightGBM estimators do not accept DataFrame values for these parameters (they should be array-like and not 2-D), which means that the distributed versions of these parameters also cannot be 2-D.
In `python-package/lightgbm/dask.py`, the typing of these arguments needs to reflect this by adding a `_DaskArrayLike` constant equal to `Union[dask_Array, dask_Series]` and then changing the affected docstrings to remove the mention that they can be of "Dask DataFrame" type.
Reproducible example
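What follows is not the reporter's reproducer, only a minimal sketch of the typing change proposed in the description. It assumes stand-in classes for `dask_Array`, `dask_Series`, and `dask_DataFrame` so that it runs without Dask installed, and the helper `check_vector_like` is hypothetical and not part of LightGBM.

```python
from typing import Union, get_args


# Stand-ins so the sketch runs without Dask installed (assumption).
# In python-package/lightgbm/dask.py these names would refer to the
# real Dask Array, Series, and DataFrame classes.
class dask_Array:
    pass


class dask_Series:
    pass


class dask_DataFrame:
    pass


# Alias along the lines proposed in the description: 1-D collections only,
# so Dask DataFrame is deliberately left out.
_DaskArrayLike = Union[dask_Array, dask_Series]


def check_vector_like(value, name):
    """Hypothetical helper: accept None or a member of the alias, else raise."""
    allowed = get_args(_DaskArrayLike)  # (dask_Array, dask_Series)
    if value is not None and not isinstance(value, allowed):
        raise TypeError(
            f"{name} must be a Dask Array or Dask Series, "
            f"got {type(value).__name__}"
        )
    return value
```

With such an alias, `group`, `sample_weight`, and `init_score` could be annotated as `Optional[_DaskArrayLike]`, and a DataFrame passed for any of them would fail early with a message naming the parameter.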
Environment info
LightGBM version or commit hash: 3.2.0
Command(s) you used to install LightGBM
Additional Comments
First raised here: #4101 (comment)