Quantized Gradient Creates a Failed Check #6175

jamespinkerton · 2023-11-03T04:41:36Z

Hi. I'm training a large model that I've trained many times before. I wanted to turn on quantized gradient to speed it up, but it's creating an error.

Here's a stack trace of the error:

Traceback (most recent call last):
File "/mnt/disks/condaman/mamba/lib/python3.11/site-packages/lightgbm/engine.py", line 266, in train
booster.update(fobj=fobj)
File "/mnt/disks/condaman/mamba/lib/python3.11/site-packages/lightgbm/basic.py", line 3557, in update
_safe_call(_LIB.LGBM_BoosterUpdateOneIter(
File "/mnt/disks/condaman/mamba/lib/python3.11/site-packages/lightgbm/basic.py", line 237, in _safe_call
raise LightGBMError(_LIB.LGBM_GetLastError().decode('utf-8'))
lightgbm.basic.LightGBMError: Check failed: (best_split_info.left_count) > (0) at /home/conda/feedstock_root/build_artifacts/lightgbm_1689341180525/work/src/treelearner/serial_tree_learner.cpp, line 845 .

I'm using LGBM 4.1.0. Installed with conda-forge.

Thanks so much,
James

jameslamb · 2023-11-03T14:08:10Z

Thanks for using LightGBM, and sorry about this bug.

In the future, please check the issues here before posting. Searching that error message (https://github.com/microsoft/LightGBM/issues?q=%22Check+failed%3A+%28best_split_info.left_count%29+%3E+%280%29%22+is%3Aissue), you'll see #5994 at the top of the list.

That links to related issue #5982, which shows this was fixed in #6092.

That change hasn't been released yet. We will try to get a release up soon.

For now, if you need to use quantized training follow @shiyu1994 's advice in #6134 (comment) and build the Python package from source.

jamespinkerton · 2023-11-03T15:03:27Z

My bad. I think I noticed the bug a while ago and checked at the time and there wasn't an issue. And then I finally got around to submitting it and I forgot to re-check. My fault, and thank you!

jameslamb · 2023-11-03T15:59:43Z

No problem at all, thanks for using LightGBM and taking the time to report! Sorry we haven't gotten that fix out in a release yet, I'm hoping to put one up in the next few days.

empowerVictor · 2024-01-17T21:27:16Z

@jameslamb I am still getting this error on 4.2.0.
Any ideas why? How can I help debug it?
Unfortunately I can't share my data.

jameslamb · 2024-01-17T21:31:52Z

How can I help debug it?

Post on #5982 with the following:

environment information (operating system, architecture, version of Python, how you installed lightgbm)
a reproducible example (e.g. using fake data if you can't share the data you're using), exact minimal code showing how you're using LightGBM
any other details like logs

This issue is marked duplicate, so I'm going to lock it to prevent further comments here.

jameslamb added bug duplicate and removed bug labels Nov 3, 2023

jameslamb closed this as completed Nov 3, 2023

microsoft locked as resolved and limited conversation to collaborators Jan 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Quantized Gradient Creates a Failed Check #6175

Quantized Gradient Creates a Failed Check #6175

jamespinkerton commented Nov 3, 2023

jameslamb commented Nov 3, 2023

jamespinkerton commented Nov 3, 2023

jameslamb commented Nov 3, 2023

empowerVictor commented Jan 17, 2024

jameslamb commented Jan 17, 2024

Quantized Gradient Creates a Failed Check #6175

Quantized Gradient Creates a Failed Check #6175

Comments

jamespinkerton commented Nov 3, 2023

jameslamb commented Nov 3, 2023

jamespinkerton commented Nov 3, 2023

jameslamb commented Nov 3, 2023

empowerVictor commented Jan 17, 2024

jameslamb commented Jan 17, 2024