New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Does this cause core dump ? #16
Comments
TLDR: It's possible that there's a bug that causes a segfault, though it's unlikely that this is happening in the parts of the code you're pointing to. For diagnosing the segfault: Could you run a minimally reproducing example with Regarding the categorical data: The relevant function is actually this one:
This is the function in the binary that lleaves calls from Python (using two double pointers). The categorical features are then cast to ints in the core loop here:lleaves/lleaves/compiler/codegen/codegen.py Line 205 in 9784625
Most of the processing of the Pandas dataframes follows LightGBM very closely. This double to int casting is a bit strange, but I wanted to follow LightGBM as closely as possible. It works since LightGBM doesn't allow categoricals > 2^31-1 (max int 32), but double can represent any int up to 2^53 and lower without loss of precision.
|
I find that if categorical feature is numerical value, we can get rid of the code When I retrained a model that pandas_categorical is not null, the core dump disappeared. PR: return empty list if pandas_categorical is null in model file |
I'm having trouble understanding this issue. Could you write up a minimally reproducible example of the core dump / send me the |
Recently, I find that one of my model will cause core dump if I use lleaves for predict.
I am confused about two functions below.
In codegen.py, function param type can be
int*
if param is categoricalBut in data_processing.py with predict used, all feature param are convert to
double*
Is this just like
Does this will happy in lleaves?
The text was updated successfully, but these errors were encountered: