sparkdl.xgboost getting stuck trying to map partitions #248

timpiperseek · 2022-08-10T08:16:53Z

I am running the following code to try to fit a model:

from sparkdl.xgboost import XgboostClassifier

param = {
    "num_workers": 4,  # number of workers on the cluster, adjust as needed
    "missing": 0,
    "objective": "binary:logistic",
    "eval_metric": "logloss",
    "featuresCol": "features",
    "labelCol": "objective",
    "nthread": 32,  # equal to the number of CPUs on each worker machine
}

train, test = data.randomSplit([0.001, 0.001])
xgb_classifier = XgboostClassifier(**param)
xgb_clf_model = xgb_classifier.fit(train)

When I run the model training on my Databricks cluster, it seems to get stuck while trying to map partitions.
It is using almost zero CPU across the cluster, but memory usage is slowly increasing.

Is there anything I can do to get around this issue?
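
One detail in the snippet above may be worth ruling out. This is an assumption about a contributing factor, not a confirmed cause of the hang. PySpark documents that `randomSplit` normalizes its weights when they do not sum to 1.0, so `[0.001, 0.001]` would yield two halves of roughly 50% each rather than two 0.1% samples. The helper below is hypothetical and only mimics that normalization in plain Python, to show the resulting fractions.

```python
def normalized_split_weights(weights):
    """Hypothetical helper: mimic the documented normalization of
    DataFrame.randomSplit weights (each weight divided by the total)."""
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return [w / total for w in weights]


# Weights used in the snippet above.
fractions = normalized_split_weights([0.001, 0.001])
print(fractions)

# Adding an explicit remainder bucket keeps the first two splits small.
small_fractions = normalized_split_weights([0.001, 0.001, 0.998])
print(small_fractions)
```

If a small sample was the intent, passing a third weight for the remainder, as in `data.randomSplit([0.001, 0.001, 0.998])`, or sampling first with `data.sample(fraction=0.001)`, would keep the training set small while debugging.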
