How can I save a pyod model? #88

Open
singyaowu opened this issue May 8, 2019 · 13 comments
singyaowu commented May 8, 2019

I've just trained an auto-encoder model, and I wonder how I can save it so that I don't need to train it again the next time I want to use it. I didn't see any save-related function in auto_encoder.py, so I'm not sure whether one exists. Have you implemented this kind of function?

yzhao062 (Owner) commented May 8, 2019

Agreed that model-save functionality should be added; marked as a todo task. I am not sure whether pickle will work (hopefully yes), so I will run some tests.
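A quick way to test whether pickle works is an in-memory round-trip. A minimal sketch, using a hypothetical stand-in class in place of a fitted detector (a real check would fit e.g. pyod.models.knn.KNN and pickle that instead):

```python
import pickle

# Stand-in for a fitted pyod detector (the attribute names mimic
# pyod's fitted attributes, but this class is only illustrative).
class FittedDetector:
    def __init__(self):
        self.threshold_ = 0.5
        self.decision_scores_ = [0.1, 0.9, 0.3]

clf = FittedDetector()
blob = pickle.dumps(clf)           # serialize to bytes
restored = pickle.loads(blob)      # deserialize

# The round-trip preserves the fitted attributes.
print(restored.threshold_)         # -> 0.5
print(restored.decision_scores_)   # -> [0.1, 0.9, 0.3]
```

If `pickle.dumps` raises here, the detector holds something unpicklable (as happens with the Keras-backed models discussed below).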

osancus commented Aug 6, 2019

When trying to save an AutoEncoder model using pickle, the following error occurs. Any idea how I can fix it?

TypeError: can't pickle _thread.RLock objects

# code
clf = fit_model(X_train)
pickle.dump(clf, open('./autoencoder.h5', 'wb'))

yzhao062 (Owner) commented:

@epicsol-inc sorry for the late response. The AE in pyod is written with Keras, so saving the model can be tricky.

To my understanding, Keras models may not be picklable (keras-team/keras#10528)...

If saving the model is a must, you may have to copy the code out of auto_encoder.py directly. Sorry for the inconvenience.

sbysiak commented Aug 14, 2019

@epicsol-inc
I managed to save it using dill (https://pypi.org/project/dill/), whose syntax is very similar to pickle's:

with open(out_fname, 'wb') as f:
    dill.dump(model, f, dill.HIGHEST_PROTOCOL)

You can check whether it works in your case.
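For reference, a full dill round-trip looks like the sketch below. dill is a third-party drop-in superset of pickle, so the fallback import here exists only so the example still runs where dill is not installed; the dict is a stand-in for a real model object:

```python
import io

# dill (https://pypi.org/project/dill/) exposes the same API as pickle,
# including dump/load and HIGHEST_PROTOCOL; fall back to the stdlib
# pickle only so this sketch runs without dill installed.
try:
    import dill as serializer
except ImportError:
    import pickle as serializer

model = {"weights": [0.1, 0.2], "threshold": 0.5}  # stand-in for a model

buf = io.BytesIO()
serializer.dump(model, buf, serializer.HIGHEST_PROTOCOL)  # save
buf.seek(0)
restored = serializer.load(buf)                           # load back
print(restored == model)  # -> True
```

The advantage of dill over pickle is that it can serialize more object types (e.g. lambdas and closures), which is why it sometimes succeeds where pickle fails.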

yzhao062 (Owner) commented:

@sbysiak Thanks for the note, much appreciated. I will check it out and consider adding it to the documentation :)

lgo7 commented Oct 16, 2019

Any news regarding saving PyOD models? I need to save an IForest model; can I use pickle?

yzhao062 (Owner) commented:

> Any news regarding saving PyOD models? I need to save an IForest model; can I use pickle?

Sorry, I have not yet tested which approach works. If pickle does not work, I would suggest dill (https://pypi.org/project/dill/), as mentioned above.

This will be listed on the top of my priority list now.

yzhao062 self-assigned this Oct 16, 2019
lgo7 commented Oct 24, 2019

I've used pickle.dump and it worked!

AlexDelPab commented:

I've also used pickle.dump() for the kNN, OC-SVM, IForest, and FABOD classifiers; saving and loading them works with:

# save
pickle.dump(clf, open(folder + clf_name + '.h5', 'wb'))
# load
clf = pickle.loads(open(folder + 'k Nearest Neighbors (kNN).h5', 'rb').read())
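The same save/load can be written with context managers so the file handles are closed deterministically. A sketch, with a plain dict standing in for a fitted clf and a temp-file path chosen only for the example:

```python
import os
import pickle
import tempfile

clf = {"name": "IForest", "threshold_": 0.5}  # stand-in for a fitted detector

path = os.path.join(tempfile.gettempdir(), "iforest.pkl")

# save: the with-block closes the file even if dump() raises
with open(path, 'wb') as f:
    pickle.dump(clf, f, protocol=pickle.HIGHEST_PROTOCOL)

# load
with open(path, 'rb') as f:
    loaded = pickle.load(f)

print(loaded == clf)  # -> True
```

Note that a `.pkl` extension is more conventional than `.h5` for pickle files; `.h5` usually signals an HDF5 file, which pickle does not produce.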

bhowmiks commented Apr 17, 2020

Pickle and dill can save the model successfully, but loading from those formats can be slow. For the autoencoder model, I saved the Keras weights as HDF5 and the classifier object as a pickle, for faster loads and less disk space.

import pickle

from pyod.models.auto_encoder import AutoEncoder

autoenModel = AutoEncoder()
autoenModel.fit(X=x_train)

## serialize the network architecture to JSON
model_json = autoenModel.model_.to_json()
with open(model_path + ".json", "w") as json_file:
    json_file.write(model_json)
## serialize the weights to HDF5
autoenModel.model_.save_weights(model_path + "model.h5")

## then set the inner Keras model to None: it is not picklable,
## and dropping it makes the pickle much smaller
autoenModel.model_ = None
with open(newpath + "//" + model_name + "_model" + ".pickle", 'wb') as handle:
    pickle.dump(autoenModel, handle, protocol=pickle.HIGHEST_PROTOCOL)

Model Load

## load the auto encoder instance
with open(path + "//" + model_n + "_model" + ".pickle", 'rb') as handle:
    loaded_model = pickle.load(handle)

## load the JSON architecture and rebuild the Keras model
## (model_from_json is imported from keras.models)
with open(path + "//" + model_n + ".json", 'r') as json_file:
    loaded_model_json = json_file.read()
loaded_model_json = loaded_model_json.replace("\"ragged\": false,", " ")
loaded_model_ = model_from_json(loaded_model_json)
## load the weights into the new model
loaded_model_.load_weights(path + "//" + model_n + "model.h5")
print("Loaded model from disk")

## reattach the rebuilt Keras model to the auto encoder instance
loaded_model.model_ = loaded_model_

This loads almost 5x faster, and the saved model is about 10x smaller.

yzhao062 added a commit that referenced this issue Sep 19, 2020
ezzeldinadel commented Feb 9, 2021

> loaded_model_ = model_from_json(loaded_model_json)

What is model_from_json? Is it https://www.tensorflow.org/api_docs/python/tf/keras/models/model_from_json ?

SaqlainHussainShah commented:

I have tried the .pkl and .h5 extensions along with dill, pickle, and joblib, but the issue persists:

Unable to save model: can't pickle _thread.RLock objects
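The RLock error comes from the Keras model object held inside the detector: thread locks cannot be pickled by any of pickle, dill, or joblib. The workaround mirrors the weights-plus-pickle recipe above: persist the Keras part separately, detach it, and pickle the rest. A stdlib-only sketch that reproduces the error, using an actual threading.RLock as a stand-in for the Keras model:

```python
import pickle
import threading

# Hypothetical stand-in for a fitted AutoEncoder: the inner Keras model
# holds thread locks, which is exactly what triggers the pickling error.
class Detector:
    def __init__(self):
        self.threshold_ = 0.5
        self.model_ = threading.RLock()  # proxy for the unpicklable Keras model

clf = Detector()

try:
    pickle.dumps(clf)                    # fails: locks are not picklable
    raised = False
except TypeError as exc:
    raised = True
    print("pickle failed:", exc)

# Workaround: save the Keras part separately (e.g. save_weights to HDF5),
# detach it, pickle the remainder, and reattach it after loading.
clf.model_ = None
restored = pickle.loads(pickle.dumps(clf))
print(restored.threshold_)               # -> 0.5
```

Changing the file extension (.pkl vs .h5) has no effect here; the failure happens during serialization, before anything is written to disk.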

lfvillavicencio commented:
> Pickle and dill can save successfully. But these formats can make it time consuming to load the model. For autoencoder model, I saved the weights as HDF5 and the classifier object as pickle for faster loads and less disk space. [...]
> loaded_model_ = model_from_json(loaded_model_json) [...]
> This works almost 5x faster and model size is 10X smaller.

Hi!
Where do you import that model_from_json function from?
Thanks!
