
Memory Leak in BaseEstimator Class #1576

Closed
gregbolet opened this issue Oct 24, 2022 · 2 comments
gregbolet commented Oct 24, 2022

Subject of the issue (with proposed fix)

Creating multiple MaximumLikelihoodEstimator objects and calling estimate_cpd for every model node in each appears to leak memory. This is exactly what repeated calls to BayesianNetwork.fit do.

The state_counts method of the BaseEstimator class is wrapped with a decorator (https://github.com/pgmpy/pgmpy/blob/dev/pgmpy/estimators/base.py#L66):

@lru_cache(maxsize=2048)

which seems to be the culprit. When I remove it, the memory leak goes away. I haven't dug deeper into the issue, but it is something worth keeping in mind when training models repeatedly.
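For context (this is my understanding of the mechanism, not verified against pgmpy internals): functools.lru_cache on an instance method includes self in the cache key, so the class-level cache keeps a strong reference to every estimator instance that ever called the method, along with whatever data those instances hold. A minimal sketch with a hypothetical Estimator class:

```python
import functools
import gc
import weakref

class Estimator:
    def __init__(self, data):
        self.data = data  # stands in for a large training DataFrame

    @functools.lru_cache(maxsize=2048)
    def state_counts(self, variable):
        # `self` is part of the cache key, so every call pins this
        # instance (and everything it references) inside the cache.
        return variable

est = Estimator([0] * 1_000_000)
est.state_counts("a")
ref = weakref.ref(est)
del est
gc.collect()
print(ref() is not None)  # True: the class-level cache still holds the instance
```

With maxsize=2048, up to 2048 such entries (each pinning an instance and its data) can accumulate before any eviction happens.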

Your environment

  • Google Colab Notebook
  • PGMPY version: pgmpy-0.1.20
  • Python version: 3.7.15
  • Operating System: Ubuntu 18.04.6 LTS (Bionic Beaver)
  • uname -a output: Linux 5.10.133+

Steps to reproduce

Run the following code; you should see memory usage slowly climb.

import pandas as pd
from pgmpy.models import BayesianNetwork
from pgmpy.estimators import MaximumLikelihoodEstimator

data = pd.read_csv('https://raw.githubusercontent.com/mwaskom/seaborn-data/master/iris.csv')

# Connect each of the first three columns to each of the remaining columns
cols = list(data.columns)
edgelist = [(a, b) for a in cols[0:3] for b in cols[3:] if a != b]

model = BayesianNetwork()
model.add_nodes_from(cols)
model.add_edges_from(edgelist)

for i in range(1000):
    model.fit(data, MaximumLikelihoodEstimator)
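The same growth pattern can be reproduced without pgmpy at all (a sketch; Model here is a stand-in, not pgmpy code): because the lru_cache lives on the class, every freshly created instance adds new cache entries that outlive the instance itself.

```python
import functools

class Model:
    @functools.lru_cache(maxsize=2048)
    def count(self, key):
        return key * 2

models = []
for i in range(100):
    m = Model()     # fresh estimator, as in each model.fit call
    m.count(i)      # inserts (m, i) into the shared class-level cache
    models.append(m)

models.clear()      # drop our references...
# ...but all 100 instances are still alive inside the cache:
print(Model.count.cache_info().currsize)  # 100
```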

Expected behaviour

Repeatedly calling model.fit with the MLE should not consume memory indefinitely: the model is re-trained from scratch each time, so old data should not persist.

Actual behaviour

During the 1000 iterations, additional RAM is steadily consumed and never freed.

gregbolet (Author) commented:
Looks like this might be a similar issue: limix/glimix-core#15

@gregbolet gregbolet reopened this Oct 24, 2022
@ankurankan ankurankan added the Bug label Oct 24, 2022
ankurankan (Member) commented:

Fixed for now: caching has been disabled, as it wasn't improving performance by much.
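On versions that still carry the cache, a possible workaround (my sketch, using a stand-in Estimator class rather than pgmpy itself) is to call cache_clear(), which functools.lru_cache attaches to every wrapped function, between fits:

```python
import functools
import gc
import weakref

class Estimator:
    @functools.lru_cache(maxsize=2048)
    def state_counts(self, variable):
        return variable

est = Estimator()
est.state_counts("a")
ref = weakref.ref(est)
del est
gc.collect()
assert ref() is not None          # the cache still pins the instance

# Standard lru_cache API: drop every cached entry at once.
Estimator.state_counts.cache_clear()
gc.collect()
print(ref() is None)  # True: the instance is finally freed
```

If the decorated pgmpy method exposes the same standard attribute, calling it after each fit would bound the leak, at the cost of losing any caching benefit (which, per the fix above, was small anyway).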
