
[Bug]: Memory leak in flair models #3378

Open
ganga7445 opened this issue Nov 28, 2023 · 13 comments
Assignees
Labels
bug Something isn't working

Comments

@ganga7445

ganga7445 commented Nov 28, 2023

Describe the bug

I have fine-tuned Flair on my own NER dataset and am using the code below, with XLM-R transformer embeddings.
The predict method appears to leak memory. @helpmefindaname
Flair version: 0.12.2

To Reproduce

from flair.data import Sentence
from flair.models import SequenceTagger
import torch

tagger = SequenceTagger.load("best-model.pt")

for text in ["Financial Chair, Chi Phi Fraternity I Tuscaloosa, AL  Fall 2019",
             "Fort Smith Housekeeping Opportunities - Evenings, Data Scientist"]:
    sentence = Sentence(text)
    tagger.predict(sentence)
    entities = sentence.get_spans('ner')
    for entity in entities:
        print(f"Entity: {entity.text}, Label: {entity.tag}")
torch.cuda.empty_cache()

Expected behavior

Ideally, there should be no memory leaks. During testing with lower versions of Flair, no memory leaks were observed.

Logs and Stack traces

No response

Screenshots

No response

Additional Context

No response

Environment

Versions:

Flair

0.12.2

Pytorch

2.0.1+cu117

Transformers

4.30.2

GPU

True

@ganga7445 ganga7445 added the bug Something isn't working label Nov 28, 2023
@alanakbik
Collaborator

Thanks for reporting this! @helpmefindaname can you check?

@helpmefindaname
Collaborator

Hi @ganga7445 can you please upgrade to the latest version of flair and check if the issue persists?

@helpmefindaname helpmefindaname added the Awaiting Response Waiting for new input from the author label Nov 29, 2023
@ganga7445
Author

@helpmefindaname @alanakbik
I tried with the latest versions of flair, torch, and transformers. Still, the memory leak happens.

@github-actions github-actions bot removed the Awaiting Response Waiting for new input from the author label Nov 29, 2023
@helpmefindaname
Collaborator

Okay,
can you then elaborate on what you mean by memory leak and how you measure it?

@helpmefindaname helpmefindaname added the Awaiting Response Waiting for new input from the author label Nov 30, 2023
@ganga7445
Author

@helpmefindaname
Initially, loading the model required about 1.5 GB of RAM. After making predictions with the loaded model, RAM usage kept rising, which looks like a memory leak. I debugged with Python's memory-profiler, which showed the predict method allocating approximately 30 MB per request.
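To separate a true leak from one-off cache warm-up, a useful methodology is to discard the first few calls and then check whether repeated identical calls still grow retained memory. Below is a minimal stdlib-only sketch of that idea (not flair-specific — `leaky`/`steady` are stand-ins for `tagger.predict`; note that `tracemalloc` only sees Python-heap allocations, so for flair you would additionally watch process RSS to catch native libtorch/CUDA buffers):

```python
import tracemalloc

def measure_growth(fn, warmup=3, iters=10):
    """Net Python-heap growth in bytes across `iters` calls of `fn`,
    after `warmup` calls so one-off caches do not count as a leak."""
    for _ in range(warmup):
        fn()
    tracemalloc.start()
    before, _ = tracemalloc.get_traced_memory()
    for _ in range(iters):
        fn()
    after, _ = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return after - before

retained = []

def leaky():
    # keeps a reference to every allocation -> genuine leak
    retained.append(bytearray(100 * 1024))

def steady():
    # allocates the same amount but releases it -> no leak
    bytearray(100 * 1024)

print("leaky growth (bytes): ", measure_growth(leaky))
print("steady growth (bytes):", measure_growth(steady))
```

If `predict` behaved like `steady`, per-call growth after warm-up would stay near zero; sustained positive growth like `leaky` would support the leak claim.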

@github-actions github-actions bot removed the Awaiting Response Waiting for new input from the author label Dec 1, 2023
@None-Such

Is this a leak . . . or a configuration issue?

See - https://stackoverflow.com/questions/55322434/how-to-clear-cuda-memory-in-pytorch
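The usual advice in that thread is to call `gc.collect()` before `torch.cuda.empty_cache()`. A stdlib-only sketch of why the `gc.collect()` half matters: objects caught in reference cycles (which graph-like model structures can form) are never freed by reference counting alone, so memory can look "leaked" until the cycle collector runs. The `Holder` class here is purely illustrative:

```python
import gc
import weakref

class Holder:
    def __init__(self):
        self.other = None

# Build a reference cycle: refcounting alone can never free these two.
a, b = Holder(), Holder()
a.other, b.other = b, a
probe = weakref.ref(a)

del a, b          # the objects survive: the cycle keeps them alive
gc.collect()      # the cycle collector reclaims them
print("collected:", probe() is None)
```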

@helpmefindaname
Collaborator

@ganga7445
Am I guessing right that you are talking about https://pypi.org/project/memory-profiler/ ?
I tried that out, adapting your script to:

from memory_profiler import profile

from flair.models import SequenceTagger
import torch

tagger = SequenceTagger.load("ner")
from flair.data import Sentence


@profile
def main():
    for text in ["Financial Chair, Chi Phi Fraternity I Tuscaloosa, AL  Fall 2019",
                 "Fort Smith Housekeeping Opportunities - Evenings, Data Scientist"]:
        sentence = Sentence(text)
        tagger.predict(sentence)
        entities = sentence.get_spans('ner')
        for entity in entities:
            print(f"Entity: {entity.text}, Label: {entity.tag}")
    torch.cuda.empty_cache()


if __name__ == "__main__":
    main()

then running python -m memory_profiler scripts\memory_leak.py
gives output like this:

Line #    Mem usage    Increment  Occurrences   Line Contents
=============================================================
    10   3015.7 MiB   3015.7 MiB           1   @profile
    11                                         def main():
    12   3015.7 MiB      0.0 MiB           3       for text in ["Financial Chair, Chi Phi Fraternity I Tuscaloosa, AL  Fall 2019",
    13                                                          "Fort Smith Housekeeping Opportunities - Evenings, Data Scientist"]:
    14   3015.8 MiB   -214.3 MiB           2           sentence = Sentence(text)
    15   2801.8 MiB   -428.5 MiB           2           tagger.predict(sentence)
    16   2801.7 MiB     -0.0 MiB           2           entities = sentence.get_spans('ner')
    17   2801.7 MiB     -0.0 MiB           2           for entity in entities:
    18                                                     print(f"Entity: {entity.text}, Label: {entity.tag}")
    19   2801.7 MiB   -213.9 MiB           1       torch.cuda.empty_cache()

The numbers on lines 14 & 15 vary from run to run, but the increments stay negative.
I don't see any proof or indication of a memory leak here.

@helpmefindaname helpmefindaname added the Awaiting Response Waiting for new input from the author label Dec 4, 2023
@github-actions github-actions bot removed the Awaiting Response Waiting for new input from the author label Dec 4, 2023
@ganga7445
Author

@helpmefindaname
You are right, but I developed a custom Named Entity Recognition (NER) model that uses different embeddings during training. However, I'm experiencing a memory leak issue during inference.

@helpmefindaname
Collaborator

Okay, but as long as I cannot verify your claims and reproduce them, I cannot help you.
I'd suggest you try to find a reproducible example where that happens and share all the necessary details for it.

@helpmefindaname helpmefindaname added the Awaiting Response Waiting for new input from the author label Dec 4, 2023
@danilyef

danilyef commented Apr 4, 2024

I can confirm the memory leak as well when I run the model on a pandas DataFrame (text column).
But strangely, if I sort the DataFrame by the length of the text column (ascending order), the memory leak disappears.
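One plausible explanation for the sorting observation (an assumption, not a statement about flair's internals): each mini-batch pads every sentence to the longest one in that batch, so mixed-length batches allocate much larger tensors than length-sorted ones, and the varying peak sizes can look like runaway memory. A toy cost model of padded-cell counts illustrates the effect; `padded_cells` and the random lengths are hypothetical:

```python
from random import Random

def padded_cells(lengths, batch_size):
    """Cells in the padded [batch, max_len] tensors a batched run allocates."""
    total = 0
    for i in range(0, len(lengths), batch_size):
        batch = lengths[i:i + batch_size]
        total += len(batch) * max(batch)
    return total

rng = Random(0)
lengths = [rng.randint(5, 300) for _ in range(256)]  # token counts per text

unsorted_cost = padded_cells(lengths, 32)
sorted_cost = padded_cells(sorted(lengths), 32)
print("unsorted:", unsorted_cost, "sorted:", sorted_cost)
```

Sorted batching wastes far fewer padded cells, which would explain lower and steadier memory use without any actual leak being fixed.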

@github-actions github-actions bot removed the Awaiting Response Waiting for new input from the author label Apr 4, 2024
@helpmefindaname
Collaborator

@danilyef
can you please create a reproducible example that demonstrates this leak and also shows how you determine that it is a memory leak?

@helpmefindaname helpmefindaname added the Awaiting Response Waiting for new input from the author label May 3, 2024
@ganga7445
Author

ganga7445 commented May 17, 2024

@helpmefindaname
Below is the code.

from flair.data import Sentence
from flair.models import SequenceTagger
from memory_profiler import profile

tagger = SequenceTagger.load('ner')
texts = [ "2014 Master of Computer Application (MCA) from Indira Gandhi Open",
"University(IGNOU).2010 Bachelor of Computer Application (BCA) from",
"Bangalore University",
"ITIL v3 Training conducted by Accenture - IT",
"TATA CONSULTANCY SERVICES : L2 Network Engineer -Jan 2017 till date",
"L2 & L2.5 Engineer for Morgan Stanley (BFSI Project):",
"Accenture : Network Engineer - Sept 2012 to 2017",
"Current Morgan Stanley, Bangalore",
"CISCO SWITCH -2960,Hp-procurve,etc..",
"Experience in managing networks for Trading sector network DATA Centers, Branch",
"CONVERGYS Nov 2010 - May 2012",
"MCA with 8 years and 3 months of experience in IT.",
"Completed CCNA (Cisco Certified Network Associate) with Cisco ID-",
"CSCO12765621",
"2014 Master of Computer Application (MCA) from Indira Gandhi Open",
"University(IGNOU).2010 Bachelor of Computer Application (BCA) from",
"Bangalore University",
"ITIL v3 Training conducted by Accenture - IT",
"TATA CONSULTANCY SERVICES : L2 Network Engineer -Jan 2017 till date",
"L2 & L2.5 Engineer for Morgan Stanley (BFSI Project):",
"Accenture : Network Engineer - Sept 2012 to 2017",
"Current Morgan Stanley, Bangalore",
"CISCO SWITCH -2960,Hp-procurve,etc..",
"Experience in managing networks for Trading sector network DATA Centers, Branch",
"CONVERGYS Nov 2010 - May 2012",
"MCA with 8 years and 3 months of experience in IT.",
"Completed CCNA (Cisco Certified Network Associate) with Cisco ID-",
"CSCO12765621",
"2014 Master of Computer Application (MCA) from Indira Gandhi Open",
"University(IGNOU).2010 Bachelor of Computer Application (BCA) from",
"Bangalore University",
"ITIL v3 Training conducted by Accenture - IT",
"TATA CONSULTANCY SERVICES : L2 Network Engineer -Jan 2017 till date",
"L2 & L2.5 Engineer for Morgan Stanley (BFSI Project):",
"Accenture : Network Engineer - Sept 2012 to 2017",
"Current Morgan Stanley, Bangalore",
"CISCO SWITCH -2960,Hp-procurve,etc..",
"Experience in managing networks for Trading sector network DATA Centers, Branch",
"CONVERGYS Nov 2010 - May 2012",
"MCA with 8 years and 3 months of experience in IT.",
"Completed CCNA (Cisco Certified Network Associate) with Cisco ID-",
"CSCO12765621",
"SandHI - A Science and Heritage Initiative, IIT Kharagpur | Prof. Joy Sen [May 2016 - June 2016]",
"Machine Learning | Python for Data Science | Probability and Statistics | Linear Algebra | Social Media Data Analytics | Programming and",
"Business Analyst | Merilytics [June 2019 - Dec 2019]",
"Credit Card Fraud Detection | Kaggle [June 2018]",
"Governor | Technology Adventure Society [May 2016 - April 2017]",
"One Two Flip Entertainment Pvt. Ltd. | Data Science [March 2018 - May 2018]",
"Business Development and Operations | ListUp [May 2017 - July 2017]",
"Multiword Fashion Term Extraction | Prof. Pawan Goyal [Aug 2018 - Oct 2018]",
"Programming Language/Libraries: Python, SQL, scikit-learn, Scala, pandas, beautiful-soup, Numpy, matplotlib",
"Softwares: Anaconda, Adobe Photoshop, Figma, Tableau, Microsoft Office - Advanced Excel, PowerPoint"]

@profile
def predict_test(texts):
    metadata_sentences = [Sentence(sent) for sent in texts]
    tagger.predict(metadata_sentences, mini_batch_size=32)
    for sente in metadata_sentences:
        ner_spans = sente.get_spans('ner')
        entities = [{'text': span.text, 'label': span.tag,
                     'start_pos': span.start_position,
                     'end_pos': span.end_position,
                     'labels': {"value": str(span.tag),
                                "confidence": span.score}}
                    for span in ner_spans]
        model_result = {'text': sente.to_original_text(), 'entities': entities}

predict_test(texts)

Output is:
[screenshot: memory-profiler output]

Using flair 0.13.1. Could the issue be related to the flair version?

@github-actions github-actions bot removed the Awaiting Response Waiting for new input from the author label May 17, 2024
@ganga7445
Author

@alanakbik can you please check once?
