I have trained an Efficient Conformer transducer, and during inference on a CPU in a Flask-based web app I see a memory leak at
EfficientConformer/models/encoders.py
Line 128 in 2f59ed2
x, attention, hidden = block(x, mask)
The memory allocated at the above line during inference is never released, which eventually causes an OOM. Using jemalloc reduces the memory growth per iteration but does not eliminate it.