Skip to content

ResourceExhaustedError during retraining in Object detection API #1766

@Garima-Mishra

Description

@Garima-Mishra

I am trying to train the mobilenet v1 with pets data set.
from tensorflow.python.client import device_lib
device_lib.list_local_devices()
gives following output
name: GeForce GTX 750 Ti
major: 5 minor: 0 memoryClockRate (GHz) 1.15
pciBusID 0000:03:00.0
Total memory: 1.95GiB
Free memory: 1.93GiB

But when I run training. I get the error
tensorflow/core/framework/op_kernel.cc:1158] Resource exhausted: OOM when allocating tensor with shape[24,128,75,75]
INFO:tensorflow:Error reported to Coordinator: <class 'tensorflow.python.framework.errors_impl.ResourceExhaustedError'>, OOM when allocating tensor with shape[24,128,75,75]

Even tried with pascal dataset and other pre-trained models also. I was able to run call models for running the object detection tutorial.
I followed the instruction given for running_locally.md
One thing I observed that all of the GPU memory get exhausted before throwing the error.
What is the minimum GPU memory required for training?

Metadata

Metadata

Assignees

Labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions