You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I0321 16:33:57.914856 26795 nccl_context.cc:83] init nccl context nranks: 2 local rank: 0 gpu id: 0 ring id: 0
W0321 16:33:59.374146 26795 gpu_resources.cc:61] Please NOTE: device: 0, GPU Compute Capability: 8.0, Driver API Version: 12.1, Runtime API Version: 11.2
W0321 16:33:59.379195 26795 gpu_resources.cc:91] device: 0, cuDNN Version: 8.5.
loading annotations into memory...
Done (t=0.12s)
creating index...
index created!
[03/21 16:34:02] ppdet.utils.checkpoint INFO: Finish loading model weights: /home/hy/.cache/paddle/weights/ResNet50_cos_pretrained.pdparams
Traceback (most recent call last):
File "tools/train.py", line 172, in
main()
File "tools/train.py", line 168, in main
run(FLAGS, cfg)
File "tools/train.py", line 132, in run
trainer.train(FLAGS.eval)
File "/home/hy/PaddleDetection/ppdet/engine/trainer.py", line 506, in train
outputs = model(data)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/parallel.py", line 752, in forward
outputs = self._layers(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/architectures/meta_arch.py", line 59, in forward
out = self.get_loss()
File "/home/hy/PaddleDetection/ppdet/modeling/architectures/cascade_rcnn.py", line 125, in get_loss
rpn_loss, bbox_loss, mask_loss = self._forward()
File "/home/hy/PaddleDetection/ppdet/modeling/architectures/cascade_rcnn.py", line 87, in _forward
body_feats = self.backbone(self.inputs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/backbones/resnet.py", line 582, in forward
x = stage(x)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/backbones/resnet.py", line 423, in forward
block_out = block(block_out)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/backbones/resnet.py", line 362, in forward
out = self.branch2c(out)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/backbones/resnet.py", line 118, in forward
out = self.conv(inputs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/nn/layer/conv.py", line 666, in forward
out = F.conv._conv_nd(
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/nn/functional/conv.py", line 144, in _conv_nd
pre_bias = getattr(_C_ops, op_type)(x, weight, *attrs)
SystemError: (Fatal) Operator conv2d raises an paddle::memory::allocation::BadAlloc exception.
The exception content is
:ResourceExhaustedError:
Out of memory error on GPU 0. Cannot allocate 128.000000MB memory on GPU 0, 39.401306GB memory has been allocated and available memory is only 30.312500MB.
复现环境 Environment
Linux 18.04
paddlepaddle-gpu 2.2.2.post11.2
cuda 11.2
cudnn 8.5
python 3.8
Bug描述确认 Bug description confirmation
我确认已经提供了Bug复现步骤、代码改动说明、以及环境信息,确认问题是可以复现的。I confirm that the bug replication steps, code change instructions, and environment information have been provided, and the problem can be reproduced.
是否愿意提交PR? Are you willing to submit a PR?
我愿意提交PR!I'd like to help by submitting a PR!
The text was updated successfully, but these errors were encountered:
问题确认 Search before asking
Bug组件 Bug Component
Validation
Bug描述 Describe the Bug
I0321 16:33:57.914856 26795 nccl_context.cc:83] init nccl context nranks: 2 local rank: 0 gpu id: 0 ring id: 0
W0321 16:33:59.374146 26795 gpu_resources.cc:61] Please NOTE: device: 0, GPU Compute Capability: 8.0, Driver API Version: 12.1, Runtime API Version: 11.2
W0321 16:33:59.379195 26795 gpu_resources.cc:91] device: 0, cuDNN Version: 8.5.
loading annotations into memory...
Done (t=0.12s)
creating index...
index created!
[03/21 16:34:02] ppdet.utils.checkpoint INFO: Finish loading model weights: /home/hy/.cache/paddle/weights/ResNet50_cos_pretrained.pdparams
Traceback (most recent call last):
File "tools/train.py", line 172, in
main()
File "tools/train.py", line 168, in main
run(FLAGS, cfg)
File "tools/train.py", line 132, in run
trainer.train(FLAGS.eval)
File "/home/hy/PaddleDetection/ppdet/engine/trainer.py", line 506, in train
outputs = model(data)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/parallel.py", line 752, in forward
outputs = self._layers(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/architectures/meta_arch.py", line 59, in forward
out = self.get_loss()
File "/home/hy/PaddleDetection/ppdet/modeling/architectures/cascade_rcnn.py", line 125, in get_loss
rpn_loss, bbox_loss, mask_loss = self._forward()
File "/home/hy/PaddleDetection/ppdet/modeling/architectures/cascade_rcnn.py", line 87, in _forward
body_feats = self.backbone(self.inputs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/backbones/resnet.py", line 582, in forward
x = stage(x)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/backbones/resnet.py", line 423, in forward
block_out = block(block_out)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/backbones/resnet.py", line 362, in forward
out = self.branch2c(out)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/PaddleDetection/ppdet/modeling/backbones/resnet.py", line 118, in forward
out = self.conv(inputs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 930, in call
return self._dygraph_call_func(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/fluid/dygraph/layers.py", line 915, in _dygraph_call_func
outputs = self.forward(*inputs, **kwargs)
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/nn/layer/conv.py", line 666, in forward
out = F.conv._conv_nd(
File "/home/hy/anaconda3/envs/ppdet/lib/python3.8/site-packages/paddle/nn/functional/conv.py", line 144, in _conv_nd
pre_bias = getattr(_C_ops, op_type)(x, weight, *attrs)
SystemError: (Fatal) Operator conv2d raises an paddle::memory::allocation::BadAlloc exception.
The exception content is
:ResourceExhaustedError:
Out of memory error on GPU 0. Cannot allocate 128.000000MB memory on GPU 0, 39.401306GB memory has been allocated and available memory is only 30.312500MB.
复现环境 Environment
Linux 18.04
paddlepaddle-gpu 2.2.2.post11.2
cuda 11.2
cudnn 8.5
python 3.8
Bug描述确认 Bug description confirmation
是否愿意提交PR? Are you willing to submit a PR?
The text was updated successfully, but these errors were encountered: