paddle/fluid/core_avx.so paddle::memory::allocation::MemoryMapFdSet::Clear() #52269

Closed
chlyzzo opened this issue Mar 29, 2023 · 4 comments

@chlyzzo

chlyzzo commented Mar 29, 2023

Describe the Bug

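When the prediction service is called through the interface (x = paddle.to_tensor(pred_padding_ids), pred = self.model_pre(x)),
the number of core dumps grows after it has been running for a while. The error shown is: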
#1 0x00007fd1c21332d9 in __GI_abort () at abort.c:89
#2 0x00007fd081ddd489 in google::InitVLOG3__(int**, int*, char const*, int) () from /home/work//python/2.7.17/lib/python2.7/site-packages/paddle/fluid/core_avx.so
#3 0x00007fd081f17adf in paddle::memory::allocation::MemoryMapFdSet::Clear() () from /home/work//python/2.7.17/lib/python2.7/site-packages/paddle/fluid/core_avx.so
#4 0x00007fd081bedd56 in pybind11::cpp_function::initialize<paddle::pybind::BindImperative(pybind11::module*)::{lambda()#7}, void, , pybind11::name, pybind11::scope, pybind11::sibling>(paddle::pybind::BindImperative(pybind11::module*)::{lambda()#7}&&, void (*)(), pybind11::name const&, pybind11::scope const&, pybind11::sibling const&)::{lambda(pybind11::detail::function_call&)#3}::_FUN(pybind11::detail::function_call) () from /home/work//python/2.7.17/lib/python2.7/site-packages/paddle/fluid/core_avx.so

However, I could not find the source file for paddle::memory::allocation::MemoryMapFdSet::Clear(). Is this caused by poor memory management? Where can I clear the memory?

Additional Supplementary Information

No response

@vivienfanghuagood
Contributor

Please provide a reproducible script and the model so that we can follow up.

@chlyzzo
Author

chlyzzo commented Mar 29, 2023

[Thread debugging using libthread_db enabled]
Using host libthread_db library "/opt/compiler/gcc-8.2/lib/libthread_db.so.1".
Core was generated by `/home/work/.deck/1.0/python/2.7.17/bin/uwsgi --ini /home/work/aigc_t2p_strategy'.
Program terminated with signal SIGABRT, Aborted.
#0 __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:55
55 ../sysdeps/unix/sysv/linux/raise.c: No such file or directory.
[Current thread is 1 (Thread 0x7fd1c2105700 (LWP 27188))]
(gdb) bt
#0 __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:55
#1 0x00007fd1c21332d9 in __GI_abort () at abort.c:89
#2 0x00007fd081ddd489 in google::InitVLOG3__(int**, int*, char const*, int) ()
from /home/work/.deck/1.0/python/2.7.17/lib/python2.7/site-packages/paddle/fluid/core_avx.so
#3 0x00007fd081f17adf in paddle::memory::allocation::MemoryMapFdSet::Clear() ()
from /home/work/.deck/1.0/python/2.7.17/lib/python2.7/site-packages/paddle/fluid/core_avx.so
#4 0x00007fd081bedd56 in pybind11::cpp_function::initialize<paddle::pybind::BindImperative(pybind11::module*)::{lambda()#7}, void, , pybind11::name, pybind11::scope, pybind11::sibling>(paddle::pybind::BindImperative(pybind11::module*)::{lambda()#7}&&, void (*)(), pybind11::name const&, pybind11::scope const&, pybind11::sibling const&)::{lambda(pybind11::detail::function_call&)#3}::_FUN(pybind11::detail::function_call) ()
from /home/work/.deck/1.0/python/2.7.17/lib/python2.7/site-packages/paddle/fluid/core_avx.so
#5 0x00007fd081a6afe1 in pybind11::cpp_function::dispatcher(_object*, _object*, _object*) ()
from /home/work/.deck/1.0/python/2.7.17/lib/python2.7/site-packages/paddle/fluid/core_avx.so
#6 0x00007fd1c23f0f53 in call_function (oparg=<optimized out>, pp_stack=0x7fff5fa7e068) at Python/ceval.c:4376
#7 PyEval_EvalFrameEx (f=f@entry=0x7fd120d3a430, throwflag=throwflag@entry=0) at Python/ceval.c:3013
#8 0x00007fd1c23ef977 in fast_function (nk=<optimized out>, na=<optimized out>, n=0, pp_stack=0x7fff5fa7e178,
func=0x7fd12157dad0) at Python/ceval.c:4461
#9 call_function (oparg=<optimized out>, pp_stack=0x7fff5fa7e178) at Python/ceval.c:4396
#10 PyEval_EvalFrameEx (f=f@entry=0x7fcf5400d3f0, throwflag=throwflag@entry=0) at Python/ceval.c:3013
#11 0x00007fd1c23f1b52 in PyEval_EvalCodeEx (co=<optimized out>, globals=<optimized out>, locals=locals@entry=0x0,
args=args@entry=0x7fd1c208a068, argcount=<optimized out>, kws=kws@entry=0x7fd1c208a068, kwcount=0, defs=0x0,
defcount=0, closure=0x7fd12157bb90) at Python/ceval.c:3608
#12 0x00007fd1c236cae4 in function_call (func=0x7fd12157dbd0, arg=0x7fd1c208a050, kw=0x7fd1215593b0)
at Objects/funcobject.c:523
#13 0x00007fd1c2343223 in PyObject_Call (func=func@entry=0x7fd12157dbd0, arg=arg@entry=0x7fd1c208a050,
kw=kw@entry=0x7fd1215593b0) at Objects/abstract.c:2544
#14 0x00007fd1c23e8f2d in ext_do_call (nk=<optimized out>, na=<optimized out>, flags=<optimized out>,
pp_stack=0x7fff5fa7e3f0, func=0x7fd12157dbd0) at Python/ceval.c:4690
#15 PyEval_EvalFrameEx (f=f@entry=0x7fcf8c008cb0, throwflag=throwflag@entry=0) at Python/ceval.c:3052
#16 0x00007fd1c23f1b52 in PyEval_EvalCodeEx (co=<optimized out>, globals=<optimized out>, locals=locals@entry=0x0,
args=args@entry=0x7fd1c208a068, argcount=<optimized out>, kws=kws@entry=0x0, kwcount=0, defs=0x0, defcount=0,
closure=0x0) at Python/ceval.c:3608
#17 0x00007fd1c236ca1b in function_call (func=0x7fd1c17a8750, arg=0x7fd1c208a050, kw=0x0) at Objects/funcobject.c:523
#18 0x00007fd1c2343223 in PyObject_Call (func=func@entry=0x7fd1c17a8750, arg=arg@entry=0x7fd1c208a050, kw=0x0)
at Objects/abstract.c:2544
#19 0x00007fd1c23e7983 in PyEval_CallObjectWithKeywords (func=func@entry=0x7fd1c17a8750, arg=0x7fd1c208a050,
arg@entry=0x0, kw=kw@entry=0x0) at Python/ceval.c:4245
#20 0x00007fd1c2416ede in call_sys_exitfunc () at Python/pythonrun.c:1768
#21 Py_Finalize () at Python/pythonrun.c:430
#22 0x000000000046d9d1 in uwsgi_plugins_atexit ()
#23 0x00007fd1c2149f3c in __run_exit_handlers (status=0, listp=0x7fd1c22b25f8 <__exit_funcs>,
run_list_atexit=run_list_atexit@entry=true) at exit.c:82
#24 0x00007fd1c2149fd5 in __GI_exit (status=<optimized out>) at exit.c:104
#25 0x000000000042712f in uwsgi_exit ()
#26 0x000000000046f027 in simple_goodbye_cruel_world ()
#27 0x000000000046f058 in goodbye_cruel_world ()
#28 0x0000000000427f67 in uwsgi_close_request ()
#29 0x000000000046b6a0 in simple_loop_run ()
#30 0x000000000046b4ae in simple_loop ()
#31 0x000000000046fc7e in uwsgi_ignition ()
#32 0x000000000047433a in uwsgi_worker_run ()
#33 0x000000000047489b in uwsgi_run ()
#34 0x0000000000423efe in main ()
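
Read from the bottom up, frames #20 to #24 show that the abort happens while the uwsgi worker is exiting: exit() runs the exit handlers, Py_Finalize calls the registered Python exit functions, and one of them ends up in MemoryMapFdSet::Clear(). The following is only a minimal standard-library sketch of that mechanism, not Paddle's code. It assumes Python 3.7+ (the trace above is from Python 2.7), and the handler names are hypothetical stand-ins. It shows that atexit handlers run after the main script has finished, last registered first.

```python
import subprocess
import sys
import textwrap

# Child script with two hypothetical cleanup handlers registered via atexit.
CHILD = textwrap.dedent("""
    import atexit

    def clear_shared_memory():  # hypothetical stand-in for a native cleanup hook
        print("clear_shared_memory")

    def close_logs():  # hypothetical second handler
        print("close_logs")

    atexit.register(clear_shared_memory)
    atexit.register(close_logs)
    print("main done")
""")


def run_exit_handlers_demo():
    """Run the child script in a fresh interpreter and return its output lines."""
    result = subprocess.run(
        [sys.executable, "-c", CHILD],
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout.splitlines()


if __name__ == "__main__":
    for line in run_exit_handlers_demo():
        print(line)
```

Because handlers run in reverse registration order, a cleanup hook registered early (for example at import time) runs late in shutdown, when other resources it relies on may already have been released.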

paddle-bot added the status/following-up 跟进中 label and removed the status/new-issue 新建 label on Mar 30, 2023
@GuoxiaWang
Contributor

Can you provide code that reproduces this? These logs alone do not tell us much.
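
One way to attach more context to such a report is to capture the Python-level traceback at the moment of the abort. The sketch below is an assumption-laden illustration, not a fix. It uses the standard-library faulthandler module, which is part of Python 3.3+ (on Python 2.7, as far as I know, it exists only as a separate backport package), and the cleanup function is a hypothetical stand-in that aborts on purpose.

```python
import subprocess
import sys

# Child script: enable faulthandler, then abort from inside a Python function.
CHILD = """
import faulthandler
import os

faulthandler.enable()  # dump the Python traceback on SIGABRT, SIGSEGV, etc.

def cleanup():  # hypothetical function that ends in a native abort
    os.abort()

cleanup()
"""


def run_abort_demo():
    """Run the aborting child script and return what it wrote to stderr."""
    result = subprocess.run(
        [sys.executable, "-c", CHILD],
        capture_output=True,
        text=True,
    )
    return result.stderr


if __name__ == "__main__":
    print(run_abort_demo())
```

The stderr output names the Python function that was running when the process aborted, which complements the native frames that gdb shows.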

paddle-bot closed this as completed on Apr 2, 2024

paddle-bot bot commented Apr 2, 2024

Since you haven't replied for more than a year, we have closed this issue/pr.
If the problem is not solved or there is a follow-up one, please reopen it at any time and we will continue to follow up.
