[Eager] print gpu mem info #42616

wanghuancoder · 2022-05-09T11:45:49Z

PR types

Others

PR changes

Others

Describe

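Modify the CI GPU memory monitoring logic so that it is compiled even in non-TESTING builds, with FLAGS_enable_gpu_memory_usage_log controlling whether it is enabled.

When the program exits, print the peak GPU memory actually used by the model and the peak GPU memory requested by the model, in MB, in the following format:

[Memory Usage (MB)] gpu 0 : Reserved = 2266.72, Allocated = 2237.15

Allocated is the peak GPU memory actually used by the model. Reserved is the peak GPU memory the model requested from the GPU.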
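As a rough illustration of the output format described above, the following standalone sketch builds the same line from two peak values given in bytes. The helper name `FormatMemoryUsageMB`, the two-decimal precision, and the choice of 1 MB = 1024 * 1024 bytes are assumptions made for this example; they are not taken from the PR's actual implementation.

```cpp
#include <cassert>
#include <cstdint>
#include <iomanip>
#include <sstream>
#include <string>

// Assumption for this sketch: 1 MB = 1024 * 1024 bytes.
constexpr double kBytesPerMB = 1024.0 * 1024.0;

// Hypothetical helper (not part of Paddle): formats the peak reserved and
// allocated byte counts of one GPU in the layout shown in the PR description.
std::string FormatMemoryUsageMB(int dev_id,
                                std::uint64_t reserved_peak_bytes,
                                std::uint64_t allocated_peak_bytes) {
  assert(dev_id >= 0);
  std::ostringstream os;
  // Fixed notation with two decimals, matching the sample line.
  os << std::fixed << std::setprecision(2);
  os << "[Memory Usage (MB)] gpu " << dev_id
     << " : Reserved = " << reserved_peak_bytes / kBytesPerMB
     << ", Allocated = " << allocated_peak_bytes / kBytesPerMB;
  return os.str();
}
```

Reserved is expected to be at least as large as Allocated, since an allocator typically requests memory from the device in larger chunks than the model consumes at any moment.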

paddle-bot-old · 2022-05-09T11:46:01Z

Your PR has been submitted. Thanks for your contribution!
Please wait for the CI results first. See the Paddle CI Manual for details.

lelelelelez · 2022-05-10T03:00:50Z

paddle/fluid/platform/device/gpu/gpu_info.cc

    if (FLAGS_enable_gpu_memory_usage_log) {
-      std::cout << "[Memory Usage (Byte)] gpu " << dev_id_ << " : "
-                << MEMORY_STAT_PEAK_VALUE(Reserved, dev_id_) << std::endl;
+      std::cout << "[Memory Usage (MB)] gpu " << dev_id_ << " : Reserved = "


For cases that use only a small amount of GPU memory, printing in MB may produce a lot of zeros.

I added another switch, FLAGS_enable_gpu_memory_usage_log_mb. MB is printed by default; when FLAGS_enable_gpu_memory_usage_log_mb=false, the values are printed in bytes.
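The concern raised in the review and the unit switch can be sketched as follows. A plain `bool` parameter stands in for FLAGS_enable_gpu_memory_usage_log_mb here; the helper name `FormatPeakValue`, the two-decimal precision, and the 1024 * 1024 divisor are assumptions for illustration, not the merged code.

```cpp
#include <cstdint>
#include <iomanip>
#include <sstream>
#include <string>

// Hypothetical helper (not part of Paddle): renders one peak value either in
// MB with two decimals or as a raw byte count, depending on `use_mb`.
// `use_mb` plays the role of FLAGS_enable_gpu_memory_usage_log_mb.
std::string FormatPeakValue(std::uint64_t peak_bytes, bool use_mb) {
  std::ostringstream os;
  if (use_mb) {
    // Assumption: 1 MB = 1024 * 1024 bytes.
    os << std::fixed << std::setprecision(2)
       << peak_bytes / (1024.0 * 1024.0);
  } else {
    // Byte output keeps small values distinguishable.
    os << peak_bytes;
  }
  return os.str();
}
```

With these assumptions, a 512-byte peak is rendered as `0.00` in MB but as `512` in bytes, which is the loss of information the reviewer pointed out.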

From00

LGTM

print mem

5cc7352

wanghuancoder added 2 commits May 9, 2022 12:11

refine

ccff3cd

refine

0172b22

wanghuancoder changed the title ~~[Do Not Merge] print mem~~ [Eager] print gpu mem info May 10, 2022

refine

036f8da

lelelelelez reviewed May 10, 2022


refine

beff365

From00 approved these changes May 10, 2022


luotao1 approved these changes May 10, 2022


wanghuancoder merged commit 8164414 into PaddlePaddle:develop May 10, 2022
