Fix memory stats printing by pengwa · Pull Request #20061 · microsoft/onnxruntime

pengwa · 2024-03-25T12:44:30Z

Fix memory stats printing

The mmeory stats printing is failed when module is in eval mode, doing ORTModule wrap. At that time, runtime inspector for training manager should have training model being true, but got a false (because existing logic get the boolean from module.training). Runtime inspector as part of training manager or inference manager should know it is serving training or not explicitly, so we cannot depend on the stat of module.training during ORTModule initialization.

Motivation and Context

orttraining/orttraining/test/python/orttraining_test_ortmodule_api.py

orttraining/orttraining/python/training/ortmodule/_graph_execution_manager.py

orttraining/orttraining/python/training/ortmodule/_runtime_inspector.py

…pengwa/fix_mem_inspector

### Fix memory stats printing The mmeory stats printing is failed when module is in eval mode, doing ORTModule wrap. At that time, runtime inspector for training manager should have training model being true, but got a false (because existing logic get the boolean from module.training). Runtime inspector as part of training manager or inference manager should know it is serving training or not explicitly, so we cannot depend on the stat of module.training during ORTModule initialization. ### Motivation and Context

fix memory stats printing

17811c4

pengwa added the training issues related to ONNX Runtime training; typically submitted using template label Mar 25, 2024

pengwa requested review from baijumeswani, guyang3532 and wschin March 25, 2024 12:45

wschin previously approved these changes Mar 26, 2024

View reviewed changes

wschin reviewed Mar 26, 2024

View reviewed changes

orttraining/orttraining/test/python/orttraining_test_ortmodule_api.py Show resolved Hide resolved

wschin reviewed Mar 26, 2024

View reviewed changes

orttraining/orttraining/python/training/ortmodule/_graph_execution_manager.py Show resolved Hide resolved

wschin reviewed Mar 26, 2024

View reviewed changes

orttraining/orttraining/python/training/ortmodule/_runtime_inspector.py Show resolved Hide resolved

pengwa added 3 commits March 26, 2024 05:10

add doc string

82dc08e

Merge branch 'main' of https://github.com/microsoft/onnxruntime into …

c1dfe37

…pengwa/fix_mem_inspector

minor

f146042

pengwa dismissed wschin’s stale review via f146042 March 26, 2024 05:16

Merge branch 'main' of https://github.com/microsoft/onnxruntime into …

e743cf7

…pengwa/fix_mem_inspector

guyang3532 approved these changes Mar 26, 2024

View reviewed changes

pengwa merged commit dfa891a into main Mar 26, 2024

pengwa deleted the pengwa/fix_mem_inspector branch March 26, 2024 13:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix memory stats printing#20061

Fix memory stats printing#20061
pengwa merged 5 commits intomainfrom
pengwa/fix_mem_inspector

pengwa commented Mar 25, 2024

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

pengwa commented Mar 25, 2024