Commit
Custom CRD: Wait for all processes before running metrics collector (#1313)
* Enable to wait all in metrics collectors
* Rename metricsFilePath
* Fix tfevent
* Fix pns py
* Fix comment
1 parent 7b797e1, commit 6b7142f
Showing 135 changed files with 20,302 additions and 121 deletions.
@@ -0,0 +1,10 @@
# Default interval between checks for running processes
DEFAULT_POLL_INTERVAL = 1
# Default timeout before raising an error while checking for running processes
DEFAULT_TIMEOUT = 0
# Default for whether to wait for all other main processes in the container to exit
DEFAULT_WAIT_ALL = True
# Default directory where TF event metrics are reported
DEFAULT_METRICS_FILE_DIR = "/log"
# Job-finished marker in the $$$$.pid file when the main process has completed
TRAINING_COMPLETED = "completed"
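
To illustrate how constants like these might be consumed, here is a minimal sketch of a polling loop that waits for the completion marker. It is not the code from this commit: the helper name `wait_for_marker`, the assumption that the marker is the last line of the pid file, and the reading of a timeout of 0 as "no timeout" are all hypothetical choices made for the example.

```python
import os
import time

# Constants copied from the diff above.
DEFAULT_POLL_INTERVAL = 1
DEFAULT_TIMEOUT = 0
TRAINING_COMPLETED = "completed"


def wait_for_marker(pid_file, poll_interval=DEFAULT_POLL_INTERVAL,
                    timeout=DEFAULT_TIMEOUT, marker=TRAINING_COMPLETED):
    """Hypothetical helper: poll pid_file until its last line equals marker.

    Assumption (not confirmed by the commit): timeout <= 0 means
    "wait indefinitely". Returns True once the marker is seen and
    False if the timeout expires first.
    """
    deadline = None if timeout <= 0 else time.monotonic() + timeout
    while True:
        # The file may not exist yet if the main process has not started.
        if os.path.exists(pid_file):
            with open(pid_file) as f:
                lines = f.read().splitlines()
            if lines and lines[-1].strip() == marker:
                return True
        if deadline is not None and time.monotonic() >= deadline:
            return False
        time.sleep(poll_interval)
```

A caller could pass a short `poll_interval` and a positive `timeout` to bound the wait; with the defaults shown, the sketch checks once per second and never gives up.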