Commit
Sync with sagemaker-debugger master branch and fix issue with tensorflow_datasets version (#114)

* Update sagemaker.md (#250)
* Bumping version to 0.9.0 (#251)
* Skip using standalone keras Py3.7+ (#253)
* Gradtape zcc (#252)
* Fix Incorrect Log Statement (#256)
* Incorrect number of tensors saved with MirroredStrategy (#257)
* Change Version to 0.8.1 (#258)
* Save Scalars With Mirrored Strategy (#259)
* skip flaky test (#262)
* Don't export to collections for all workers with unsupported distrib training (#263)
* version bump (#265)
* Avoiding Basehook object pickling (#266)
* handle eager tensors (#271)
* TF 2.x: Support for keras to estimator (#268)
* Revert "TF 2.x: Support for keras to estimator (#268)" (#273)
  This reverts commit 749bded.
* Disable TB Testing (#275)
* Support for TF 2 estimator (#274)
* Adding a TF2 Hvd example and test (#279)
* Moved end of training log from info to debug (#281) #280
* Adding action class (#285)
  * Adding action class
    Actions added: stop training job, email, sms
* Fix buildspec used for PR CI (#287)
* Adding a test to check that PT model is saved without issues (#283)
  * test that model can be pickled without issues
* Save Model Inputs, Model Outputs, Gradients, Custom Tensors, Layer Inputs, Layer Outputs (#282)
* Pin pytest version (#293)
* Load IRIS Dataset from S3 (#298)
* Load dataset from s3 (#299)
* remove problematic log (#300)
* Change Enum (#301)
* Doc update (#292)
* rename enum (#305)
* version bump to 0.9.1 (#304)
* modify asserts (#307)
* version compare (#306)
* Support TF 2.3 Tests (#312)
* Disable TB in ZCC for AWS TF 2.3.0 (#316)
* Update Assert Statements For New TF 2.2.0 DLC (#320)
* Version Bump (#319)
* add a note for TF 2.2 limited support (#303)
  Co-authored-by: Miyoung Choi <cmiyoung@amazon.com>
  Co-authored-by: Nihal Harish <nihal42harish@gmail.com>
* TF 2.2 documentation update (#322)
  * update TF 2.2 smdebug features
  * Update code samples/notes for new pySDK and smdebug/add and fix links
  * add 'New features' note
  Co-authored-by: Miyoung Choi <cmiyoung@amazon.com>
* Adding pagination in list_training_jobs (#323)
  * Adding pagination in list_Training_jobs
* Test Custom Step Usecase (#331)
* save tf2 model (#333)
* Add ability to only save shapes of tensors (#328)
* Revert "Add ability to only save shapes of tensors (#328)" (#337)
  This reverts commit c9eb769.
* Function to Test If the hook has been configured with the Default hook config (#332)
* Default hook config (#338)
* version bump (#339)
* TF ZCC limitation footnote (#342)
* Ability to save shapes (#341)
  * WIP saveshape
  * Add shape writer
  * Add pytorch test
  * Add untested keras test
  * fix syntax
  * fix syntax
  * Import
  * Import
  * Add tests for TF
  * Simplify read code
  * Add read API and tests
  * Add mxnet test
  * Add s3 and json tests
  * lint
  * Fix payload
  * fix import
  * Handle different num tensors for losses
  * Fix exact equal condition
  * Fix mode bug
  * trigger CI
  * Add support for distributed training with writer map
  * Check that value throws exception
  * Fix tests to make them more resilient
  * Fix mxnet and pytorch tests
  * Remove tensor names
  * pre-commit
  * Fix get_mode
  * Fix bug with old index files
  * Fix keras test with names of tensors
  * Set original name to None if tf_obj is None
  * Fix mirrored test for cpu
  * Add docs
  * trigger CI
  * Fix shape writer get
  * Simplify by removing shape writer
  * Cleanup
  * Fix name of writer
  * Addressed review comments
  * trigger ci
  * retrigger CI
  Co-authored-by: NihalHarish <nihal42harish@gmail.com>
* Support Inputs and Labels in the dict format (#345)
* 0.9.4 (#347)
* Refactor Make Numpy Array (#329)
* warn gradtape users about tf.function support (#348)
* Support all tf types (#346)
* Model Subclassing Test (#351)
* Modify Should Save Tensor Test To Work on Any Version of TF (#352)
* framework version updates (#360)
* list training jobs improvements (#349)
  * Earlier list training job would make 50 attempts irrespective. This may be bad because of unnecessary traffic.
  * if there are training jobs found with prefix, we break
  * if there are exceptions caught more than 5 times we break.
* Handle Deprecation Of experimental_ref api (#356)
* check file exist before moving (#364)
  * check file exist before moving when closing the file.
* Support Saving Tensors in Graph Mode with add_for_mode (#353)
* Change layer name logic (#357)
* Pass Variable Length Argument To Old Function Call (#366)
* test concat layers (#367)
* Update README.md (#371)
* Pinning the version of tensorflow_datasets package so that it does not require updating TF (#373)
  Co-authored-by: NihalHarish <nihal42harish@gmail.com>
* Bugfix: Debugger breaks if should_save_tensor is called before collections are prepared (#372)
* Fixing the nightly build pipelines. Avoid force reinstall of rules package when not necessary (#374)
* returning list instead of dict keys (#376)
  fix in return of _get_sm_tj_jobs_with_prefix. This function should return list always.
* Add support for mixed precision training (#378)
* Modify Asserts to Work with TF 2.1.0 and TF 2.0.0 (#380)
* pytorch tmp (#382)
* extend zcc to 2.1.2 (#384)
* disable pytorch (#386)
* Removed the redundant installation of smdebug and smdebug-rules (#391)
* Incrementing the version to 0.9.5 (#396)
* pin tensorflow dataset in test config (#399)
  * add back test
  * revert some changes
  * unpin pytest version

Co-authored-by: Nihal Harish <nihal42harish@gmail.com>
Co-authored-by: Vikas-kum <vikumar@amazon.com>
Co-authored-by: Vandana Kannan <vandanavk@users.noreply.github.com>
Co-authored-by: Anirudh <anirudhkrec@gmail.com>
Co-authored-by: Miyoung <myoung8739@gmail.com>
Co-authored-by: Miyoung Choi <cmiyoung@amazon.com>
Co-authored-by: Rahul Huilgol <huilgolr@amazon.com>
Co-authored-by: Amol Lele <19983848+leleamol@users.noreply.github.com>
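The `list training jobs improvements (#349)` entry describes its retry policy only in words: stop as soon as jobs with the prefix are found, and give up once more than five exceptions have been caught, rather than always making 50 attempts. A minimal sketch of that control flow follows. It assumes a hypothetical `fetch_page` callable standing in for the real SageMaker client call; the function name, parameters, and page shape are illustrative and are not smdebug's actual code.

```python
from typing import Callable, List, Optional, Tuple

# Hypothetical page fetcher: takes a pagination token and returns
# (job names on this page, token for the next page or None).
FetchPage = Callable[[Optional[str]], Tuple[List[str], Optional[str]]]


def find_jobs_with_prefix(
    fetch_page: FetchPage,
    prefix: str,
    max_attempts: int = 50,
    max_exceptions: int = 5,
) -> List[str]:
    """Page through job names, stopping early on a match or on repeated errors."""
    token: Optional[str] = None
    exceptions_seen = 0
    for _ in range(max_attempts):
        try:
            names, token = fetch_page(token)
        except Exception:
            exceptions_seen += 1
            # Give up once more than `max_exceptions` errors have been caught.
            if exceptions_seen > max_exceptions:
                break
            continue
        matches = [name for name in names if name.startswith(prefix)]
        if matches:
            # Jobs with the prefix were found, so no further requests are made.
            return matches
        if token is None:
            # No more pages to read.
            break
    # Always return a list, even when nothing was found.
    return []


def demo_fetch(token: Optional[str]) -> Tuple[List[str], Optional[str]]:
    """Two fake pages used only to exercise the sketch."""
    pages = {
        None: (["other-job"], "page-2"),
        "page-2": (["my-job-1", "my-job-2"], "page-3"),
    }
    return pages[token]
```

Calling `find_jobs_with_prefix(demo_fetch, "my-job")` reads the second fake page and then stops, without requesting `page-3`.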
1 parent 29fd12e, commit a996f1e
Showing 13 changed files with 75 additions and 29 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
|  |  | @@ -1 +1 @@ |
| 1 |  | -__version__ = "0.9.4" |
|  | 1 | +__version__ = "0.9.5" |
Original file line number | Diff line number | Diff line change |
---|---|---|
|  |  | @@ -28,6 +28,7 @@ |
|  |  | except ImportError: |
|  |  |     herring = None |
|  |  |  |
|  |  |  |
|  |  | DEFAULT_INCLUDE_COLLECTIONS = [CollectionKeys.LOSSES] |
|  |  |  |
|  |  |  |
Original file line number | Diff line number | Diff line change |
---|---|---|
|  |  | @@ -1,31 +1,32 @@ |
| 1 | 1 |  # Third Party |
| 2 | 2 |  import numpy as np |
| 3 |  | -from packaging import version |
|  | 3 | +import pytest |
| 4 | 4 |  from tensorflow.python.framework.dtypes import _NP_TO_TF |
|  | 5 | +from tests.tensorflow2.utils import is_tf_2_2 |
| 5 | 6 |  |
| 6 | 7 |  # First Party |
| 7 | 8 |  from smdebug.core.tfevent.util import _get_proto_dtype |
| 8 | 9 |  |
| 9 | 10 |  |
|  | 11 | +@pytest.mark.skipif( |
|  | 12 | +    is_tf_2_2() is False, reason="Brain Float Is Unavailable in lower versions of TF" |
|  | 13 | +) |
| 10 | 14 |  def test_tensorflow2_datatypes(): |
| 11 | 15 |      # _NP_TO_TF contains all the mappings |
| 12 | 16 |      # of numpy to tf types |
| 13 | 17 |      try: |
| 14 |  | -        from tensorflow import __version__ as tf_version |
|  | 18 | +        from tensorflow.python import _pywrap_bfloat16 |
| 15 | 19 |  |
| 16 |  | -        if version.parse(tf_version) >= version.parse("2.0.0"): |
| 17 |  | -            from tensorflow.python import _pywrap_bfloat16 |
| 18 |  | - |
| 19 |  | -            # TF 2.x.x Implements a Custom Numpy Datatype for Brain Floating Type |
| 20 |  | -            # Which is currently only supported on TPUs |
| 21 |  | -            _np_bfloat16 = _pywrap_bfloat16.TF_bfloat16_type() |
| 22 |  | -            _NP_TO_TF.pop(_np_bfloat16) |
|  | 20 | +        # TF 2.x.x Implements a Custom Numpy Datatype for Brain Floating Type |
|  | 21 | +        # Which is currently only supported on TPUs |
|  | 22 | +        _np_bfloat16 = _pywrap_bfloat16.TF_bfloat16_type() |
|  | 23 | +        _NP_TO_TF.pop(_np_bfloat16) |
| 23 | 24 |      except (ModuleNotFoundError, ValueError, ImportError): |
| 24 | 25 |          pass |
| 25 | 26 |  |
| 26 | 27 |      for _type in _NP_TO_TF: |
| 27 | 28 |          try: |
| 28 | 29 |              _get_proto_dtype(np.dtype(_type)) |
| 29 | 30 |          except Exception: |
| 30 |  | -            assert False |
|  | 31 | +            assert False, f"{_type} not supported" |
| 31 | 32 |      assert True |
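Replacing the bare `assert False` with `assert False, f"{_type} not supported"` means a failing run names the offending dtype. The stand-alone sketch below shows the same idea taken one step further: it collects every failing type and reports them together instead of stopping at the first. The `SUPPORTED` table and `convert` function are hypothetical stand-ins for TensorFlow's `_NP_TO_TF` and smdebug's `_get_proto_dtype`; nothing here is the real smdebug API.

```python
# Hypothetical stand-in for the numpy-to-TF mapping; not the real _NP_TO_TF.
SUPPORTED = {"float32": "DT_FLOAT", "int32": "DT_INT32", "bool": "DT_BOOL"}


def convert(type_name: str) -> str:
    """Hypothetical converter: return a proto dtype name or raise ValueError."""
    try:
        return SUPPORTED[type_name]
    except KeyError:
        raise ValueError(f"unknown dtype: {type_name}") from None


def unsupported_types(type_names):
    """Try every type and return the names that fail to convert."""
    failures = []
    for name in type_names:
        try:
            convert(name)
        except Exception:
            failures.append(name)
    return failures


def check_all_supported(type_names):
    """Assert that every type converts, naming all failures in the message."""
    failures = unsupported_types(type_names)
    assert not failures, f"{failures} not supported"
```

Collecting failures first is a design choice for this sketch: a single run then reports every unsupported type, which is useful when a new framework version adds several dtypes at once.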