Apply EAV data model to system attributes. #162

toshihikoyanase · 2018-08-23T02:22:36Z

System attributes are converted to a JSON-dumped string, and stored as an entry of user attributes.
If users uses RDB backend, errors occur when dumped system attributes exceed max length of an entry of user attributes.
This PR apply EAV data model to system attributes to avoid such errors.
Schema version will be updated from 5 to 7 because version 6 will be assigned to this change about study names.

…to it.

…ad-only.

toshihikoyanase · 2018-09-12T02:41:11Z

Schema version is changed due to this change. The version will be updated from 7 to 8 at this time.
This PR should be merged after PR Add Study.study_name #157 to keep the order of schema version. Otherwise, we need to change schema version again.

g-votte · 2018-09-12T06:12:32Z

tests/storages_tests/test_storages.py

@@ -24,7 +23,7 @@
    'json_serializable': {'baseline_score': 0.001, 'tags': ['image', 'classification']},
 }

-EXAMPLE_USER_ATTRS = dict(EXAMPLE_SYSTEM_ATTRS, **{SYSTEM_ATTRS_KEY: {}})  # type: Dict[str, Any]
+EXAMPLE_USER_ATTRS = EXAMPLE_SYSTEM_ATTRS  # type: Dict[str, Any]


Now, EXAMPLE_USER_ATTRS and EXAMPLE_SYSTEM_ATTRS have the same structure. I think we can use a common variable name such as: EXAMPLE_ATTRS.

Thank you for your suggestion. Those two variables are merged into EXAMPLE_ATTRS.

g-votte · 2018-09-12T06:32:05Z

pfnopt/storages/rdb/storage.py

+
+        study = models.StudyModel.find_or_raise_by_id(study_id, session)
+        system_attr = models.StudySystemAttributeModel.find_by_study_and_key(study, key, session)
+        # TODO(Yanase): KeyError may be inconsistent with ValueError raised by missing study_id.


Thank you for raising attention for this matter. As far as referring Python document, KeyError is supposed to be used in a dictionary. Taking this also in consideration, ValueError seems more appropriate here.

Raised when a mapping (dictionary) key is not found in the set of existing keys.

Thank you for your answer to my question. This KeyError was replaced with ValueError. Corresponding method in in_memory.py is also fixed to keep the consistency.

g-votte · 2018-09-12T07:26:17Z

pfnopt/storages/rdb/storage.py

+        else:
+            attribute.value_json = json.dumps(value)
+
+        session.commit()


We have better take care of IntegrityError; otherwise, this line may fail on multi-worker environment.

I agree with you. The set methods for both study and trial need this fix. As discussed offline, I will resolve this issue in another PR because it requires a certain amount of changes.

…e TODO comment.

g-votte · 2018-09-12T13:22:27Z

pfnopt/storages/rdb/storage.py

+        system_attr = models.StudySystemAttributeModel.find_by_study_and_key(study, key, session)
+        if system_attr is None:
+            raise ValueError(
+                'System attribute {} does not exist in Study {}.'.format(key, study_id))


study_id should not be taken care of by a typical user. We may wanna display study name instead. (..or I think it's also OK not to display the information, such as "the study", to avoid complicated merge dependency.)

You're right. study_id should not be visible to ordinal users, and we need to use study_uuid or study_name for this purpose. At this time, I just use "the study" to avoid merge dependency as you mentioned. Thanks.

g-votte · 2018-09-12T13:25:20Z

pfnopt/storages/rdb/models.py

@@ -18,7 +18,7 @@
 from pfnopt.structs import StudyTask
 from pfnopt.structs import TrialState

-SCHEMA_VERSION = 6
+SCHEMA_VERSION = 8


As you may notice, let's take care of version number before merging.

g-votte

Except SCHEMA_VERSION, LGTM!

toshihikoyanase · 2018-09-13T08:54:51Z

SCHEMA_VERSION was changed again from 8 to 7 to merge this PR before #157.

iwiwi

LGTM. My concern is that there is some inconsistency between system attrs and user attrs (e.g., get_trial_user_attrs and get_trial_system_attr), but this should be addressed in later PRs.

toshihikoyanase added 5 commits August 22, 2018 17:26

Extract system attribute from user attribute to apply EAV data model …

31227ea

…to it.

Apply EAV data model to Trial.user_attrs and Trial.system_attrs.

af0e5ec

Apply copy.deepcopy to get_trial_system_attr to make return values re…

96cb53d

…ad-only.

Merge branch 'master' into add-system-attribute-table

a2c656c

Remove unused columns from a database table.

94a89b9

toshihikoyanase changed the title ~~[WIP] Apply EAV data model to system attributes.~~ Apply EAV data model to system attributes. Sep 12, 2018

Increase max length of keys.

fc7c68f

g-votte requested changes Sep 12, 2018

View reviewed changes

toshihikoyanase added 3 commits September 12, 2018 17:49

Change type of errors. Merged duplicated variables.

c5b8d22

Merge branch 'master' into add-system-attribute-table

36bed23

Use self._commit instead of session.commit. This is tentative fix. Se…

2b33310

…e TODO comment.

toshihikoyanase mentioned this pull request Sep 12, 2018

Take care of IntegrityError on multi-worker environment. #168

Closed

g-votte reviewed Sep 12, 2018

View reviewed changes

Remove study_id from an error message.

2d51595

g-votte approved these changes Sep 13, 2018

View reviewed changes

SCHEMA_VERSION was rollbacked from 8 to 7.

5fdd9e8

iwiwi approved these changes Sep 13, 2018

View reviewed changes

iwiwi merged commit 75e3fe5 into master Sep 13, 2018

toshihikoyanase mentioned this pull request Sep 13, 2018

Add Study.study_name #157

Merged

toshihikoyanase deleted the add-system-attribute-table branch November 29, 2018 04:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Apply EAV data model to system attributes. #162

Apply EAV data model to system attributes. #162

toshihikoyanase commented Aug 23, 2018

toshihikoyanase commented Sep 12, 2018

g-votte Sep 12, 2018

toshihikoyanase Sep 12, 2018

g-votte Sep 12, 2018

toshihikoyanase Sep 12, 2018

g-votte Sep 12, 2018

toshihikoyanase Sep 12, 2018

g-votte Sep 12, 2018 •

edited

toshihikoyanase Sep 13, 2018

g-votte Sep 12, 2018

g-votte left a comment

toshihikoyanase commented Sep 13, 2018

iwiwi left a comment

Apply EAV data model to system attributes. #162

Apply EAV data model to system attributes. #162

Conversation

toshihikoyanase commented Aug 23, 2018

toshihikoyanase commented Sep 12, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

g-votte Sep 12, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

g-votte left a comment

Choose a reason for hiding this comment

toshihikoyanase commented Sep 13, 2018

iwiwi left a comment

Choose a reason for hiding this comment

g-votte Sep 12, 2018 •

edited