Added update_metadata() to write_ adapters #716

jmaruland · 2022-07-08T15:03:01Z

This PR started as an approach to add more information to the document schema that will help to keep track of all the updates made to the metadata and specs of the samples.
In addition, we implemented a revisions system with the mongo database that will keep track of old versions of documents. Every time that update_metadata() is run, the active document that is saved in collections is copied to revisions where every entry is protected by the same key id plus a revision number. There two parameters is used as a unique identifier for every entry.

databroker/experimental/server_ext.py

danielballan · 2022-07-29T14:47:13Z

databroker/experimental/schemas.py

+
+class DocumentRevision(BaseDocument):
+    revision: int
+


It may be useful to add a classmethod constructor here to do what you were trying to do in __init__.

@classmethod def from_document(cls, document) return cls(key=document.key, ...)

databroker/experimental/schemas.py

danielballan · 2022-07-29T14:48:44Z

databroker/experimental/server_ext.py

@@ -62,8 +63,9 @@ def inner(self, *args, **kwargs):
 class WritingArrayAdapter:
    structure_family = "array"

-    def __init__(self, collection, directory, doc):
+    def __init__(self, collection, revisions, directory, doc):


I think it would make sense to pass in the database here rather than separately passing in each of its collections.

Fixed in the latest commit

databroker/experimental/server_ext.py

danielballan · 2022-07-29T14:53:19Z

databroker/experimental/server_ext.py

+        updated_at = datetime.now(tz=timezone.utc)
+        self.doc.updated_at = updated_at
+
+        if len(metadata) > 0:


If I want to update metadata to be empty {} or specs to be empty [], shouldn't that update be processed?

Fixed in the latest commit

danielballan · 2022-07-29T14:53:47Z

databroker/experimental/server_ext.py

+        )
+
+        if result.matched_count != result.modified_count:
+            raise ValueError("Error while writing to database")


I suggest classifying this as a RuntimeError.

danielballan · 2022-08-05T17:50:46Z

Now that we are adding indexes, I think we should also add an index to the nodes collection. This will make lookup by key faster. We might as well also enforce it to be unique. Using UUID4 should achieve that result anyway, but it doesn't hurt to claim uniqueness via an index as well.

danielballan

Seems on track. A couple comments. More tests would be good, too.

danielballan · 2022-08-30T14:55:34Z

databroker/experimental/server_ext.py

+    def __len__(self):
+        return self._collection.count_documents(
+            {"key": self._key}
+        )  # maybe wrong MongoDB usage here...


Delete comment (assuming this usage is now correct).

danielballan · 2022-08-30T15:01:36Z

databroker/experimental/server_ext.py

+            {"key": self._key}
+        )  # maybe wrong MongoDB usage here...
+
+    def __getitem__(self, item_):


This looks mixed up and likely needs testing.

The usage r[i:j] should lead to skip(offset).limit(j - i). The usage r[i:] or r[i:None] or r[:-1] (all equivalent) should lead to skip(offset) with no limit. Pymongo also accept skip(offset).limit(0) where 0 means "no limit", which is an option if you find it leads to cleaner code.

danielballan · 2022-08-30T15:02:09Z

databroker/experimental/server_ext.py

+        if now > self.deadline:
+            self._doc = Document(
+                **self.collection.find_one({"key": self.key})
+            )  # run query


This comment seems superfluous. :-)

danielballan · 2022-08-30T15:03:23Z

databroker/experimental/server_ext.py

+    def create_indexes(self):
+        self.revision_coll.create_index(
+            [("key", pymongo.ASCENDING), ("revision", pymongo.DESCENDING)], unique=True
+        )


While we're creating indexes, we should also create an index on the nodes collection to ensure that key is unique.

danielballan reviewed Jul 8, 2022

View reviewed changes

databroker/experimental/server_ext.py Outdated Show resolved Hide resolved

danielballan reviewed Jul 29, 2022

View reviewed changes

jmaruland added 4 commits August 4, 2022 11:57

Added created_at and updated_at to document schema

0a09403

First draft to update metadata

2735195

Fixed instances

29f50ad

Added update_metadata to array and df clients

0518b1d

danielballan force-pushed the Add-timestamps-to-experimental-document branch from 5583ff5 to 0518b1d Compare August 4, 2022 16:20

danielballan added 3 commits August 4, 2022 12:42

Update new signature.

aaf6ee6

Update test for keyword-only args.

970dfbe

Rebase put a copy of this on COO but not DF.

96f6a4a

jmaruland changed the title ~~Added created_at and updated_at to document schema~~ Added update_metadata() to write_ adapters Aug 4, 2022

jmaruland mentioned this pull request Aug 4, 2022

Support updating metadata and accessing metadata revision history bluesky/tiled#266

Merged

jmaruland added 4 commits August 5, 2022 19:31

Added Dan's reviews

59f4f4b

Added support of revisions to metadata

92ac52b

Split update metadata test case in two simpler tests cases

924f26f

Made revisions tests compatible with pagination packets

21cdeda

danielballan reviewed Aug 30, 2022

View reviewed changes

jmaruland and others added 4 commits September 7, 2022 01:04

Added batch of reviews and fixed test cases

0224f36

structure() should return dataclass not pydantic

e595a6d

Added last group of reviews

40a7215

updated version requirement for tiled

e4e7509

danielballan approved these changes Sep 14, 2022

View reviewed changes

danielballan merged commit bafe6c5 into bluesky:main Sep 14, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added update_metadata() to write_ adapters #716

Added update_metadata() to write_ adapters #716

jmaruland commented Jul 8, 2022 •

edited

Loading

danielballan Jul 29, 2022

danielballan Jul 29, 2022

jmaruland Aug 5, 2022

danielballan Jul 29, 2022

jmaruland Aug 5, 2022

danielballan Jul 29, 2022

danielballan commented Aug 5, 2022

danielballan left a comment

danielballan Aug 30, 2022

danielballan Aug 30, 2022

danielballan Aug 30, 2022

danielballan Aug 30, 2022

Added update_metadata() to write_ adapters #716

Added update_metadata() to write_ adapters #716

Conversation

jmaruland commented Jul 8, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

danielballan commented Aug 5, 2022

danielballan left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jmaruland commented Jul 8, 2022 •

edited

Loading