DM-33314: Implement missing methods for Cassandra backend #25

andy-slac · 2022-03-14T16:56:52Z

The APDB API gained a few methods on a previous ticket to support replication to/from PPDB. This update implements those new methods in the Cassandra backend. The Cassandra code was also updated to use execution profiles, because the previous model is being deprecated. A couple of commits continue the refactoring of the Cassandra implementation. It makes sense to look at the final diff, since some pieces were changed more than once.
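
For readers unfamiliar with execution profiles in the DataStax Python driver, here is a minimal sketch of the idea. The profile names `read` and `write`, the timeouts, and the consistency levels are illustrative assumptions, not the values used in this PR. The driver import is deferred so the snippet loads even where `cassandra-driver` is not installed.

```python
from typing import Any, Dict, Sequence

# Hypothetical per-workload settings; the real values live in the APDB config.
PROFILE_SETTINGS: Dict[str, Dict[str, Any]] = {
    "read": {"consistency": "ONE", "timeout": 60.0},
    "write": {"consistency": "QUORUM", "timeout": 10.0},
}


def make_session(contact_points: Sequence[str], keyspace: str) -> Any:
    """Connect to a cluster with one execution profile per workload."""
    # Deferred import: requires the cassandra-driver package at call time.
    from cassandra import ConsistencyLevel
    from cassandra.cluster import EXEC_PROFILE_DEFAULT, Cluster, ExecutionProfile

    profiles = {EXEC_PROFILE_DEFAULT: ExecutionProfile()}
    for name, cfg in PROFILE_SETTINGS.items():
        profiles[name] = ExecutionProfile(
            consistency_level=getattr(ConsistencyLevel, cfg["consistency"]),
            request_timeout=cfg["timeout"],
        )

    # Settings are passed through profiles rather than through the legacy
    # cluster-level and session-level attributes.
    cluster = Cluster(contact_points=list(contact_points), execution_profiles=profiles)
    return cluster.connect(keyspace)
```

A query then selects its profile by name, for example `session.execute(statement, params, execution_profile="read")`.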

Packing columns into blobs did not help with performance, and storage size grows too. Not useful, getting rid of it to simplify things.

n8pease · 2022-03-14T23:27:27Z

python/lsst/dax/apdb/apdbCassandra.py

+                # prepare it because it's not reusable.
+                statement = cassandra.query.SimpleStatement(full_query)
+            statements.append((statement, params))
+        _LOG.debug("getDiaObjects: #queries: %s", len(statements))
        # _LOG.debug("getDiaObjects: queries: %s", queries)


remove commented line?

n8pease · 2022-03-14T23:35:14Z

python/lsst/dax/apdb/apdbCassandra.py

+                values = (ssObjectId, apdb_part, apdb_time_part, diaSourceId)
+            queries.add(self._prep_statement(query), values)
+
+        # _LOG.debug("query: %s", query)


n8pease · 2022-03-15T16:01:15Z

python/lsst/dax/apdb/apdbCassandraSchema.py

-        if not clust_columns:
-            raise ValueError(f"Table {table_name} configuration is missing primary index")
+        # if not clust_columns:
+        #     raise ValueError(f"Table {table_name} configuration is missing primary index")


uncomment? remove lines?

I forgot to clean that up after testing, will remove it completely.

n8pease · 2022-03-15T16:11:47Z

LGTM!

andy-slac added 2 commits March 6, 2022 22:14

Drop support for packed columns.

8f1b735

Packing columns into blobs did not help with performance, and storage size grows too. Not useful, getting rid of it to simplify things.

Switch to using execution profiles

c25b6e5

n8pease reviewed Mar 14, 2022

n8pease reviewed Mar 15, 2022

andy-slac added 6 commits March 15, 2022 11:12

Add support for SSObject table to Cassandra implementation.

c07ddb3

The implementation uses ssObjectId as the partitioning key; there are no other natural keys, and the table is too large for a single partition. This may be reconsidered once we learn more about how SSObjects will be queried.

Fix mypy warnings

88df8dd

Refactoring of cassandra code

b413ad3

Implement reassignDiaSources for Cassandra.

1042492

Efficient search by diaSourceId needs another table that is partitioned by that column. There may be other ways to implement association; this needs more thought.
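
The reasoning above can be illustrated with a small in-memory model. This is only a sketch under assumptions: the class `DiaSourceStore` and its attributes are hypothetical, plain dictionaries stand in for Cassandra tables, and the real schema may differ. It shows why a second table keyed by diaSourceId lets a reassignment find the `(apdb_part, apdb_time_part)` partition without scanning every partition.

```python
from typing import Dict, Optional, Tuple

Partition = Tuple[int, int]  # (apdb_part, apdb_time_part)


class DiaSourceStore:
    """Toy model: a main table plus a lookup table keyed by diaSourceId."""

    def __init__(self) -> None:
        # Main "table": partition -> {diaSourceId: ssObjectId or None}.
        self.by_partition: Dict[Partition, Dict[int, Optional[int]]] = {}
        # Extra "table" partitioned by diaSourceId -> owning partition.
        self.partition_of: Dict[int, Partition] = {}

    def insert(self, dia_source_id: int, apdb_part: int, apdb_time_part: int) -> None:
        part = (apdb_part, apdb_time_part)
        self.by_partition.setdefault(part, {})[dia_source_id] = None
        self.partition_of[dia_source_id] = part

    def reassign(self, id_map: Dict[int, int]) -> None:
        """Associate each diaSourceId in id_map with the given ssObjectId."""
        for dia_source_id, ss_object_id in id_map.items():
            # Single-key lookup instead of a scan over all partitions.
            part = self.partition_of[dia_source_id]
            self.by_partition[part][dia_source_id] = ss_object_id


store = DiaSourceStore()
store.insert(dia_source_id=1001, apdb_part=42, apdb_time_part=630)
store.insert(dia_source_id=1002, apdb_part=7, apdb_time_part=631)
store.reassign({1002: 5})
```

The cost of this design is one extra write per DiaSource to keep the lookup table in step with the main table.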

Another refactoring of Cassandra implementation.

2a76eba

Remove non_prepared_statements option, always use prepared.
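
A prepared statement only pays off if it is prepared once and reused. The sketch below shows one way a helper like `_prep_statement` could cache statements by query text. It is an assumption about the general approach, not the code in this PR. `PreparedStatementCache` and `_FakeSession` are hypothetical names, and the fake session only mimics the driver's `Session.prepare(query)` call so the example runs without a cluster.

```python
from typing import Any, Dict


class PreparedStatementCache:
    """Prepare each distinct query string once and reuse the result."""

    def __init__(self, session: Any) -> None:
        self._session = session
        self._prepared: Dict[str, Any] = {}

    def prep_statement(self, query: str) -> Any:
        stmt = self._prepared.get(query)
        if stmt is None:
            # The first use of this query text triggers a prepare() call.
            stmt = self._session.prepare(query)
            self._prepared[query] = stmt
        return stmt


class _FakeSession:
    """Stand-in for a driver session that counts prepare() calls."""

    def __init__(self) -> None:
        self.calls = 0

    def prepare(self, query: str) -> Any:
        self.calls += 1
        return ("prepared", query)


fake = _FakeSession()
cache = PreparedStatementCache(fake)
first = cache.prep_statement('SELECT * FROM "DiaObject" WHERE "apdb_part" = ?')
second = cache.prep_statement('SELECT * FROM "DiaObject" WHERE "apdb_part" = ?')
```

This works best when the query text uses `?` placeholders for values, so that the same string is reused across calls.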

Implement history search for Cassandra

7a03f92

andy-slac force-pushed the tickets/DM-33314 branch from 7cde542 to 7a03f92 March 15, 2022 18:15

andy-slac merged commit 7e0c5f6 into main Mar 15, 2022

andy-slac deleted the tickets/DM-33314 branch March 15, 2022 18:17
