Don't fail when inserting UDTs with prepared queries with some missing fields #224 #1151

Mokto · 2023-04-26T12:09:04Z

Let's say you have an object:

class Address(object):

    def __init__(self, street, zipcode, **kwargs):
        self.street = street
        self.zipcode = zipcode

cluster.register_user_type('mykeyspace', 'address', Address)

And let's say the type actually contains another field, let's call i raw_address

Then inserting data through a prepared statement will actually fail : the driver will complain raw_address is missing.

This change addresses that, as any field should be optional.

absurdfarce · 2023-05-03T15:22:21Z

Thanks for the PR @Mokto!

Have you signed the Contributor License Agreement for contributions to DataStax open source projects? If not you can find it at https://cla.datastax.com/. Thanks!

Mokto · 2023-05-03T16:09:06Z

I just did ;)

Mokto · 2023-05-09T04:44:44Z

Hi,
Any update on this ?

Thanks.

fruch

LGTM

absurdfarce · 2023-05-24T20:40:50Z

@Mokto Apologies for the delay in getting back to this; we've got a lot of balls in the air at the moment.

A few notes, perhaps mostly to myself. First off this works fine via CQL:

drop keyspace if exists mykeyspace;
create KEYSPACE mykeyspace WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1};
use mykeyspace;
CREATE TYPE address (street text, zipcode int, raw_address text);
CREATE TABLE users (id int PRIMARY KEY, location frozen<address>);
INSERT INTO users (id, location) VALUES (0, {street: '123 Main St.', zipcode: 78723});
cqlsh:mykeyspace> select * from users;

 id | location
----+-------------------------------------------------------------
  0 | {street: '123 Main St.', zipcode: 78723, raw_address: null}

(1 rows)

Second this change really only affects the prepared statement case. When an "incomplete" (for lack of a better term) instance representing a UDT type is fed to a simple statement the input type is converted to something like the UDT literal above so you get basically the same behaviour. In the prepared statement case we actually know the type that's expected so we can attempt to serialize based on cassandra.cqltypes.UserType, which lands us in the code cited above.

Finally, an explicit repro case:

from cassandra.cluster import Cluster

cluster = Cluster()
session = cluster.connect()
session.execute("drop keyspace if exists mykeyspace")
session.execute("create KEYSPACE mykeyspace WITH replication = {'class': 'SimpleStrategy', 'replication_factor': 1}")
session.set_keyspace('mykeyspace')
session.execute("CREATE TYPE address (street text, zipcode int, raw_address text)")
session.execute("CREATE TABLE users (id int PRIMARY KEY, location frozen<address>)")

# create a class to map to the "address" UDT                                                                                                                                                                         
class Address(object):

    def __init__(self, street, zipcode):
        self.street = street
        self.zipcode = zipcode

#cluster.register_user_type('mykeyspace', 'address', Address)                                                                                                                                                        
insert_statement = session.prepare("INSERT INTO users (id, location) VALUES (?, ?)")
session.execute(insert_statement, [0, Address("123 Main St.", 78723)])

Without this change (and using the repro code above) you get:

$ python foo.py 
Traceback (most recent call last):
  File "cassandra/cqltypes.py", line 1027, in cassandra.cqltypes.UserType.serialize_safe
TypeError: 'Address' object is not subscriptable

During handling of the above exception, another exception occurred:

Traceback (most recent call last):
  File "foo.py", line 28, in <module>
    session.execute(insert_statement, [0, Address("123 Main St.", 78723)])
  File "cassandra/cluster.py", line 2634, in cassandra.cluster.Session.execute
  File "cassandra/cluster.py", line 2677, in cassandra.cluster.Session.execute_async
  File "cassandra/cluster.py", line 2880, in cassandra.cluster.Session._create_response_future
  File "cassandra/query.py", line 506, in cassandra.query.PreparedStatement.bind
  File "cassandra/query.py", line 636, in cassandra.query.BoundStatement.bind
  File "cassandra/cqltypes.py", line 796, in cassandra.cqltypes._ParameterizedType.serialize
  File "cassandra/cqltypes.py", line 1030, in cassandra.cqltypes.UserType.serialize_safe
AttributeError: 'Address' object has no attribute 'raw_address'

With the fix you get no error and a row matching the CQL example above added.

Mokto · 2023-05-25T05:06:34Z

Thanks!

…sync_with_upstream_3.29.1 version 3.28.0 * tag '3.28.0' of https://github.com/datastax/python-driver: Release 3.28.0: changelog & version PYTHON-1352 Add vector type, codec + support for parsing CQL type (apache#1161) Update docs.yaml to point to most recent 3.27.0 docs changes CONN-38 Notes for 3.27.0 on PYTHON-1350 (apache#1166) PYTHON-1356 Create session-specific protocol handlers to contain session-specific CLE policies (apache#1165) PYTHON-1350 Store IV along with encrypted text when using column-level encryption (apache#1160) PYTHON-1351 Convert cryptography to an optional dependency (apache#1164) Jenkinsfile cleanup (apache#1163) PYTHON-1343 Use Cython for smoke builds (apache#1162) Don't fail when inserting UDTs with prepared queries with some missing fields (apache#1151) Revert "remove unnecessary import __future__ (apache#1156)" docs: convert print statement to function in docs (apache#1157) remove unnecessary import __future__ (apache#1156) Update docs.yaml to include recent fixes to CLE docs Fix for rendering of code blocks in CLE documentation (apache#1159) DOC-3278 Update comment for retry policy (apache#1158) DOC-2813 (apache#1145) Remove different build matrix selection for develop branches (apache#1138)

Allow extra field when inserting with prepared queries

c7b4491

Mokto mentioned this pull request Apr 26, 2023

Allow extra field when inserting with prepared queries scylladb/python-driver#224

Merged

absurdfarce added the cla-missing label May 3, 2023

absurdfarce removed the cla-missing label May 9, 2023

fruch approved these changes May 10, 2023

View reviewed changes

absurdfarce approved these changes May 24, 2023

View reviewed changes

absurdfarce changed the title ~~Allow extra field when inserting with prepared queries #224~~ Don't fail when inserting UDTs with prepared queries with some missing fields #224 May 24, 2023

absurdfarce merged commit d8431d4 into apache:master May 24, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Don't fail when inserting UDTs with prepared queries with some missing fields #224 #1151

Don't fail when inserting UDTs with prepared queries with some missing fields #224 #1151

Uh oh!

Mokto commented Apr 26, 2023

Uh oh!

absurdfarce commented May 3, 2023

Uh oh!

Mokto commented May 3, 2023

Uh oh!

Mokto commented May 9, 2023

Uh oh!

fruch left a comment

Uh oh!

absurdfarce commented May 24, 2023

Uh oh!

Mokto commented May 25, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Don't fail when inserting UDTs with prepared queries with some missing fields #224 #1151

Don't fail when inserting UDTs with prepared queries with some missing fields #224 #1151

Uh oh!

Conversation

Mokto commented Apr 26, 2023

Uh oh!

absurdfarce commented May 3, 2023

Uh oh!

Mokto commented May 3, 2023

Uh oh!

Mokto commented May 9, 2023

Uh oh!

fruch left a comment

Choose a reason for hiding this comment

Uh oh!

absurdfarce commented May 24, 2023

Uh oh!

Mokto commented May 25, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants