[GEN-2381] Pandas handling of nullable cells by BryanFauble · Pull Request #1272 · Sage-Bionetworks/synapsePythonClient

BryanFauble · 2025-11-06T18:44:20Z

Verified that the system correctly handles null values during upserts and that the data is stored and retrieved accurately.

Problem:

When querying for, inserting, or updating cells of data that can be nullable things start to break as described in https://sagebionetworks.jira.com/browse/GEN-2381
Using convert_dtypes introduces int64 data not serialized by json error and attributes not matching error in integration tests such as StringDtype vs object.

Solution:

Use the suggestion from @danlu1 to use both the convert_dtypes and the dtype argument when reading in a CSV to pandas DF

Testing:

Unit tests and integration tests are extended and run though successfully.
More testing within Genie is needed

…ts and that the data is stored and retrieved accurately.\

…n type

thomasyu888 · 2025-11-14T18:15:22Z

        pd.testing.assert_series_equal(
-            results["column_string"], pd.DataFrame(dict_data)["column_string"]
+            results["column_string"],
+            pd.DataFrame(dict_data)["column_string"].convert_dtypes(),


For discussion: Should expected dataframes in these tests be created with the dtypes set so that we are testing the fact that convert_dtypes is being done?

The concern here is if there are issues with "convert_dtypes" function, it won't be caught because we are using the function itself.

The only reason I added convert_dtypes is to make data types match between columns. But I forgot that we can use check_dtype=False instead. I would prefer to use check_dtype because it would be a quicker fix.

rxu17 · 2025-11-20T00:40:40Z

It's still expected that the result of calling query will return object type for something that is integer type and has nulls.

Then using convert_dtypes() would convert it to the correct pandas dtype (in this case Int64)What we are trying to prevent was that previously query would return something from a table that is integer with nulls -> float.

If that is the case, then i think this is good to go for the genie scenario.

rxu17

LGTM! Just a comment

* Adding support for Python 3.14 and dropping support for python 3.9

- Verified that the system correctly handles null values during upser…

2efe8e3

…ts and that the data is stored and retrieved accurately.\

BryanFauble requested review from danlu1 and rxu17 November 6, 2025 18:44

rxu17 reviewed Nov 6, 2025

View reviewed changes

Comment thread synapseclient/models/mixins/table_components.py

rxu17 reviewed Nov 6, 2025

View reviewed changes

Comment thread synapseclient/models/mixins/table_components.py Outdated

rxu17 reviewed Nov 6, 2025

View reviewed changes

Comment thread synapseclient/models/mixins/table_components.py

Patch unit test

bb9a5fd

danlu1 reviewed Nov 7, 2025

View reviewed changes

Comment thread synapseclient/models/mixins/table_components.py Outdated

Comment thread synapseclient/models/mixins/table_components.py Outdated

danlu1 added 2 commits November 11, 2025 19:25

resolve ambiguity error for np array and convert np.bool to regular bool

3abf87e

add unit test cases for construct_partial_rows

e3de13a

danlu1 reviewed Nov 11, 2025

View reviewed changes

Comment thread synapseclient/models/mixins/table_components.py Outdated

danlu1 reviewed Nov 11, 2025

View reviewed changes

Comment thread synapseclient/models/mixins/table_components.py Outdated

danlu1 added 5 commits November 13, 2025 17:46

preserving none in list column items

fd4b787

add unit test for csv_to_pandas_df

7e7e234

add convert_types to csv_to_pandas_df so it renders the correct colum…

b9cafb6

…n type

fix int64 can not be serialized by json error

b981781

formatting

729c339

danlu1 marked this pull request as ready for review November 13, 2025 23:46

danlu1 requested a review from a team as a code owner November 13, 2025 23:46

thomasyu888 reviewed Nov 14, 2025

View reviewed changes

Comment thread synapseclient/models/mixins/table_components.py Outdated

thomasyu888 reviewed Nov 14, 2025

View reviewed changes

Comment thread synapseclient/models/mixins/table_components.py Outdated

thomasyu888 reviewed Nov 14, 2025

View reviewed changes

BryanFauble commented Nov 14, 2025

View reviewed changes

Comment thread synapseclient/models/mixins/table_components.py Outdated

danlu1 added 2 commits November 14, 2025 23:31

use check_dtypes to bypass dtypes checking

897c56a

add dtype conversion function

bbaeebb

BryanFauble requested a review from rxu17 November 18, 2025 16:35

rxu17 reviewed Nov 20, 2025

View reviewed changes

rxu17 approved these changes Nov 20, 2025

View reviewed changes

[SYNPY-1690] python 3 14 (#1273)

29e6b97

* Adding support for Python 3.14 and dropping support for python 3.9

BryanFauble merged commit ee64f1d into develop Nov 20, 2025
16 of 20 checks passed

thomasyu888 deleted the gen-2381-table-typing branch April 16, 2026 03:43

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[GEN-2381] Pandas handling of nullable cells#1272

[GEN-2381] Pandas handling of nullable cells#1272
BryanFauble merged 12 commits intodevelopfrom
gen-2381-table-typing

BryanFauble commented Nov 6, 2025 •

edited by danlu1

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thomasyu888 Nov 14, 2025

Uh oh!

danlu1 Nov 14, 2025

Uh oh!

Uh oh!

rxu17 Nov 20, 2025

Uh oh!

rxu17 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

BryanFauble commented Nov 6, 2025 • edited by danlu1 Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem:

Solution:

Testing:

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

thomasyu888 Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

danlu1 Nov 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

rxu17 Nov 20, 2025

Choose a reason for hiding this comment

Uh oh!

rxu17 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

BryanFauble commented Nov 6, 2025 •

edited by danlu1

Loading