Skip to content

Conversation

@inv-jishnu
Copy link
Contributor

@inv-jishnu inv-jishnu commented Nov 13, 2025

Description

This PR updates the column creation logic in createColumnFromValue() in the ColumnUtils class to correctly interpret the string "null" as an actual null value for non-TEXT column types.

Previously, when a CSV file contained null as a value, it was parsed as the literal string "null", causing the initial null check (value != null) to behave differently than when the same value was provided via JSON or JSON Lines. As a result, CSV-based imports failed, while JSON and JSON Lines imports handled null values correctly.

This fix ensures consistent behavior across all file types by converting "null" to a real null value for all column types except TEXT when provided via a CSV file.

Related issues and/or PRs

NA

Changes made

  • Updated createColumnFromValue() method in ColumnUtils class and added a pre-check:

  • Added Unit Tests in ColumnUtilsTest

Checklist

The following is a best-effort checklist. If any items in this checklist are not applicable to this PR or are dependent on other, unmerged PRs, please still mark the checkboxes after you have read and understood each item.

  • I have commented my code, particularly in hard-to-understand areas.
  • I have updated the documentation to reflect the changes.
  • I have considered whether similar issues could occur in other products, components, or modules if this PR is for bug fixes.
  • Any remaining open issues linked to this PR are documented and up-to-date (Jira, GitHub, etc.).
  • Tests (unit, integration, etc.) have been added for the changes.
  • My changes generate no new warnings.
  • Any dependent changes in other PRs have been merged and published.

Additional notes (optional)

NA

Release notes

Fixed handling of "null" values for non-TEXT columns in CSV imports

@inv-jishnu inv-jishnu self-assigned this Nov 13, 2025
@gemini-code-assist
Copy link
Contributor

Summary of Changes

Hello @inv-jishnu, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request standardizes the handling of "null" values across different data import formats. Previously, CSV imports treated the string "null" as a literal for non-TEXT columns, causing inconsistencies and potential failures compared to JSON and JSONLines. The changes modify the createColumnFromValue method to correctly convert the string "null" into a true null value for all non-TEXT data types, ensuring uniform data interpretation and improving the robustness of data loading processes.

Highlights

  • Consistent Null Handling: The pull request introduces a change to correctly interpret the string "null" as an actual null value for non-TEXT column types during CSV imports, aligning its behavior with JSON and JSONLines imports.
  • CSV Import Reliability: This fix resolves issues where CSV-based imports would fail due to the string "null" being incorrectly parsed as a literal string instead of a null value for non-TEXT fields.
  • Enhanced Test Coverage: New unit tests have been added to ColumnUtilsTest to ensure the correct handling of "null" strings for various data types, including specific tests for non-TEXT types being converted to null and TEXT types retaining "null" as a literal string.
Using Gemini Code Assist

The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.

Invoking Gemini

You can request assistance from Gemini at any point by creating a comment using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands on the current page.

Feature Command Description
Code Review /gemini review Performs a code review for the current pull request in its current state.
Pull Request Summary /gemini summary Provides a summary of the current pull request in its current state.
Comment @gemini-code-assist Responds in comments when explicitly tagged, both in pull request comments and review comments.
Help /gemini help Displays a list of available commands.

Customization

To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.

Limitations & Feedback

Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments. If you're interested in giving your feedback about your experience with Gemini Code Assist for Github and other Google products, sign up here.

You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.

Footnotes

  1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution.

Copy link
Contributor

@gemini-code-assist gemini-code-assist bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review

This pull request correctly handles string "null" values from CSV imports as null for non-TEXT columns, aligning behavior with other file formats. The implementation is sound, and new unit tests validate the change. My main feedback is to make the "null" string check case-insensitive (e.g., to handle "NULL" and "Null") for better robustness, and to update the corresponding unit tests to cover these cases. This will make the feature more user-friendly.

@ypeckstadt ypeckstadt changed the title Handle null as null value for non-TEXT columns in CSV import Fix handling of "null" values for non-TEXT columns in CSV imports Nov 20, 2025
Copy link
Contributor

@ypeckstadt ypeckstadt left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM. Thank you.

@ypeckstadt ypeckstadt requested review from a team, Torch3333, brfrn169, feeblefakie and komamitsu and removed request for a team November 20, 2025 00:23
Copy link
Collaborator

@brfrn169 brfrn169 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thank you!

Copy link
Contributor

@komamitsu komamitsu left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! 👍

Copy link
Contributor

@feeblefakie feeblefakie left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thank you!

Copy link
Contributor

@Torch3333 Torch3333 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM, thank you!

Copy link
Contributor

@thongdk8 thongdk8 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM! Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

7 participants