[SPARK-54463][SQL] Add CSV serialization and deserialization support for TIME type #53175

vinodkc · 2025-11-23T00:13:39Z

What changes were proposed in this pull request?

This PR adds CSV serialization and deserialization support for Spark's TIME type

Why are the changes needed?

TIME type currently lacks CSV support, preventing users from:

Reading/writing CSV files with TIME columns
Using from_csv() and to_csv() functions with TIME type
Integrating TIME data with external CSV-based systems

Does this PR introduce any user-facing change?

Yes.
Users can now:

Read CSV with TIME: spark.read.schema("time TIME(6)").csv("data.csv")
Write CSV with TIME: df.write.csv("output.csv") → 14:30:45.123456

Use from_csv/to_csv:

SELECT from_csv('14:30:45.123456', 'time TIME(6)');
SELECT to_csv(named_struct('time', TIME'14:30:45'));

Custom format: spark.read.option("timeFormat", "HH-mm-ss.SSSSSS").csv("data.csv")
New option: timeFormat - controls TIME formatting/parsing (default: HH:mm:ss with fractional seconds)

How was this patch tested?

Added new test cases in CsvExpressionsSuite, CsvFunctionsSuite, SQL tests (csv-functions.sql), and CsvSuite

Was this patch authored or co-authored using generative AI tooling?

Yes.
Generated-by: Claude 3.5 Sonnet

AI assistance was used for:

Code pattern analysis and design discussions
Implementation guidance following Spark conventions
Test case generation and organization
Documentation and examples

dongjoon-hyun

Thank you, @vinodkc .

dongjoon-hyun

+1, LGTM. Merged to master for Apache Spark 4.2.0.

…for TIME type ### What changes were proposed in this pull request? This PR adds CSV serialization and deserialization support for Spark's TIME type ### Why are the changes needed? TIME type currently lacks CSV support, preventing users from: - Reading/writing CSV files with TIME columns - Using `from_csv()` and `to_csv()` functions with TIME type - Integrating TIME data with external CSV-based systems ### Does this PR introduce _any_ user-facing change? Yes. Users can now: - Read CSV with TIME: `spark.read.schema("time TIME(6)").csv("data.csv")` - Write CSV with TIME: `df.write.csv("output.csv")` → `14:30:45.123456` - Use from_csv/to_csv: ```sql SELECT from_csv('14:30:45.123456', 'time TIME(6)'); SELECT to_csv(named_struct('time', TIME'14:30:45')); ``` - Custom format: `spark.read.option("timeFormat", "HH-mm-ss.SSSSSS").csv("data.csv")` - New option: `timeFormat` - controls TIME formatting/parsing (default: `HH:mm:ss` with fractional seconds) ### How was this patch tested? Added new test cases in `CsvExpressionsSuite`, `CsvFunctionsSuite`, SQL tests (`csv-functions.sql`), and `CsvSuite` ### Was this patch authored or co-authored using generative AI tooling? Yes. Generated-by: Claude 3.5 Sonnet AI assistance was used for: - Code pattern analysis and design discussions - Implementation guidance following Spark conventions - Test case generation and organization - Documentation and examples Closes apache#53175 from vinodkc/br_time_csv_read_write. Authored-by: vinodkc <vinod.kc.in@gmail.com> Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>

Support TIME type in CVS read and write

0bc143e

github-actions bot added the SQL label Nov 23, 2025

dongjoon-hyun reviewed Nov 23, 2025

View reviewed changes

dongjoon-hyun approved these changes Nov 23, 2025

View reviewed changes

dongjoon-hyun closed this in 7f5478c Nov 23, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SPARK-54463][SQL] Add CSV serialization and deserialization support for TIME type #53175

[SPARK-54463][SQL] Add CSV serialization and deserialization support for TIME type #53175

Uh oh!

vinodkc commented Nov 23, 2025

Uh oh!

dongjoon-hyun left a comment

Uh oh!

dongjoon-hyun left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[SPARK-54463][SQL] Add CSV serialization and deserialization support for TIME type #53175

[SPARK-54463][SQL] Add CSV serialization and deserialization support for TIME type #53175

Uh oh!

Conversation

vinodkc commented Nov 23, 2025

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

dongjoon-hyun left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants