Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[SPARK-38028][SQL] Expose Arrow Vector from ArrowColumnVector #35326

Closed
wants to merge 1 commit into from

Conversation

viirya
Copy link
Member

@viirya viirya commented Jan 26, 2022

What changes were proposed in this pull request?

This change exposes Arrow Vector from ArrowColumnVector.

Why are the changes needed?

In some cases we need to work with Arrow Vectors behind ColumnVector using Arrow APIs. For example, some Spark extension libraries need to consume Arrow Vectors. For now, it is impossible as the Arrow Vector is private member in ArrowColumnVector. We need to expose the Arrow Vector from ArrowColumnVector.

Does this PR introduce any user-facing change?

No

How was this patch tested?

Existing tests.

@viirya
Copy link
Member Author

viirya commented Jan 26, 2022

cc @sunchao @dongjoon-hyun

@github-actions github-actions bot added the SQL label Jan 26, 2022
Copy link
Member

@dongjoon-hyun dongjoon-hyun left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

+1, LGTM. Thank you, @viirya .

@HyukjinKwon
Copy link
Member

I guess it's fine to expose .. @BryanCutler FYI.

Copy link
Member

@sunchao sunchao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM too

@dongjoon-hyun
Copy link
Member

Merged to master for Apache Spark 3.3.
All compilation (SBT/Maven/Scala-2.13/Java 11/Java 17) passed and this is irrelevant to the exiting UTs.

@viirya
Copy link
Member Author

viirya commented Jan 26, 2022

Thank you @dongjoon-hyun @sunchao @HyukjinKwon

@BryanCutler
Copy link
Member

A little late, but SGTM

@dongjoon-hyun
Copy link
Member

Thank you, @BryanCutler .

@viirya
Copy link
Member Author

viirya commented Jan 31, 2022

Thank you @BryanCutler

senthh pushed a commit to senthh/spark-1 that referenced this pull request Feb 3, 2022
### What changes were proposed in this pull request?

This change exposes Arrow Vector from `ArrowColumnVector`.

### Why are the changes needed?

In some cases we need to work with Arrow Vectors behind `ColumnVector` using Arrow APIs. For example, some Spark extension libraries need to consume Arrow Vectors. For now, it is impossible as the Arrow Vector is private member in `ArrowColumnVector`. We need to expose the Arrow Vector from `ArrowColumnVector`.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?

Existing tests.

Closes apache#35326 from viirya/arrow_vector.

Authored-by: Liang-Chi Hsieh <viirya@gmail.com>
Signed-off-by: Dongjoon Hyun <dongjoon@apache.org>
@viirya viirya deleted the arrow_vector branch December 27, 2023 18:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
5 participants