Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ARROW-10656: [Rust] Allow schema validation to ignore field names and only check data types on new batch #8988

Closed
wants to merge 1 commit into from

Conversation

nevi-me
Copy link
Contributor

@nevi-me nevi-me commented Dec 22, 2020

This adds the option to create a new record batch with less strict validation for list field names.
The default behaviour is preserved.

@nevi-me nevi-me changed the title ARROW-10655: [Rust] Allow eaner schema validation on new batch ARROW-10656: [Rust] Allow eaner schema validation on new batch Dec 22, 2020
@nevi-me
Copy link
Contributor Author

nevi-me commented Dec 22, 2020

Hi @ch-sc , please have a look at the approach that I've taken, and let me know if it meets your use-case.

If you're happy with the PR, I can then document the new functions, and add unit tests, then get it reviewed.

Thanks

@nevi-me nevi-me changed the title ARROW-10656: [Rust] Allow eaner schema validation on new batch ARROW-10656: [Rust] Allow leaner schema validation on new batch Dec 22, 2020
@codecov-io
Copy link

codecov-io commented Dec 22, 2020

Codecov Report

Merging #8988 (78b560c) into master (51672b2) will decrease coverage by 0.05%.
The diff coverage is 26.00%.

Impacted file tree graph

@@            Coverage Diff             @@
##           master    #8988      +/-   ##
==========================================
- Coverage   82.61%   82.56%   -0.06%     
==========================================
  Files         202      202              
  Lines       50048    50087      +39     
==========================================
+ Hits        41347    41353       +6     
- Misses       8701     8734      +33     
Impacted Files Coverage Δ
rust/arrow/src/datatypes.rs 75.02% <0.00%> (-1.43%) ⬇️
rust/arrow/src/record_batch.rs 76.06% <38.23%> (-12.23%) ⬇️

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 51672b2...78b560c. Read the comment docs.

@github-actions
Copy link

@ch-sc
Copy link
Contributor

ch-sc commented Dec 27, 2020

Looks good to me @nevi-me. Thank you!

Copy link
Contributor

@alamb alamb left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Nice work @nevi-me ! I went through this and it looks good. I also merged the this code into apache/master locally and re-ran all tests. I think this is good to go.

@alamb
Copy link
Contributor

alamb commented Dec 31, 2020

The full set of Rust CI tests did not run on this PR :(

Can you please rebase this PR against apache/master to pick up the changes in #9056 so that they do?

I apologize for the inconvenience.

@nevi-me
Copy link
Contributor Author

nevi-me commented Jan 1, 2021

Done rebasing @alamb

Copy link
Member

@jorgecarleitao jorgecarleitao left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM.

@alamb alamb changed the title ARROW-10656: [Rust] Allow leaner schema validation on new batch ARROW-10656: [Rust] Allow schema validation to ignore field names and only check data types on new batch Jan 1, 2021
@alamb alamb closed this in 118f462 Jan 1, 2021
@nevi-me nevi-me deleted the ARROW-10656 branch January 2, 2021 05:48
Ok(RecordBatch { schema, columns })
}

pub fn try_new_with_options(
Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@alamb @jorgecarleitao I must have done something wrong on the rebase, I had added doc comments for this function, and added tests :(

@nevi-me nevi-me restored the ARROW-10656 branch January 2, 2021 05:51
GeorgeAp pushed a commit to sirensolutions/arrow that referenced this pull request Jun 7, 2021
… only check data types on new batch

This adds the option to create a new record batch with less strict validation for list field names.
The default behaviour is preserved.

Closes apache#8988 from nevi-me/ARROW-10656

Authored-by: Neville Dipale <nevilledips@gmail.com>
Signed-off-by: Andrew Lamb <andrew@nerdnetworks.org>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants