feat: Add range partitioning support #174

kitagry · 2025-02-15T01:02:52Z

This pull request introduces support for range partitioning in BigQuery.

I checked this feature with example/config_replace_field_range_partitioned_table.yml

hiroyuki-sato

Thank you for creating this PR. I'll review this later. Before review this PR, I have a question. Could you check my comment?

hiroyuki-sato · 2025-02-15T15:31:40Z

README.md

+  range_partitioning:
+    field: customer_id
+    range:
+      start: '1'


Why does this part use string instead of integer? As far as I know, range partition uses a number. And we can check the range is valid (start < end) if we use integer.

What do you think of this configuration layout?
(Do we need a range block?).

range_partitioning: field: customer_id # document uses `column` but this plugin uses `field`. [1] start: 1 end: 1000 interval: 10 # [1] https://cloud.google.com/bigquery/docs/creating-partitioned-tables#create_an_integer-range_partitioned_table

(This is just my opinion, I want to ask co-maintainers this comment.)

Thank you for the comment! I followed the api documentation. If you prefer integer, I'll change this!

https://cloud.google.com/bigquery/docs/reference/rest/v2/tables#RangePartitioning

Do we need a range block?

I fixed it with 49d2c36.

Hello, @kitagry. Thank you for waiting.

Could you use this layout? (Sorry, we decided to use the original design (except using integer instead of string in range values.)

range_partitioning: field: customer_id range: start: 1 # integer not string. end: 99999 # integer not string. interval: 1 # integer not string.

and Could you check the range start + interval < end?
If you have any concern, please let me know.

I referenced the time_partition configurations.

BigQuery API use

{ "type": string, "expirationMs": string, "field": string, "requirePartitionFilter": boolean }

embulk configuration

type: bigquery table: table_name$20160929 time_partitioning: type: DAY expiration_ms: 259200000 # integer not strong, use sake case

We discussed this using this design document. (Written in Japanese)

After modification, I'll check the partition feature.

hiroyuki-sato · 2025-02-17T04:53:19Z

@kitagry Thank you for more work on this PR. I'll review this PR please wait. I'm not the original plugin developer. So, I need to investigate the configuration rule. If this plugin is based on the API settings, It would be better to use the original field.range.start. I'll talk co-maintainer about this.

kitagry · 2025-03-06T12:56:08Z

Hi @hiroyuki-sato , I changed range-partitioning fileds to be integer in f8d039f

lib/embulk/output/bigquery.rb

hiroyuki-sato · 2025-03-11T00:15:33Z

@kitagry Thanks. I will test this PR later. Please wait for a while.

feat: Add range partitioning support

adf5210

hiroyuki-sato self-requested a review February 15, 2025 15:47

hiroyuki-sato reviewed Feb 15, 2025

View reviewed changes

kitagry force-pushed the add-range-paritioned branch from 49d2c36 to f8d039f Compare March 6, 2025 12:54

hiroyuki-sato reviewed Mar 7, 2025

View reviewed changes

lib/embulk/output/bigquery.rb Outdated Show resolved Hide resolved

fix: change range-partitioning fields to be int

5398f69

kitagry force-pushed the add-range-paritioned branch from f8d039f to 5398f69 Compare March 9, 2025 06:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add range partitioning support #174

feat: Add range partitioning support #174

kitagry commented Feb 15, 2025

hiroyuki-sato left a comment

hiroyuki-sato Feb 15, 2025

kitagry Feb 15, 2025

kitagry Feb 16, 2025

hiroyuki-sato Mar 4, 2025

hiroyuki-sato commented Feb 17, 2025

kitagry commented Mar 6, 2025

hiroyuki-sato commented Mar 11, 2025

feat: Add range partitioning support #174

Are you sure you want to change the base?

feat: Add range partitioning support #174

Conversation

kitagry commented Feb 15, 2025

hiroyuki-sato left a comment

Choose a reason for hiding this comment

hiroyuki-sato Feb 15, 2025

Choose a reason for hiding this comment

kitagry Feb 15, 2025

Choose a reason for hiding this comment

kitagry Feb 16, 2025

Choose a reason for hiding this comment

hiroyuki-sato Mar 4, 2025

Choose a reason for hiding this comment

hiroyuki-sato commented Feb 17, 2025

kitagry commented Mar 6, 2025

hiroyuki-sato commented Mar 11, 2025