-
Notifications
You must be signed in to change notification settings - Fork 2.5k
Closed
Description
Hi,
I have a dataset which contain many columns.
now i want to store this dataset based on year/month.
hudi_options = {
'hoodie.table.name': tableName,
'hoodie.datasource.write.recordkey.field': 'ID',
'hoodie.datasource.write.table.name': tableName,
'hoodie.datasource.write.operation': 'upsert',
'hoodie.datasource.write.precombine.field': 'ID',
'hoodie.upsert.shuffle.parallelism': 2,
'hoodie.insert.shuffle.parallelism': 2,
'hoodie.datasource.write.partitionpath.field':'year/month'
}
when i use partition path as year/month,it doesn't create any partition.
i want my partition path as (year/month) year--->month--->then parquet files.
i saw your sample example where you have used (region/country/city)
i am using pyspark to connect with apache Hudi
Reactions are currently unavailable
Metadata
Metadata
Assignees
Labels
No labels