[SUPPORT] Writing with BULK_INSERT mode multiple times causes a bug: Duplicate fileId 00000000-8651-4ae5-8f9e-4424fed2d181 from bucket 0 of partition found during the BucketStreamWriteFunction index bootstrap. #10894

@Toroidals

Description

Tips before filing an issue

  • Have you gone through our FAQs? yes

  • Join the mailing list to engage in conversations and get faster support at dev-subscribe@hudi.apache.org.

  • If you have triaged this as a bug, then file an issue directly.

Describe the problem you faced

Writing to the same table multiple times with BULK_INSERT mode causes a bug: Duplicate fileId 00000000-8651-4ae5-8f9e-4424fed2d181 from bucket 0 of partition found during the BucketStreamWriteFunction index bootstrap.

Configuration:
write.operation=BULK_INSERT
index.type=BUCKET
hoodie.index.bucket.engine=SIMPLE
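
For context, a minimal Flink SQL sketch of a table declared with these options might look like the following; the table name, schema, storage path, and bucket count are placeholders and not taken from the reporter's actual job:

    -- Hypothetical table; only the three options from the report are taken from the issue.
    CREATE TABLE hudi_tbl (
      id   STRING PRIMARY KEY NOT ENFORCED,
      name STRING,
      ts   TIMESTAMP(3)
    ) WITH (
      'connector' = 'hudi',
      'path' = 'hdfs:///tmp/hudi_tbl',            -- placeholder path
      'write.operation' = 'bulk_insert',          -- write.operation=BULK_INSERT
      'index.type' = 'BUCKET',                    -- index.type=BUCKET
      'hoodie.index.bucket.engine' = 'SIMPLE',    -- hoodie.index.bucket.engine=SIMPLE
      'hoodie.bucket.index.num.buckets' = '4'     -- assumed bucket count, not from the report
    );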

To Reproduce

Steps to reproduce the behavior:

1. One program writes to a table in BULK_INSERT mode.
2. Another program writes to the same table using BULK_INSERT again; the data written in the two runs does not overlap.
3. When a third job then writes incremental data in UPSERT mode, it fails with: Duplicate fileId 00000000-8651-4ae5-8f9e-4424fed2d181 from bucket 0 of partition found during the BucketStreamWriteFunction index bootstrap. (A sketch of this sequence follows the list.)
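
A rough Flink SQL sketch of the failing sequence, assuming the hypothetical table definition above and hypothetical source tables source_batch_1, source_batch_2, and incremental_source:

    -- Job 1: first load with bulk_insert
    INSERT INTO hudi_tbl /*+ OPTIONS('write.operation' = 'bulk_insert') */
    SELECT * FROM source_batch_1;

    -- Job 2: a second, separate bulk_insert of non-overlapping data
    INSERT INTO hudi_tbl /*+ OPTIONS('write.operation' = 'bulk_insert') */
    SELECT * FROM source_batch_2;

    -- Job 3: switching to upsert for incremental data fails during the
    -- BucketStreamWriteFunction index bootstrap with the duplicate fileId error
    INSERT INTO hudi_tbl /*+ OPTIONS('write.operation' = 'upsert') */
    SELECT * FROM incremental_source;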

Expected behavior

It should be possible to use BULK_INSERT to write to the same table multiple times, or there should be a documented way to do so without breaking subsequent UPSERT writes.

Environment Description

  • Hudi version : 0.14.0

  • Flink version : 1.15.2

  • Hive version :

  • Hadoop version :

  • Storage (HDFS/S3/GCS..) :

  • Running on Docker? (yes/no) :
