there is no data when a couple of hudi tables join #10366

@njalan

Description

An ETL job runs every hour and does an INSERT OVERWRITE into one table from the result of joining several Hudi tables. About once a week, no data is inserted.

Environment Description

Hudi version : 0.9.1

Spark version : 3.0.1

Hive version : 3

Hadoop version : 3.2.2

Storage (HDFS/S3/GCS..) : s3

Running on Docker? (yes/no) : no

What can be the reason? Is there any way to debug this kind of issue, or to get more metrics for it?
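One possible way to gather more information the next time this happens is to log the row count of each source table and of the join result before overwriting, and to skip the overwrite when the result is empty. The following is only a minimal sketch under assumptions: `guarded_overwrite` and its parameters are hypothetical names, not part of Hudi or Spark, and the counting and writing steps are passed in as plain callables so the logic does not depend on any particular API.

```python
import logging
from typing import Callable, Dict

log = logging.getLogger("etl_guard")


def guarded_overwrite(
    source_counts: Dict[str, Callable[[], int]],
    result_count: Callable[[], int],
    overwrite: Callable[[], None],
) -> bool:
    """Run `overwrite` only if the join result is non-empty.

    Hypothetical helper: returns True when the overwrite ran,
    False when it was skipped because the result had no rows.
    """
    # Record the size of every input so an empty run can be traced to a source.
    counts = {name: count() for name, count in source_counts.items()}
    log.info("source row counts: %s", counts)

    rows = result_count()
    log.info("join result row count: %d", rows)

    if rows == 0:
        # Keep the previous contents of the target instead of wiping it.
        empty = [name for name, n in counts.items() if n == 0]
        log.error("join produced no rows; empty sources: %s", empty)
        return False

    overwrite()
    return True
```

In a Spark job the callables could wrap something like `spark.table("db.some_hudi_table").count()` for each source and the existing INSERT OVERWRITE statement for the write; the table name here is a placeholder. If a source count is zero in the failing run, that would point at the read side (for example, the table being read while another writer is updating it) rather than at the join itself.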
