Skip to content

Parquet Files Getting Emptied After Delete ParquetWriter #3014

@tomnoah1

Description

@tomnoah1

Describe the bug, including details regarding any error messages, version, and platform.

Version: 1.14.1

I got the following code:

(1) writer = AvroParquetWriter
            .builder<GenericRecord>(ParquetOutputFile(localFile))
            .withSchema(SCHEMA)
            .build()
(2) writer.write(genericRecord)
(3) writer.close()
(4) writer = AvroParquetWriter
            .builder<GenericRecord>(ParquetOutputFile(localFile2))
            .withSchema(SCHEMA)
            .build()

After the third line, I can see the file with the data (genericRecord), and read it.
For some reason, after the 4th line, the file is getting empty. It contains no data and weight 0 bytes.
When trying to read it I am getting:
File(<some_name>) cannot be read as parquet. File matching that expression not found.

Without the 4th line, the file and its content remains steady.

Component(s)

No response

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions