Skip to content

bug: ingesting zipped CSV does not work above a certain number of lines #18986

@vbmithr

Description

@vbmithr

Search before asking

  • I had searched in the issues and found no similar issues.

Version

1.2.833

What's Wrong?

root@localhost:8000/default/default> COPY INTO aggtrade_nearusdt
FROM
  '@uploads/machin.zip' FILE_FORMAT = (TYPE = CSV COMPRESSION = zip);
error: APIError: QueryFailed: [1076]compression data invalid: invalid Zip archive: Could not find EOCD

Works when the input data is 250K lines, returns the above error at 500K lines.

Input data is https://data.binance.vision/data/futures/um/monthly/aggTrades/NEARUSDT/NEARUSDT-aggTrades-2025-07.zip

How to Reproduce?

CREATE TABLE aggtrade_nearusdt (
  agg_id int NOT NULL,
  px decimal(18, 4) NOT NULL,
  qty decimal(18, 0) NOT NULL,
  start_id int NOT NULL,
  stop_id int NOT NULL,
  ts timestamp NOT NULL,
  taker_is_mm bool NOT NULL
);

COPY INTO aggtrade_nearusdt
FROM
  '@uploads/NEARUSDT-aggTrades-2021-06.csv' FILE_FORMAT = (TYPE = CSV COMPRESSION = zip);

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Metadata

Metadata

Assignees

Labels

C-bugCategory: something isn't working

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions