[C++][CSV] Add support for ReadOptions::skip_rows >= block_size #24696

asfimport · 2020-04-20T11:09:29Z

The current implementation throws an error in reader.cc:286 when skip_rows > header. However, in some workloads skip_rows is used not only for skipping the header but also for skipping the first n rows. In that case the block-size constraint gets in the way. I think this constraint could be removed without a performance reduction.

Reporter: Ravil Bikbulatov
Assignee: Nate Clark / @n3world

Related issues:

[C++][Dataset] Implement row-count for CSV or allow selecting 0 columns from CSV (relates to)

Note: This issue was originally created as ARROW-8527. Please see the migration documentation for further details.

asfimport · 2021-05-06T17:37:07Z

Weston Pace / @westonpace:
This behavior could be useful for ARROW-12598. Also, in a recent discussion, n3world (no Jira that I can find) pointed out that skip_rows is probably not the best tool for this. This sort of "paging" would require skipping data rows, so it would be nice if "skip header rows" (a constant parameter based on the tool generating the data) were distinct from "skip data rows" (a per-query parameter based on paging needs).

asfimport · 2021-06-07T17:24:38Z

Nate Clark / @n3world:
Resolved with the solution for ARROW-12661. skip_rows_after_names can skip rows in multiple blocks.
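
For readers unfamiliar with what "skipping rows in multiple blocks" involves, here is a minimal standalone sketch. It is not Arrow's implementation: `SkipRowsAcrossBlocks` is a hypothetical helper, and it assumes `'\n'` row terminators with no newlines inside quoted fields. The idea is only that the number of rows still to be skipped is carried from one block to the next instead of being required to fit in the first block.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <string_view>

// Hypothetical helper, not part of Arrow: consumes `data` in chunks of
// `block_size` bytes and drops the first `rows_to_skip` newline-terminated
// rows, carrying the remaining skip count from one block to the next.
// Assumes '\n' row terminators and no newlines inside quoted fields.
std::string SkipRowsAcrossBlocks(std::string_view data, std::size_t block_size,
                                 std::size_t rows_to_skip) {
  assert(block_size > 0);
  std::string kept;
  std::size_t remaining = rows_to_skip;
  for (std::size_t offset = 0; offset < data.size(); offset += block_size) {
    std::string_view block = data.substr(offset, block_size);
    std::size_t pos = 0;
    // Consume rows from this block while there are still rows to skip.
    while (remaining > 0 && pos < block.size()) {
      std::size_t newline = block.find('\n', pos);
      if (newline == std::string_view::npos) {
        // The row continues in the next block; discard this fragment.
        pos = block.size();
      } else {
        pos = newline + 1;
        --remaining;
      }
    }
    // Everything after the skipped rows is kept unchanged.
    kept.append(block.substr(pos));
  }
  return kept;
}
```

With this approach the result does not depend on how the input happens to be split: skipping two rows of a four-row input yields the last two rows whether the block size is smaller or larger than the skipped region.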

asfimport closed this as completed Jun 7, 2021

asfimport added this to the 5.0.0 milestone Jan 11, 2023

asfimport mentioned this issue Jan 11, 2023

[C++][Dataset] Implement row-count for CSV or allow selecting 0 columns from CSV #28352

Closed
