You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have to agree, it is very technical and can add some confusion. I would suggest to simply use 128,000 rows.
Would you be willing to contribute the change?
kou
changed the title
PyArrow Documentation bug dataset.to_batches()
[Python][Docs] PyArrow Documentation bug dataset.to_batches()
Sep 7, 2023
…128Ki to 128_000 (#37605)
### Rationale for this change
#37560
### Are these changes tested? -> No
### Are there any user-facing changes? -> Documentation
* Closes: #37560
Authored-by: Arkadiusz Rudny <aru@trackunit.com>
Signed-off-by: AlenkaF <frim.alenka@gmail.com>
… from 128Ki to 128_000 (apache#37605)
### Rationale for this change
apache#37560
### Are these changes tested? -> No
### Are there any user-facing changes? -> Documentation
* Closes: apache#37560
Authored-by: Arkadiusz Rudny <aru@trackunit.com>
Signed-off-by: AlenkaF <frim.alenka@gmail.com>
dgreiss
pushed a commit
to dgreiss/arrow
that referenced
this issue
Feb 19, 2024
… from 128Ki to 128_000 (apache#37605)
### Rationale for this change
apache#37560
### Are these changes tested? -> No
### Are there any user-facing changes? -> Documentation
* Closes: apache#37560
Authored-by: Arkadiusz Rudny <aru@trackunit.com>
Signed-off-by: AlenkaF <frim.alenka@gmail.com>
Describe the bug, including details regarding any error messages, version, and platform.
https://github.com/apache/arrow/blob/main/python/pyarrow/_dataset.pyx#L3439
Default value is misleading and it suggests that user should define chunk size (128 Ki) rather than number of rows
I would use
2**17
or128_000
rather than128 Ki
Component(s)
Documentation, Python
The text was updated successfully, but these errors were encountered: