-
Notifications
You must be signed in to change notification settings - Fork 3.6k
Insights: apache/arrow
Overview
Could not load contribution data
Please try again later
19 Pull requests merged by 12 people
-
GH-45669: [C++][Parquet] Add missing
ParquetFileReader::GetReadRanges()
definition#45684 merged
Mar 8, 2025 -
GH-45676: [C++][Python][Compute] Add skew and kurtosis functions
#45677 merged
Mar 7, 2025 -
MINOR: [R] Update github.linux.arrow.version.back.compat.yml
#45703 merged
Mar 7, 2025 -
MINOR: [JS] Bump typedoc from 0.27.6 to 0.27.9 in /js
#45646 merged
Mar 7, 2025 -
MINOR: [Docs] Update implementation status for nanoarrow
#45333 merged
Mar 6, 2025 -
GH-45665: [Docs] Add kapa AI bot to the docs
#45667 merged
Mar 5, 2025 -
GH-45366: [C++][Parquet] Set is_compressed to false when data page v2 is not compressed
#45367 merged
Mar 5, 2025 -
GH-45670: [Release][Archery] Crossbow bot accepts
--prefix
#45671 merged
Mar 5, 2025 -
GH-45652: [C++][Acero] Unify ConcurrentQueue and BackpressureConcurrentQueue API
#45421 merged
Mar 4, 2025 -
GH-45269: [C++][Compute] Add "pivot_wider" and "hash_pivot_wider" functions
#45562 merged
Mar 4, 2025 -
MINOR: Remove gzip compression reference from Pyarrow Orc doc
#45658 merged
Mar 4, 2025 -
GH-35289: [Python] Support large variable width types in numpy conversion
#36701 merged
Mar 4, 2025 -
GH-45661: [GLib][Ruby][Dev] Add Ruby lint rule (add space after comma)
#45662 merged
Mar 4, 2025 -
GH-45659: [GLib][Ruby] Fix Ruby lint violation(add space after comma)
#45660 merged
Mar 4, 2025 -
MINOR: [C#] Bump ZstdSharp.Port from 0.8.4 to 0.8.5 in /csharp
#45655 merged
Mar 4, 2025 -
MINOR: [C#] Bump Grpc.Tools from 2.69.0 to 2.70.0 in /csharp
#45654 merged
Mar 4, 2025 -
MINOR: [CI] Bump actions/download-artifact from 4.1.8 to 4.1.9
#45657 merged
Mar 3, 2025 -
GH-45649: [GLib] Add GArrowBinaryViewArray
#45650 merged
Mar 3, 2025
14 Pull requests opened by 13 people
-
GH-45474: [C++] Add support for building statistics array for nested types
#45663 opened
Mar 4, 2025 -
GH-41985: [Python][Docs] Clarify docstring of pyarrow.compute.scalar()
#45668 opened
Mar 4, 2025 -
GH-45673: Make ConcurrentQueue and BackpressureConcurrentQueue public interface
#45674 opened
Mar 5, 2025 -
GH-30302: [C++][Parquet] Preserve the bitwidth of integer dictionary indices on round-trip to Parquet
#45685 opened
Mar 6, 2025 -
GH-45694: [C++] Bump vendored flatbuffers to 24.3.6
#45687 opened
Mar 6, 2025 -
GH-45690: [C++][Parquet] Consolidate Arrow write functions under `TypedColumnWriterImpl`
#45688 opened
Mar 6, 2025 -
GH-45691: [R][Packaging] Update R packaging checklist with latest process
#45692 opened
Mar 6, 2025 -
GH-45693: [C++][Gandiva] Fix aes_encrypt/decrypt algorithm selection.
#45695 opened
Mar 7, 2025 -
GH-45696: [C++][Gandiva] Accept LLVM 20.1
#45697 opened
Mar 7, 2025 -
GH-45675: [Release] Run binary RC verification jobs in apache/arrow
#45699 opened
Mar 7, 2025 -
GH-45700: [C++] Added nullptr check in Equals method to handle null impl_ pointers
#45701 opened
Mar 7, 2025 -
GH-45705: [Python] Expose azure sas tokens in python
#45706 opened
Mar 8, 2025 -
GH-45710: [GLib] Add GArrowStringViewArray
#45711 opened
Mar 8, 2025 -
GH-43118: [JS] Add interval for unit MONTH_DAY_NANO (#43117)
#45712 opened
Mar 8, 2025
16 Issues closed by 8 people
-
[C++][Parquet] GetReadRanges symbol not part of parquet.dll (Windows)
#45669 closed
Mar 8, 2025 -
[C++][Compute] Add skew and kurtosis functions
#45676 closed
Mar 7, 2025 -
[Docs] Add kapa AI bot to the docs
#45665 closed
Mar 5, 2025 -
[C++][Parquet] Allowing is_compressed not set if DataPageV2 enabled
#45366 closed
Mar 5, 2025 -
[Release][Archery] Crossbow bot accepts `--prefix`
#45670 closed
Mar 5, 2025 -
[C++] Unify ConcurrentQueue and BackpressureConcurrentQueue API
#45652 closed
Mar 4, 2025 -
[C++][Compute] Add a pivot function
#45269 closed
Mar 4, 2025 -
[Python] Converting from NumPy to large_string or large_binary returns not implemented
#35289 closed
Mar 4, 2025 -
[C++][Acero] High contention in Swiss join build phase
#45611 closed
Mar 4, 2025 -
[GLib][Ruby][Dev] Add Ruby lint rule (add space after comma)
#45661 closed
Mar 4, 2025 -
[GLib][Ruby] Fix Ruby lint violation(add space after comma)
#45659 closed
Mar 4, 2025 -
[GLib] Add GArrowBinaryViewArray
#45649 closed
Mar 3, 2025 -
[Python] read_table from s3 randomly fails due to timeout
#45432 closed
Mar 3, 2025 -
[CI][Python] test_cython_api is failing for Windows wheel
#45651 closed
Mar 3, 2025 -
[CI][Packaging] Drop support for Ubuntu 20.04
#45291 closed
Mar 3, 2025 -
CRAN packaging checklist for version 18.1.0.1
#45172 closed
Mar 2, 2025
28 Issues opened by 18 people
-
[GLib] Add `garrow_chunked_array_import()` and `garrow_chunked_array_export()`
#45713 opened
Mar 9, 2025 -
[GLib] Add GArrowStringViewArray
#45710 opened
Mar 8, 2025 -
[R] Don't construct `arrow_binary` class vector in favor of `blob::blob`
#45709 opened
Mar 8, 2025 -
[Release] Re-run binary verification jobs after we upload binaries
#45708 opened
Mar 8, 2025 -
[Release] Verify aarch64 wheels RC in CI
#45707 opened
Mar 8, 2025 -
[Python] Expose Azure sas tokens to pyarrow
#45705 opened
Mar 8, 2025 -
[CI] Windows wheel builds are slower after refactor
#45704 opened
Mar 7, 2025 -
[Python][C++] Support AWS_ROLE_ARN env variable for pyarrow.fs.S3FileSystem input
#45702 opened
Mar 7, 2025 -
[C++] Missing nullability check in Equals method for Expression
#45700 opened
Mar 7, 2025 -
[C++] FFI between c++ and python
#45698 opened
Mar 7, 2025 -
[C++][Gandiva] Add support for LLVM 20
#45696 opened
Mar 7, 2025 -
[C++] Bump Vendored FlatBuffers
#45694 opened
Mar 7, 2025 -
[C++][Gandiva] AES Encrypt only uses 128 bit encryption regardless of cipher length
#45693 opened
Mar 7, 2025 -
[R][Packaging] Update R packaging checklist with latest process
#45691 opened
Mar 6, 2025 -
[C++][Parquet] Consolidate Arrow write functions under `TypedColumnWriterImpl`
#45690 opened
Mar 6, 2025 -
[C++] Bump Apache ORC to 2.1.1
#45689 opened
Mar 6, 2025 -
[C++][Python] sort_by produces incorrect result
#45686 opened
Mar 6, 2025 -
[CI][C++] Add CI jobs that use the same build options for JNI
#45683 opened
Mar 6, 2025 -
[Python] Failing to create array from null scalar
#45682 opened
Mar 5, 2025 -
[C++][Compute] Investigate Kahan summation vs pairwise summation
#45681 opened
Mar 5, 2025 -
[C++][Python] Remove deprecated functions in 20.0
#45680 opened
Mar 5, 2025 -
[R] Export/Import ChunkedArray via Arrow C stream interface
#45678 opened
Mar 5, 2025 -
[Release] Run binary RC verification jobs in apache/arrow
#45675 opened
Mar 5, 2025 -
[C++] Make ConcurrentQueue and BackpressureConcurrentQueue public interface
#45673 opened
Mar 5, 2025 -
[C++] Add StructScalar to Scalar::Parse()
#45666 opened
Mar 4, 2025 -
[C++] Add StringView and BinaryView to ArrayStatistics
#45664 opened
Mar 4, 2025 -
[C#] C# builds are failing on MacOS
#45656 opened
Mar 3, 2025 -
[Python] Scalar subclasses should implement Python protocols
#45653 opened
Mar 3, 2025
59 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
[C++][Python][Parquet] Implement Content-Defined Chunking for the Parquet writer
#45360 commented on
Mar 7, 2025 • 98 new comments -
GH-18036: [Packaging] Build Python wheel for musllinux
#45470 commented on
Mar 9, 2025 • 19 new comments -
GH-26818: [C++][Python] Preserve order when writing dataset multi-threaded
#44470 commented on
Mar 7, 2025 • 15 new comments -
GH-44615: [C++] Add possibility to extract spans/byte offsets directly for compute.extract_regex
#45577 commented on
Mar 8, 2025 • 11 new comments -
GH-25025: [C++] Move non core compute kernels into separate shared library
#45618 commented on
Mar 5, 2025 • 9 new comments -
GH-45594: [C++][Parquet] POC: Optimize Parquet DecodeArrow in DeltaLengthByteArray
#45622 commented on
Mar 7, 2025 • 8 new comments -
GH-38026: [Python] Use provided field types in StructArray.from_arrays
#38334 commented on
Mar 8, 2025 • 3 new comments -
GH-45564: [C++][Acero] Add size validation for names and expressions vectors in ProjectNode
#45565 commented on
Mar 7, 2025 • 3 new comments -
Add Arrow Flight SQL ODBC driver
#40939 commented on
Mar 8, 2025 • 2 new comments -
GH-44232: [Python] Validate __arrow_c_array__ length for scalar construction
#44434 commented on
Mar 4, 2025 • 2 new comments -
[C++][Parquet] Add variant type
#45375 commented on
Mar 6, 2025 • 2 new comments -
GH-45619: [Python] Use f-string instead of string.format
#45629 commented on
Mar 8, 2025 • 2 new comments -
GH-43135: [R] Change the binary type mapping to `blob::blob`
#45595 commented on
Mar 8, 2025 • 2 new comments -
GH-36628: [Python][Parquet] Fail when instantiating internal Parquet metadata classes
#45549 commented on
Mar 9, 2025 • 1 new comment -
GH-44345: [C++][Parquet] Fully support arrow decimal32/64 in Parquet
#45351 commented on
Mar 9, 2025 • 1 new comment -
GH-43573: [C++] Copy bitmap when casting from string-view to offset string and binary types
#44822 commented on
Mar 4, 2025 • 1 new comment -
GH-44800: [C#] Implement Flight SQL Client
#44783 commented on
Mar 6, 2025 • 1 new comment -
MINOR: [JS] Bump ts-jest from 29.1.4 to 29.2.6 in /js
#45648 commented on
Mar 7, 2025 • 0 new comments -
GH-43118: [JS] Add interval for unit MONTH_DAY_NANO
#43117 commented on
Mar 8, 2025 • 0 new comments -
GH-43296: [C++][FlightRPC] Remove Flight UCX transport
#43297 commented on
Mar 7, 2025 • 0 new comments -
GH-43855: [Python] Support inference and conversion of uuid.UUID objects
#44242 commented on
Mar 3, 2025 • 0 new comments -
GH-32276: [C++][FlightRPC] Add option to align RecordBatch buffers given to IPC reader
#44279 commented on
Mar 7, 2025 • 0 new comments -
GH-45559: [C#] Increasing performance by avoiding array allocation everytime method gets used
#45609 commented on
Mar 3, 2025 • 0 new comments -
GH-45689: [C++][Thirdparty] Bump Apache ORC to 2.1.1
#45600 commented on
Mar 7, 2025 • 0 new comments -
GH-45167: [C++] Implement Compute Equals for List Types
#45272 commented on
Mar 9, 2025 • 0 new comments -
GH-45282: [Python][Parquet] Remove unused readonly properties of ParquetWriter
#45281 commented on
Mar 4, 2025 • 0 new comments -
MINOR: [JS] Bump glob from 10.4.1 to 11.0.1 in /js
#45405 commented on
Mar 7, 2025 • 0 new comments -
MINOR: [JS] Bump web-streams-polyfill from 3.2.1 to 4.1.0 in /js
#45409 commented on
Mar 7, 2025 • 0 new comments -
MINOR: [JS] Bump typescript from 5.4.5 to 5.7.3 in /js
#45410 commented on
Mar 7, 2025 • 0 new comments -
GH-41246: [Docs][C++][Python] Improve docs on column encryption for nested fields
#45411 commented on
Mar 7, 2025 • 0 new comments -
[C#] BitUtility.cs performance improvement
#45559 commented on
Mar 3, 2025 • 0 new comments -
[Release] Provide more ways to publish Binary Artifacts
#40760 commented on
Mar 3, 2025 • 0 new comments -
[JS] Implement IPC RecordBatch body buffer compression from ARROW-300
#24833 commented on
Mar 3, 2025 • 0 new comments -
`FixedShapeTensorArray.from_numpy_ndarray` should pass `dim_names` to `fixed_shape_tensor`
#45531 commented on
Mar 3, 2025 • 0 new comments -
[CI][Python] Windows wheels job fail on test_cython_api
#45530 commented on
Mar 3, 2025 • 0 new comments -
[Python] Timestamp with tz loses its time zone after `to_numpy`
#45644 commented on
Mar 3, 2025 • 0 new comments -
[Docs][Release][Website] Figure out why the version banner code changed in 19.0.0
#45290 commented on
Mar 3, 2025 • 0 new comments -
[Python] - Schema inference from pandas reorders struct fields
#45640 commented on
Mar 3, 2025 • 0 new comments -
[Python] Support for SubInterpreters and InterpreterPoolExecutors
#44511 commented on
Mar 3, 2025 • 0 new comments -
[Python][CSV] Сannot remove columns with the same name from a table
#45590 commented on
Mar 3, 2025 • 0 new comments -
[Python] OSError: Unable to load libhdfs
#45369 commented on
Mar 4, 2025 • 0 new comments -
[Format] Consider adding an official variant type to Arrow
#42069 commented on
Mar 4, 2025 • 0 new comments -
[C++] Add support for building statistics array for nested types
#45474 commented on
Mar 4, 2025 • 0 new comments -
[Python] AWS Error NETWORK_CONNECTION during HeadObject operation: curlCode: 6, Couldn't resolve host name
#40539 commented on
Mar 4, 2025 • 0 new comments -
[Python] Automatically support fsspec filesystem URIs
#44900 commented on
Mar 4, 2025 • 0 new comments -
[Python] Allow PyCapsule Interface in `pyarrow.scalar` constructor?
#44232 commented on
Mar 4, 2025 • 0 new comments -
[Python][Docs] Clarify docstring of pyarrow.compute.scalar()
#41985 commented on
Mar 4, 2025 • 0 new comments -
read_parquet super slow
#13720 commented on
Mar 4, 2025 • 0 new comments -
[C++][Compute] Split up `hash_aggregate.cc`
#45563 commented on
Mar 5, 2025 • 0 new comments -
[C++][Parquet] Thread crash at exit in Arrow-15.0.0 dataset scan
#39862 commented on
Mar 5, 2025 • 0 new comments -
[R] Support for filename-based partitioning
#31516 commented on
Mar 6, 2025 • 0 new comments -
Arrow FlightSQL client fails on s390x platform
#45641 commented on
Mar 6, 2025 • 0 new comments -
[C++][Parquet] Preserve the bithwidth of the integer dictionary indices on rountrip to Parquet?
#30302 commented on
Mar 6, 2025 • 0 new comments -
[C++] Split non-cast compute kernels into a separate shared library
#25025 commented on
Mar 6, 2025 • 0 new comments -
[C++][Gandiva] Projector fails with SIGSEGV if calling function from math.h on AArch64
#40839 commented on
Mar 7, 2025 • 0 new comments -
[C++] Use System Provided Flatbuffers
#45636 commented on
Mar 7, 2025 • 0 new comments -
[R] CRAN packaging checklist for version 19.0.1
#45581 commented on
Mar 7, 2025 • 0 new comments -
[Format][FlightRPC] Flight SQL evolution
#41840 commented on
Mar 8, 2025 • 0 new comments -
[C++][Compute] Rank function considers NaNs and nulls equal
#45193 commented on
Mar 9, 2025 • 0 new comments