Insights: apache/arrow
Overview
31 Pull requests merged by 18 people
-
GH-46494: [CI][Dev] Add shellcheck files without change
#46495 merged
May 19, 2025 -
GH-46084: [C++] Always use ARROW_VCPKG to detect vcpkg mode
#46467 merged
May 19, 2025 -
GH-46490: [CI][Dev] Add shellcheck ci/scripts/install_ccache.sh
#46492 merged
May 19, 2025 -
GH-46487: [C++] Refactor lz4 from ExternalProject to FetchContent
#46390 merged
May 18, 2025 -
GH-46482: [CI][Dev] Add shellcheck files without change
#46483 merged
May 18, 2025 -
GH-46414: [C++] Fix GCS filesystem getFileInfo method
#46416 merged
May 18, 2025 -
GH-46473: [C++][Docs] Fix typos in decimal comments
#46474 merged
May 17, 2025 -
GH-46478: [C++] Implement recent JSON changes into Meson configuration
#46479 merged
May 17, 2025 -
GH-46444: [Documentation][C++][Acero] Move internal Swiss table doc into public C++ developer doc
#46445 merged
May 16, 2025 -
GH-46456: [GLib] Add missing `since:` tag
#46457 merged
May 16, 2025 -
GH-40756: [C++] Remove dead Boost urls
#46452 merged
May 16, 2025 -
GH-46442: [R] hms::as_hms tests fail on some of our crossbow builds
#46443 merged
May 15, 2025 -
GH-46349: [Python] Move parquet definitions to pyarrow/includes/libparquet.pxd
#46437 merged
May 15, 2025 -
GH-45229: [Python] skip scipy.sparse roundtrip tests for float16
#46413 merged
May 15, 2025 -
GH-46420: [C++][Dataset] Fix DatasetWriter deadlock on writing batch greater than max_rows_queued
#46139 merged
May 15, 2025 -
GH-46450: [GLib] Add GArrowFixedShapeDataType#strides
#46451 merged
May 15, 2025 -
GH-46224: [C++][Acero] Fix the hang in asof join
#46300 merged
May 14, 2025 -
GH-45908: [C++][Docs] Rename and expose basic {Array,...}FromJSON helpers as public APIs
#46180 merged
May 14, 2025 -
GH-26818: [C++][Python] Preserve order when writing dataset multi-threaded
#44470 merged
May 14, 2025 -
GH-46433: [GLib] Add GArrowFixedShapeDataType#dim_names
#46434 merged
May 14, 2025 -
GH-45750: [C++][Python][Parquet] Implement Content-Defined Chunking for the Parquet writer
#45360 merged
May 13, 2025 -
GH-46394: [C++][R] gcc-UBSAN errors on CRAN
#46397 merged
May 13, 2025 -
GH-38914: [Python] Add EncryptionConfiguration.uniform_encryption
#46347 merged
May 13, 2025 -
GH-46424: [C++][Parquet] Fix erroneous unit test skip
#46425 merged
May 13, 2025 -
GH-46419: [C++] Remove duplicate declaration and sync arg names on acero test_util_internal functions
#45400 merged
May 13, 2025 -
GH-46376: [Docs] Replace Xitter link with BlueSky link
#46402 merged
May 13, 2025 -
GH-46304: [Release][Packaging] Use optimized debug build for .deb
#46392 merged
May 13, 2025 -
GH-46417: [C++][Parquet] Fix UB in LoadEnumSafe for EdgeInterpolationAlgorithm
#46418 merged
May 13, 2025 -
GH-46400: [GLib] Add GArrowFixedShapeDataType#permutation
#46401 merged
May 13, 2025 -
MINOR: [Swift] Bump github.com/apache/arrow-go/v18 from 18.2.0 to 18.3.0 in /swift/CDataWGo
#46405 merged
May 13, 2025
21 Pull requests opened by 15 people
-
GH-46375: [C++] Add adapters/orc to Meson configuration
#46409 opened
May 12, 2025 -
GH-46411: [C++] Add dataset option to Meson configuration
#46412 opened
May 12, 2025 -
GH-46395: [C++][Statistics] Correct the Equal method for min and max in arrow::ArrayStatistics
#46422 opened
May 13, 2025 -
GH-45229: [Python] Migrate from scipy.spmatrix to scipy.sparray
#46423 opened
May 13, 2025 -
GH-45601: [R] R arrow cannot handle labelled data in arrow tables
#46431 opened
May 13, 2025 -
GH-46343: Avoid installing gdb 16.3 in conda-python image to fix CI
#46438 opened
May 14, 2025 -
GH-37027: [C++] Add float16 kernels to if-else and vector-replace functions
#46446 opened
May 14, 2025 -
GH-46439: [C++] Address post-merge review comments in PR exposing {Array,...}FromJSON helpers in public API
#46447 opened
May 14, 2025 -
GH-46454: [C++][Dataset][Acero] Preserve order when writing with TeeNode
#46455 opened
May 15, 2025 -
GH-46462: [C++][Parquet] Expose currently thrown EncodedStatistics when checking is_stats_set
#46463 opened
May 15, 2025 -
WIP: [R] Verify CRAN release-20.0.0.1
#46472 opened
May 16, 2025 -
GH-46475: [Documentation][C++][Compute] Consolidate Acero developer docs
#46476 opened
May 16, 2025 -
GH-46477: [C++] Use vendored flatbuffers in Meson configuration
#46484 opened
May 17, 2025 -
GH-46481: [C++][Python] Allow nullable schema in FlightInfo
#46489 opened
May 18, 2025 -
GH-43623: [R] remove libarrow backwards compatibility enforcement
#46491 opened
May 18, 2025 -
GH-24833 [JS] Implement IPC RecordBatch body buffer compression
#46493 opened
May 18, 2025 -
GH-46496: [CI][Dev] Fix shellcheck SC2086 errors in ci/scripts directory
#46497 opened
May 19, 2025 -
GH-46499: [CI][Crossbow][C++] Use apache/arrow for Meson
#46501 opened
May 19, 2025 -
GH-46500: [CI][Java] Remove CI scripts for Java (Except JNI)
#46502 opened
May 19, 2025 -
GH-46507: [C++] Make the aws sdk S3 lowSpeedLimit configurable from arrow S3Options
#46506 opened
May 19, 2025
29 Issues closed by 12 people
-
[CI][Dev] Add shellcheck files without change
#46494 closed
May 19, 2025 -
[C++] Configuration process: Could NOT find BrotliAlt
#46084 closed
May 19, 2025 -
[CI][Dev] Add shellcheck ci/scripts/install_ccache.sh
#46490 closed
May 19, 2025 -
[C++] Use FetchContent for bundled LZ4
#46487 closed
May 18, 2025 -
[CI][Dev] Add shellcheck files without change
#46482 closed
May 18, 2025 -
[C++] Fix GCS filesystem getFileInfo method.
#46414 closed
May 18, 2025 -
[C++][Docs] Fix code comment for Decimal type
#46473 closed
May 17, 2025 -
Fix Meson Configuration to Include Recent JSON Changes
#46478 closed
May 17, 2025 -
Writing UUID using PyArrow does not set the UUID logical type on Parquet
#46469 closed
May 16, 2025 -
[Documentation][C++][Acero] Move internal Swiss table doc into public C++ developer doc
#46444 closed
May 16, 2025 -
[GLib] Add missing `Since: ` tag
#46456 closed
May 16, 2025 -
[C++] Boost download still fails
#40756 closed
May 16, 2025 -
[R] hms::as_hms tests fail on some of our crossbow builds
#46442 closed
May 15, 2025 -
[Python] `pyarrow/_parquet.pxd` should go into `pyarrow/includes`
#46349 closed
May 15, 2025 -
[C++] DatasetWriter deadlocks on writing batch greater than max_rows_queued
#46420 closed
May 15, 2025 -
[GLib] Add GArrowFixedShapeDataType#strides
#46450 closed
May 15, 2025 -
[C++][Acero] Apparent deadlock in Table.join_asof
#46224 closed
May 14, 2025 -
[C++] Expose `{Array,...}FromJSON` as public APIs
#45908 closed
May 14, 2025 -
[C++][Dataset] Preserve order when writing dataset
#26818 closed
May 14, 2025 -
[GLib] Add GArrowFixedShapeDataType#dim_names
#46433 closed
May 14, 2025 -
[C++][Python][Parquet] Support Content-Defined Chunking of Parquet files
#45750 closed
May 13, 2025 -
[C++][R]: gcc-UBSAN errors on CRAN
#46394 closed
May 13, 2025 -
[Python][Parquet] Add EncryptionConfiguration.uniform_encryption to Python implementation
#38914 closed
May 13, 2025 -
[C++][Parquet] Tests skipped with "requires Snappy compression" even though ARROW_SNAPPY is enabled
#46424 closed
May 13, 2025 -
[C++] Remove duplicate function definition and synchronize arg names
#46419 closed
May 13, 2025 -
[Docs] Replace Xitter link with BlueSky link
#46376 closed
May 13, 2025 -
[Release][Packaging] APT Debian/Ubuntu `.debug` files not usable with `addr2line` due to release build
#46304 closed
May 13, 2025 -
[C++][Parquet] LoadEnumSafe for EdgeInterpolationAlgorithm trigger UB
#46417 closed
May 13, 2025 -
[GLib] Add GArrowFixedShapeDataType#permutation
#46400 closed
May 13, 2025
36 Issues opened by 21 people
-
[C++] Make the aws sdk S3 lowSpeedLimit configurable from arrow S3Options
#46507 opened
May 19, 2025 -
[CI][R]: R's nightly backwards compatibility job is failing to setup.
#46504 opened
May 19, 2025 -
[C++][CUDA] Implement GPUDirect data loading with IPC
#46503 opened
May 19, 2025 -
[CI][Java] Remove CI scripts for Java
#46500 opened
May 19, 2025 -
[CI][Crossbow][C++] Use apache/arrow for Meson
#46499 opened
May 19, 2025 -
[CI][MATLAB] Build failure (mpm version 2025.1 is not supported)
#46498 opened
May 19, 2025 -
[CI][Dev] Fix shellcheck SC2086 errors in ci/scripts directory
#46496 opened
May 19, 2025 -
[C#] Switch to license expression
#46488 opened
May 18, 2025 -
pyarrow JSON - support parsing from array of arrays
#46486 opened
May 17, 2025 -
[Flight][Python] `FlightInfo` segfaults when schema is None
#46481 opened
May 17, 2025 -
Consider adding `CITATION.cff` for citation details
#46480 opened
May 16, 2025 -
Use vendored flatbuffers in Meson configuration
#46477 opened
May 16, 2025 -
[Documentation][C++][Acero] Consolidate Acero developer doc
#46475 opened
May 16, 2025 -
Support for grouping in UUID columns
#46468 opened
May 16, 2025 -
[C++][Arrow Flight SQL ODBC] Refactor unnecessary nesting in include folders
#46465 opened
May 15, 2025 -
[C#] Expose method to compare 2 Tables
#46464 opened
May 15, 2025 -
[C++][Parquet] Avoid throwing away EncodedStatistics when checking is_stats_set
#46462 opened
May 15, 2025 -
[C++] Review `arrow/json` headers for internal APIs
#46461 opened
May 15, 2025 -
[C++] Review `arrow/csv` headers for internal APIs
#46460 opened
May 15, 2025 -
[C++] Review `arrow/util` headers for internal APIs
#46459 opened
May 15, 2025 -
[C++][Parquet] Review headers for internal APIs
#46458 opened
May 15, 2025 -
[C++][Dataset] Preserve order when writing dataset using TeeNode
#46454 opened
May 15, 2025 -
[R] CRAN packaging checklist for version 20.0.0.1
#46453 opened
May 15, 2025 -
[C++] ODBC - Implement SQLDriverConnect to handle FILEDSN and SAVEFILE keywords according to the spec
#46449 opened
May 14, 2025 -
[C++] ODBC - Implement `SQL_DRIVER_COMPLETE_REQUIRED` in SQLDriverConnect according to the spec
#46448 opened
May 14, 2025 -
[C++] Implement growth strategy for StringHeapBuilder / BinaryViewBuilder
#46440 opened
May 14, 2025 -
[C++] Address post-merge review comments in PR exposing {Array,...}FromJSON helpers in public API
#46439 opened
May 14, 2025 -
[C++][Parquet] `test-conda-cpp-valgrind` error in geospatial tests
#46435 opened
May 14, 2025 -
No obvious mechanism for partitioning groups of record batches
#46432 opened
May 13, 2025 -
The arrow R package v20 and v19 are substantially slower and use more memory than v17 for some operations
#46428 opened
May 13, 2025 -
[C++][Acero] Control over ordering in source nodes.
#46427 opened
May 13, 2025 -
[C++][Acero] Asofjoin does not propagate pause upstream
#46421 opened
May 13, 2025 -
[C++] Add dataset directory to Meson
#46411 opened
May 12, 2025 -
[C++] Add parquet option to Meson configuration
#46410 opened
May 12, 2025
50 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
GH-46371: [C++][Parquet] Parquet Variant decoding tools
#46372 commented on
May 19, 2025 • 60 new comments -
[C++][FlightRPC] Add Arrow Flight SQL ODBC driver
#40939 commented on
May 19, 2025 • 38 new comments -
GH-45653: [Python] Scalar subclasses should implement Python protocols
#45818 commented on
May 16, 2025 • 11 new comments -
GH-35166: [C++] Increase precision of decimals in sum aggregates
#44184 commented on
May 14, 2025 • 9 new comments -
GH-46336: [Release][Packaging] Add support for Reproducible Builds for source archive
#46342 commented on
May 15, 2025 • 6 new comments -
GH-46398: [GLib] Add GArrowFixedShapeTensorDataType#n_dimensions
#46399 commented on
May 19, 2025 • 3 new comments -
GH-32276: [C++][FlightRPC] Add option to align RecordBatch buffers given to IPC reader
#44279 commented on
May 15, 2025 • 3 new comments -
GH-46407: [C++] fix IPC serialization of sliced list arrays
#46408 commented on
May 19, 2025 • 1 new comment -
GH-46205: [C++][Parquet][WIP] Read/Write null count statistics for UNKNOWN sort order
#46275 commented on
May 14, 2025 • 1 new comment -
[C++] Add adapters/orc directory to Meson
#46375 commented on
May 12, 2025 • 0 new comments -
GH-39294: [C++][Python] DLPack on Tensor class
#42118 commented on
May 19, 2025 • 0 new comments -
GH-43379: [R][CI] Fix non pkg-config link order
#43353 commented on
May 19, 2025 • 0 new comments -
GH-46177: [C++][Compute] Enable MemAllocation::PREALLOCATE for DenseUnion, SparseUnion, ListView, LargeListView, BinaryView, StringView
#46317 commented on
May 17, 2025 • 0 new comments -
GH-46315 [C#] Apache Arrow Flight Middleware
#46316 commented on
May 14, 2025 • 0 new comments -
GH-44729: [C++][Acero] Enable nested types for non-keys fields in AsofJoin operation
#44871 commented on
May 14, 2025 • 0 new comments -
GH-17211: [C++] Add `hash32` and `hash64` scalar compute functions
#45001 commented on
May 14, 2025 • 0 new comments -
GH-45203: [C++][Acero] TeeNode metadata
#45211 commented on
May 16, 2025 • 0 new comments -
GH-45434: [C++][Acero] Add pipe_sink, pipe_source and pipe_tee nodes
#45435 commented on
May 13, 2025 • 0 new comments -
[EXP] GH-44792: [C++] Require C++20
#45445 commented on
May 19, 2025 • 0 new comments -
GH-41246: [C++][Python] Simplify nested field encryption configuration
#45462 commented on
May 16, 2025 • 0 new comments -
GH-45747: [C++] Remove deprecated ObjectType and FileStatistics, refactor hdfs code
#45998 commented on
May 19, 2025 • 0 new comments -
GH-46098: [C++] initial connection in ODBC layer
#46099 commented on
May 17, 2025 • 0 new comments -
GH-31387: [C++] Check nullability when validating fields on batches or struct arrays
#46129 commented on
May 13, 2025 • 0 new comments -
GH-46421: [C++][Acero] Asofjoin respect PauseProducing from downstream.
#46140 commented on
May 13, 2025 • 0 new comments -
GH-25025: [C++] Move non core compute kernels into separate shared library
#46261 commented on
May 19, 2025 • 0 new comments -
GH-46219: [C++][Parquet] Remove PARQUET_MINIMAL_DEPENDENCY option
#46274 commented on
May 16, 2025 • 0 new comments -
[C++] Add subdirectories to Meson Configuration
#45778 commented on
May 12, 2025 • 0 new comments -
[C++] Rust sliced ListArrays get corrupted by C++ IPC serialization
#46407 commented on
May 13, 2025 • 0 new comments -
[R] R arrow cannot handle labelled data in arrow tables
#45601 commented on
May 13, 2025 • 0 new comments -
[C++][Python] Failed to build pyarrow, missing Arrow C++
#46331 commented on
May 14, 2025 • 0 new comments -
[Swift] Publish Arrow Swift to `Swift Package Index`
#46382 commented on
May 14, 2025 • 0 new comments -
[C++][Statistics] Correct the Equal method for min and max in arrow::ArrayStatistics
#46395 commented on
May 14, 2025 • 0 new comments -
[C++][Acero] Not support type like Fixed Size List for non-key column in asof join node
#44729 commented on
May 14, 2025 • 0 new comments -
[JS] Apache Arrow does not compile with Typescript when types are checked
#46088 commented on
May 14, 2025 • 0 new comments -
[CI][Python] Conda Python 3.10 jobs fail with UnicodeDecodeError due to gdb issue
#46343 commented on
May 14, 2025 • 0 new comments -
[C++] Refactor bundled dependencies from ExternalProject to FetchContent
#45303 commented on
May 15, 2025 • 0 new comments -
[C++][Parquet] Reading/Writing string/binary types as/with the corresponding arrow view type in Parquet
#43041 commented on
May 15, 2025 • 0 new comments -
[C++][Parquet] Supports write BinaryView/StringView to Parquet file
#43244 commented on
May 15, 2025 • 0 new comments -
[Python] Compatibility with SciPy 1.15's stricter sparse code
#45229 commented on
May 15, 2025 • 0 new comments -
[Discuss][C++][Statistics] Should arrow::Array::Equals() check whether two arrow::Array have the same arrow::ArrayStatistics
#46396 commented on
May 16, 2025 • 0 new comments -
[CI][Dev] Apply ShellCheck lint to all shell scripts
#44748 commented on
May 17, 2025 • 0 new comments -
[R] Discussion: libarrow backwards compatibility enforcement
#43623 commented on
May 18, 2025 • 0 new comments -
[GLib] Add GArrowFixedShapeTensorDataType#n_dimensions
#46398 commented on
May 19, 2025 • 0 new comments -
[CI][Crossbow] Use apache/arrow instead of separated repository (e.g. ursacomputing/crossbow)
#46014 commented on
May 19, 2025 • 0 new comments -
[C++] Support for Baidu advanced file system (AFS)
#46256 commented on
May 19, 2025 • 0 new comments -
[C++] Avoid printing very large values
#46403 commented on
May 19, 2025 • 0 new comments -
pyarrow's GcsFileSystem fails with "SSL peer certificate or SSH remote key was not OK"
#36439 commented on
May 19, 2025 • 0 new comments -
[C++][Python] Incorrect result for `floor_temporal` with 3 and 'year'
#46301 commented on
May 19, 2025 • 0 new comments -
[C++][Docs] Update minimum GCC to 8 and C++ standard to C++20
#45885 commented on
May 19, 2025 • 0 new comments -
GH-41973: Expose new S3 option check_directory_existence_before_creation
#41998 commented on
May 19, 2025 • 0 new comments