Insights: apache/arrow
Overview
40 Pull requests merged by 17 people
-
GH-46676: [C++][Python][Parquet] Allow reading Parquet LIST data as LargeList directly
#46678 merged
Jun 4, 2025 -
GH-46652: [Python][Docs] Update language for row_group_size parameter
#46653 merged
Jun 4, 2025 -
GH-46702: [JS] Remove js/
#46703 merged
Jun 4, 2025 -
GH-46611: [Python][C++] Allow building float16 arrays without numpy
#46618 merged
Jun 4, 2025 -
GH-46439: [C++] Rename internal Converter class in from_string.cc
#46697 merged
Jun 3, 2025 -
GH-46439: [C++] Remove unneeded namespace prefix in test_util_internal.h
#46695 merged
Jun 3, 2025 -
GH-46688: [Ruby] Fix a typo
#46689 merged
Jun 3, 2025 -
GH-46673: [CI][R][Docs] Accept empty INSTALL_ARGS again
#46682 merged
Jun 3, 2025 -
GH-46693: [CI] Update GitHub hosted runner from deprecated windows-2019 to windows-2022
#46694 merged
Jun 3, 2025 -
GH-46407: [C++] Fix IPC serialization of sliced list arrays
#46408 merged
Jun 3, 2025 -
GH-46572: [Python] expose filter option to python for join
#46566 merged
Jun 3, 2025 -
MINOR: [C++] Remove deprecated inclusion
#46672 merged
Jun 3, 2025 -
GH-46684: [C++] Fix Meson configuration issue on Windows
#46685 merged
Jun 3, 2025 -
GH-46679: [C++][Meson] Use WrapDB entry for gflags instead of CMake wrapper
#46680 merged
Jun 3, 2025 -
GH-46645: [CI][Dev][R] Use pre-commit for styler
#46664 merged
Jun 3, 2025 -
GH-46665: [CI][Crossbow][C++] Use apache/arrow for Alpine Linux
#46666 merged
Jun 3, 2025 -
GH-46508: [C++] Upgrade OpenTelemetry cpp to avoid build error on recent Clang
#46509 merged
Jun 2, 2025 -
MINOR: [R][Docs] Fix package title text not showing
#46640 merged
Jun 2, 2025 -
GH-46632: [R][Docs] Add docs for arrow::one
#46648 merged
Jun 2, 2025 -
GH-46642: [Format] Add footnote clarifying REE layout has O(log n) random access
#46643 merged
Jun 2, 2025 -
GH-46219: [C++][Parquet] Remove PARQUET_MINIMAL_DEPENDENCY option
#46274 merged
Jun 2, 2025 -
GH-46270: [C++][Parquet] Clarify GeoStatistics docstring
#46649 merged
Jun 2, 2025 -
GH-46599: [C++][Doc][Parquet] Update supported types documentation
#46620 merged
Jun 2, 2025 -
GH-46623: [C++][Compute] Fix the failure of large memory test in arrow-compute-row-test
#46635 merged
Jun 2, 2025 -
GH-46589: [C++] Fix utf8_is_digit to support full Unicode digit range
#46590 merged
Jun 2, 2025 -
GH-46546: [CI][Dev][Python] Use pre-commit for numpydoc
#46595 merged
Jun 2, 2025 -
GH-46662: [CI][Dev] Fix shellcheck SC2148 errors in ci/scripts directory
#46663 merged
Jun 2, 2025 -
GH-46146: [C++] Merge metadata in SchemaBuidler::AddMetadata
#46654 merged
Jun 2, 2025 -
GH-46656: [CI][Dev] Fix shellcheck SC2034 and SC2086 errors in ci/scripts directory
#46657 merged
Jun 2, 2025 -
GH-46155: [C++] Implement Tensorflow directory in Meson
#46156 merged
May 31, 2025 -
MINOR: [C++] Fix an obvious opentelemetry CMake typo
#46644 merged
May 30, 2025 -
GH-46080 [Python][Docs] Provide guidance for tzdata related issues if installing with pip
#46591 merged
May 30, 2025 -
MINOR: [C++] Fix OTel 1.19 compat after silent breaking change
#45859 merged
May 30, 2025 -
GH-46499: [CI][Crossbow][C++] Use apache/arrow for Meson
#46501 merged
May 30, 2025 -
GH-46593: [CI][Integration] Disable nested log grouping
#46594 merged
May 30, 2025 -
GH-46576: [C++] Suppress `codecvt_utf8` deprecation warning
#46622 merged
May 30, 2025 -
MINOR: [R][Release] Backport news changes from maint-20.0.0.1-r
#46639 merged
May 29, 2025 -
GH-41973: Expose new S3 option check_directory_existence_before_creation - manual rebase
#46619 merged
May 29, 2025 -
GH-46613: [GLib] Add GArrowBaseListDataType
#46615 merged
May 29, 2025
15 Pull requests opened by 13 people
-
[MINOR]: Add language-specific prompt to kapa.ai bot
#46637 opened
May 29, 2025 -
GH-46410: [C++] Add parquet options to Meson configuration
#46647 opened
May 30, 2025 -
GH-44208: [R] Adding test to ensure bit64's new semantic works with arrow
#46651 opened
May 30, 2025 -
GH-46515: [C++] Enable Multi Blocks in arrow::BinaryViewBuilder::AppendArraySlice
#46655 opened
May 31, 2025 -
GH-46659: [C++] Fix export of extension arrays with binary view/string view storage
#46660 opened
Jun 1, 2025 -
GH-46636: [R] Fix evaluation of external objects not in global environment in `case_when()`
#46667 opened
Jun 2, 2025 -
GH-46674: [C++] Construct Array from ExtensionType Scalar
#46675 opened
Jun 2, 2025 -
GH-46528: [CI][Dev] Remove "archery lint"
#46686 opened
Jun 3, 2025 -
GH-45332: [C++] Improve build time through forward Declarations
#46692 opened
Jun 3, 2025 -
GH-46439: [C++] Use result pattern for all FromJSONString Helpers
#46696 opened
Jun 3, 2025 -
GH-46699: [CI][Dev] fix shellcheck errors in the ci/scripts/cpp_test.sh
#46700 opened
Jun 4, 2025 -
GH-46704: [C++] Fix OSS-Fuzz build failure
#46706 opened
Jun 4, 2025 -
GH-46708: [C++][Gandiva] Added zero return values for castDECIMAL_utf8
#46709 opened
Jun 4, 2025 -
GH-46710: [C++] Fix ownership and lifetime issues in Dataset Writer
#46711 opened
Jun 4, 2025
44 Issues closed by 11 people
-
[C++][Parquet] Allow reading Parquet LIST data as LargeList directly
#46676 closed
Jun 4, 2025 -
[Python][Docs] Parquet row_group_size has confusing units
#46652 closed
Jun 4, 2025 -
[JS] Remove js/
#46702 closed
Jun 4, 2025 -
Arrow nightly Maven releases don't seem to work
#12570 closed
Jun 4, 2025 -
[Python] `pa.array(..., type=float16)` should accept Python floats
#46608 closed
Jun 4, 2025 -
[Python] Float16 conversion crashes if NumPy is not installed
#46611 closed
Jun 4, 2025 -
[C++] Address post-merge review comments in PR exposing {Array,...}FromJSON helpers in public API
#46439 closed
Jun 3, 2025 -
[CI][Docs] Docs preview jobs failing due to package not found
#46673 closed
Jun 3, 2025 -
[CI] Windows-2019 images will be removed on 2025-06-30 we should upgrade our jobs that use them
#46693 closed
Jun 3, 2025 -
[C++] Rust sliced ListArrays get corrupted by C++ IPC serialization
#46407 closed
Jun 3, 2025 -
[Python] Support filter option for hash join
#46572 closed
Jun 3, 2025 -
[Python] pyarrow.compute.and_ and operator.and_ not equal
#42119 closed
Jun 3, 2025 -
[Dev] Use pre-commit-hook instead of "archery lint" for all lints
#40417 closed
Jun 3, 2025 -
[C++] Meson Configuration Broken on Windows
#46684 closed
Jun 3, 2025 -
[C++][Meson] Use gflags wrapdb entry instead of custom CMake wrap
#46679 closed
Jun 3, 2025 -
[CI][Dev][R] Use pre-commit for styler
#46645 closed
Jun 3, 2025 -
[CI][Crossbow][C++] Use apache/arrow for Alpine Linux
#46665 closed
Jun 3, 2025 -
[C++] Build error on Apple Clang 17.0.0 when using bundled OpenTelemetry
#46508 closed
Jun 2, 2025 -
[R] update docgen script to include arrow::one
#46632 closed
Jun 2, 2025 -
[Format] Clarify that the REE layout has O(log n) random access
#46642 closed
Jun 2, 2025 -
[C++][Parquet] Remove PARQUET_MINIMAL_DEPENDENCY
#46219 closed
Jun 2, 2025 -
[C++][Parquet] Misleading `GeoStatistics::dimension_empty` docstring
#46270 closed
Jun 2, 2025 -
[C++][Doc][Parquet] Update doc for JSON, UUID
#46599 closed
Jun 2, 2025 -
[C++] Build REE and decode functions uncoditionally of ARROW_COMPUTE=ON/OFF
#45610 closed
Jun 2, 2025 -
[C++] Large memory test fails in arrow-compute-row-test
#46623 closed
Jun 2, 2025 -
[Python] `utf8_is_digit` in PyArrow doesn't fully match Python's `str.isdigit()` (e.g., fails for '³')
#46589 closed
Jun 2, 2025 -
[CI][Dev][Python] Use pre-commit for numpydoc
#46546 closed
Jun 2, 2025 -
[CI][Dev] Fix shellcheck SC2148 errors in ci/scripts directory
#46662 closed
Jun 2, 2025 -
[C++] arrow::SchemaBuidler::AddMetadata() replaces metadata not adds metadata
#46146 closed
Jun 2, 2025 -
[CI][Dev] Fix shellcheck SC2034 and SC2086 errors in ci/scripts directory
#46656 closed
Jun 2, 2025 -
[C++] Add adapters/tensorflow directory to Meson
#46155 closed
May 31, 2025 -
[Python] PyArrow cannot write ORC file with datetimes and timezones
#46080 closed
May 30, 2025 -
[CI][Crossbow][C++] Use apache/arrow for Meson
#46499 closed
May 30, 2025 -
[CI][Integration] Integration test logs may not hide details by group
#46593 closed
May 30, 2025 -
[R] CRAN packaging checklist for version 20.0.0
#46324 closed
May 30, 2025 -
[C++][FlightSQL][ODBC] Build error with deprecated C++-17 feature
#46576 closed
May 30, 2025 -
[CI][MATLAB] Build failure (mpm version 2025.1 is not supported)
#46498 closed
May 29, 2025 -
[R] Expose new S3 option check_directory_existence_before_creation
#41973 closed
May 29, 2025 -
[C++][Parquet] Rethink arrow::schema and binary_type config when reading arrow
#46631 closed
May 29, 2025 -
[GLib] Add GArrowBaseListDataType
#46613 closed
May 29, 2025 -
[C++] AMD64 Windows MinGW workflows are failing due to `protoc-gen-grpc`
#46634 closed
May 29, 2025 -
[Dev] Revert mistakenly pushed RC tags
#46630 closed
May 28, 2025
21 Issues opened by 15 people
-
[C++] Latent memory handling issues in Dataset Writer
#46710 opened
Jun 4, 2025 -
castDECIMAL_utf8 has undefined behaviour in case of invalid input value
#46708 opened
Jun 4, 2025 -
[C++][CI] OSS-Fuzz build failure
#46704 opened
Jun 4, 2025 -
[CI][Dev] fix shellcheck errors in the ci/scripts/cpp_test.sh
#46699 opened
Jun 4, 2025 -
[Python] Support `timestamp_parsers` for JSON input
#46698 opened
Jun 3, 2025 -
[GLib] _LIBCPP_ENABLE_ASSERTIONS is deprecated, please use _LIBCPP_HARDENING_MODE instead on macOS 15
#46690 opened
Jun 3, 2025 -
[CI] Occasional 403 forbidden from archive.ubuntu.com
#46687 opened
Jun 3, 2025 -
[Python] Missing `utf8_zfill` in pyarrow.compute to support `str.zfill` behavior
#46683 opened
Jun 3, 2025 -
[R] Add support for `%in%` within partitions
#46681 opened
Jun 2, 2025 -
[C++] Expose an BinaryViewBuilder interface for append a binary and multiple subslice
#46677 opened
Jun 2, 2025 -
[C++] Construct Array from ExtensionType Scalar
#46674 opened
Jun 2, 2025 -
[CI][Crossbow][Dev] Continue generating nightlies dashboard for `CI: Extra` scheduled runs
#46670 opened
Jun 2, 2025 -
[CI][Crossbow][Dev] Automate Zulip and email notifications for `CI: Extra` run nightly
#46669 opened
Jun 2, 2025 -
[Dev][R] Organize files for development
#46661 opened
Jun 1, 2025 -
Pre-commit fails if R is not installed
#46646 opened
May 30, 2025 -
[CI][Dev][Python] Validate Cython docstrings with pre-commit + numpydoc
#46641 opened
May 30, 2025 -
[R] `case_when()` sometimes doesn't find existing objects in the environment
#46636 opened
May 29, 2025
38 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
GH-46395: [C++][Statistics] Correct the Equal method for min and max in arrow::ArrayStatistics
#46422 commented on
Jun 4, 2025 • 45 new comments -
GH-25025: [C++] Move non core compute kernels into separate shared library
#46261 commented on
Jun 3, 2025 • 17 new comments -
GH-34785: [C++][Parquet] Parquet Bloom Filter Writer Implementation
#37400 commented on
Jun 4, 2025 • 16 new comments -
GH-31603: [C++] Add SecureString implementation to arrow/util/
#46626 commented on
Jun 4, 2025 • 16 new comments -
GH-46403: [C++] Add support for limiting element size when printing data
#46536 commented on
Jun 3, 2025 • 4 new comments -
GH-46513: [Archery] Add external library support in Archery
#46530 commented on
Jun 4, 2025 • 4 new comments -
GH-46488 [C#] Switch to license expression
#46624 commented on
May 29, 2025 • 4 new comments -
GH-35166: [C++] Increase precision of decimals in sum aggregates
#44184 commented on
Jun 1, 2025 • 4 new comments -
GH-46141: [C++] Add flight directory to Meson configuration
#46142 commented on
May 31, 2025 • 3 new comments -
GH-45723: [C++] FixedSizeListBuilder should have UnsafeAppend methods
#46126 commented on
Jun 2, 2025 • 3 new comments -
GH-46371: [C++][Parquet] Parquet Variant decoding tools
#46372 commented on
Jun 3, 2025 • 1 new comment -
GH-46562: [GLib][CI] Enable ARROW_GLIB_WERROR by default
#46563 commented on
May 31, 2025 • 0 new comments -
GH-37027: [C++] Add float16 kernels to if-else and vector-replace functions
#46446 commented on
Jun 3, 2025 • 0 new comments -
GH-46578: [C++][Statistics] Fix the ownership handling of arrow::ArrayData::statistics in arrow::ArrayData::CopyTo and arrow::ArrayDataViewOrCopy
#46625 commented on
May 29, 2025 • 0 new comments -
GH-37891: [C++] Refine several classes in parquet namespace
#46202 commented on
May 29, 2025 • 0 new comments -
GH-46421: [C++][Acero] Asofjoin respect PauseProducing from downstream.
#46140 commented on
Jun 4, 2025 • 0 new comments -
GH-30622: [C++] Arrow Flight SQL ODBC layer in Windows
#46099 commented on
May 28, 2025 • 0 new comments -
GH-44800: [C#] Implement Flight SQL Client
#44783 commented on
May 28, 2025 • 0 new comments -
[R] Intermittent memory leaks in the valgrind nightly test
#33094 commented on
Jun 4, 2025 • 0 new comments -
[C++][Statistics][Docs] Clarify whether arrow::ArrayStatistics is discarded during View, Slice, and Copy operations in the documentation for arrow::Array and arrow::ArrayData
#46485 commented on
May 29, 2025 • 0 new comments -
[C++][Parquet] Integer dictionary bitwidth preservation breaks multi-file read behaviour in pyarrow 20
#46629 commented on
May 29, 2025 • 0 new comments -
[C++] FixedSizeListBuilder should have UnsafeAppend methods
#45723 commented on
May 30, 2025 • 0 new comments -
[C++] Add parquet option to Meson configuration
#46410 commented on
May 30, 2025 • 0 new comments -
[C++] Incorrect behavior for arrow::BinaryVilewBuilder::AppendArraySlice
#46515 commented on
May 31, 2025 • 0 new comments -
[Python] S3FileSystem unable to gain access in the context of a pod on AWS EKS using Pod Identity Association
#45603 commented on
Jun 2, 2025 • 0 new comments -
[R] write_dataset returns nothing
#40723 commented on
Jun 2, 2025 • 0 new comments -
[C++][CI] Have a job with ARROW_LARGE_MEMORY_TESTS enabled
#46600 commented on
Jun 2, 2025 • 0 new comments -
[C++][Parquet] Encoding tools for variant type
#46555 commented on
Jun 2, 2025 • 0 new comments -
[Python] Allow retrieving fragments of a filtered dataset.
#36127 commented on
Jun 2, 2025 • 0 new comments -
[C++] Improve Build Time Through ArrayData forward Declarations
#45332 commented on
Jun 3, 2025 • 0 new comments -
[CI][Dev] Lint job failed with timeout
#46528 commented on
Jun 3, 2025 • 0 new comments -
[C++] StructBuilder should have UnsafeAppend methods
#45722 commented on
Jun 3, 2025 • 0 new comments -
[C++][Python] pyarrow.compute.field("x").isin([0.0]) incorrectly filters out all rows when they all match
#46183 commented on
Jun 3, 2025 • 0 new comments -
[Python] Dataset.to_batches accumulates memory usage and leaks
#39808 commented on
Jun 3, 2025 • 0 new comments -
[C++] Separate Protobuf dependencies into a `libarrow_proto.so`
#33633 commented on
Jun 3, 2025 • 0 new comments -
[Python] Jobs fail if Pyarrow version is not correctly generated due to missing remote dev tags
#44803 commented on
Jun 3, 2025 • 0 new comments -
[C++][Python][Parquet] Files with very large data page header can't be read with `pyarrow`
#46404 commented on
Jun 4, 2025 • 0 new comments -
[C++][CUDA] Implement GPUDirect data loading with IPC
#46503 commented on
Jun 4, 2025 • 0 new comments