Insights: apache/arrow
Overview
30 Pull requests merged by 18 people
-
MINOR: [C#] Bump BenchmarkDotNet and 12 others
#46887 merged
Jun 25, 2025 -
GH-46788: [C++][Parquet] Enable SIMD for byte stream split with 2 streams
#46789 merged
Jun 25, 2025 -
GH-40754: [Python] Expose tls_ca_file_path to S3FileSystem
#45881 merged
Jun 25, 2025 -
GH-46827: [C++] Update Meson Configuration for compute shared lib
#46839 merged
Jun 25, 2025 -
GH-41496: [Python][Azure][Docs] Turn on azure on debian-docs
#46892 merged
Jun 25, 2025 -
GH-46899: [CI][Dev] Fix shellcheck errors in the ci/scripts/install_numba.sh
#46900 merged
Jun 25, 2025 -
GH-45083: [C++] Add HalfFloat kernels for is_nan, is_inf, is_finite, negate, negate_checked, sign
#46866 merged
Jun 25, 2025 -
GH-46214: [C++] Improve S3 client initialization
#46723 merged
Jun 25, 2025 -
GH-46895: [CI][Dev] Fix shellcheck errors in the ci/scripts/install_minio.sh
#46896 merged
Jun 25, 2025 -
GH-46724: [C++][Parquet] OSSFuzz: Prevent from Bad-cast in handling statistics
#46725 merged
Jun 24, 2025 -
GH-46746: [C++] Assume AWS SDK >= 1.11.0
#46742 merged
Jun 24, 2025 -
GH-36753: [C++] Properly pretty-print and diff HalfFloatArrays
#46857 merged
Jun 24, 2025 -
GH-46881: [CI][Dev] Fix shellcheck errors in the ci/scripts/install_gcs_testbench.sh
#46882 merged
Jun 24, 2025 -
GH-46877: [MATLAB] Add `arrow.tabular.Table.fromRecordBatches` static method
#46885 merged
Jun 23, 2025 -
GH-46871: [C++][Parquet] Restore implementation of 3 arrow::FileReader::GetRecordBatchReader() functions
#46868 merged
Jun 23, 2025 -
GH-46879: [CI][Packaging][Linux] Don't check example build with old CMake
#46880 merged
Jun 23, 2025 -
GH-44502: [R] Negative fractional dates must be converted to integers by floor, not trunc
#46873 merged
Jun 23, 2025 -
GH-46846: [CI][Dev] Fix shellcheck errors in the ci/scripts/install_dask.sh
#46847 merged
Jun 23, 2025 -
GH-46094: [C++][Docs] Add note to RleDecoder::Get's doc comment
#46874 merged
Jun 23, 2025 -
GH-45713: [GLib] Add garrow_chunked_array_(import|export)()
#46876 merged
Jun 22, 2025 -
GH-46841 [C++][Gandiva] Fix date trunc edge case
#46842 merged
Jun 21, 2025 -
GH-46674: [C++] Construct Array from ExtensionType Scalar
#46675 merged
Jun 20, 2025 -
GH-46831: [C++][R] Remove some pending references to CMake < 3.25 (docs + minor CMake references)
#46834 merged
Jun 20, 2025 -
GH-46771: [Python][C++] Implement pa.arange function to generate array sequences
#46778 merged
Jun 20, 2025 -
GH-46843: [C++] Don't use unity build for bundled AWS SDK for C++
#46845 merged
Jun 20, 2025 -
GH-37561: [Ruby] Add empty chunked array tests for Arrow::Table#each_raw_records
#46862 merged
Jun 19, 2025 -
GH-46864: [C++] Add half-float test for `ArrayFromJSONString`
#46865 merged
Jun 19, 2025 -
GH-37891: [C++][Parquet] Refine several classes in Parquet encryption
#46202 merged
Jun 19, 2025 -
MINOR: [Ruby] Simplify dense union array RecordBatch creation in tests
#46850 merged
Jun 19, 2025 -
GH-46854: [CI][MATLAB][Packaging] Add support for MATLAB `R2025a` in CI and crossbow packaging workflows
#46855 merged
Jun 19, 2025
12 Pull requests opened by 11 people
-
GH-46777: [C++] Use SimplifyIsIn only when the value_set of the expression is lower than a threshold
#46859 opened
Jun 19, 2025 -
GH-46863: [CI][C++] Fix downcast error in AWS SDK for C++
#46870 opened
Jun 20, 2025 -
GH-45098 [R] Provide a translation for data.table::fcase
#46878 opened
Jun 22, 2025 -
GH-45290: [Docs][Release] Change show_version_warning_banner substitution
#46883 opened
Jun 23, 2025 -
GH-46179: [Python] Bump index level once if pandas df already contains __index_level_i__ column
#46884 opened
Jun 23, 2025 -
GH-43891: [C++][Parquet] Faster reading of FIXED_LEN_BYTE_ARRAY data
#46886 opened
Jun 23, 2025 -
GH-44435: [GLib] Add distinct count support to GArrowArrayStatistics
#46894 opened
Jun 24, 2025 -
GH-46897: [Docs][C++][Python] Fix asof join documentation
#46898 opened
Jun 24, 2025 -
GH-46901: [C++] Support modulo compute kernel
#46902 opened
Jun 25, 2025 -
GH-46903: [CI] Automatically flag stale issues
#46904 opened
Jun 25, 2025 -
GH-46375: [C++] Add adapters/orc directory to Meson
#46906 opened
Jun 25, 2025 -
GH-38369: [MATLAB] Create utility functions for simplifying management of `Proxy` instances for `Array`s
#46907 opened
Jun 25, 2025
55 Issues closed by 11 people
-
[C#] Add ToList method to all Array
#23892 closed
Jun 25, 2025 -
[C++] Enable SIMD for Byte Stream Split with 2 streams
#46788 closed
Jun 25, 2025 -
[Python] Cannot read data if endpoint is s3 on a "secure" Minio server
#40754 closed
Jun 25, 2025 -
[C++] Meson Configuration Broken After Compute Module Refactor
#46827 closed
Jun 25, 2025 -
[Python][Azure][Docs] Add documentation about AzureFilesystem
#41496 closed
Jun 25, 2025 -
[CI][Dev] Fix shellcheck errors in the ci/scripts/install_numba.sh
#46899 closed
Jun 25, 2025 -
[C++][Python] support `is_nan` for float16
#45083 closed
Jun 25, 2025 -
Build python lib failed for ARMv8
#32752 closed
Jun 25, 2025 -
[R] open_dataset fails to open single compressed csv
#31973 closed
Jun 25, 2025 -
[R] Implement bindings for stringr::str_split_fixed
#30364 closed
Jun 25, 2025 -
[R] Implement bindings for stringr's combining strings functions
#30367 closed
Jun 25, 2025 -
[R] Implement bindings for stringr::str_glue
#30388 closed
Jun 25, 2025 -
[R] Implement bindings for stringr::str_flatten
#30387 closed
Jun 25, 2025 -
[R] Implement bindings for stringr::str_glue_data
#30389 closed
Jun 25, 2025 -
[R] R arrow cannot handle labelled data in arrow tables
#45601 closed
Jun 25, 2025 -
[C++] S3FileSystem construction causes IDMS lookups for region even when specified
#46214 closed
Jun 25, 2025 -
[CI][Dev] Fix shellcheck errors in the ci/scripts/install_minio.sh
#46895 closed
Jun 25, 2025 -
[C++][Parquet] Bad-cast in handling statistics
#46724 closed
Jun 24, 2025 -
[C++] Assume AWS SDK >= 1.11.0
#46746 closed
Jun 24, 2025 -
[C++] unsupported cast from halffloat to utf8
#32802 closed
Jun 24, 2025 -
[C++][Python] Support pretty printing of float16
#36753 closed
Jun 24, 2025 -
[CI][Dev] Fix shellcheck errors in the ci/scripts/install_gcs_testbench.sh
#46881 closed
Jun 24, 2025 -
[MATLAB] Add `arrow.tabular.Table.fromRecordBatches` static method
#46877 closed
Jun 23, 2025 -
[Python] - Schema inference from pandas reorders struct fields
#45640 closed
Jun 23, 2025 -
[C++][Parquet] Restore implementation of 3 arrow::FileReader::GetRecordBatchReader() functions
#46871 closed
Jun 23, 2025 -
[CI][Packaging][Linux] Failed example builds in package verification
#46879 closed
Jun 23, 2025 -
[R] Sub-day precision Date class must be converted to integers by floor, not trunc
#44502 closed
Jun 23, 2025 -
[CI][Dev] Fix shellcheck errors in the ci/scripts/install_dask.sh
#46846 closed
Jun 23, 2025 -
[C++][Parquet] RleDecoder::Get fails to return false on end for some bit widths
#46094 closed
Jun 23, 2025 -
[C++] Persist CMake options in generated header
#21354 closed
Jun 22, 2025 -
[C++] Review hardcoded "lib" paths in Find$PACKAGE.cmake related to endogenous libraries
#21518 closed
Jun 22, 2025 -
[Release] Build and publish Rust docs
#22005 closed
Jun 22, 2025 -
[Release] Document to use SNAPSHOT versions in pom.xml files for patch releases
#22413 closed
Jun 22, 2025 -
[Archery] lint sub-command should provide a --fail-fast option
#23158 closed
Jun 22, 2025 -
[Archery] Consider to use archery with or instead of the pre-commit hooks
#23447 closed
Jun 22, 2025 -
[Crossbow] Consider removing artifact patterns
#23872 closed
Jun 22, 2025 -
[GLib] Add `garrow_chunked_array_import()` and `garrow_chunked_array_export()`
#45713 closed
Jun 22, 2025 -
[R] Add str() methods to objects
#24871 closed
Jun 21, 2025 -
[C++][Gandiva] DATE_TRUNC with DAY rounds up to next day sometimes
#46841 closed
Jun 21, 2025 -
[MATLAB] Update MATLAB CI workflows to use MATLAB `R2023b`
#37809 closed
Jun 20, 2025 -
[MATLAB] arrow tabular variable name with hyperlink is not displayed correctly
#38540 closed
Jun 20, 2025 -
[MATLAB] Add release notes for initial release of the MATLAB interface
#38613 closed
Jun 20, 2025 -
[CI][MATLAB] Tests fail with Licensing error
#40034 closed
Jun 20, 2025 -
[C++] Construct Array from ExtensionType Scalar
#46674 closed
Jun 20, 2025 -
[C++] Improve Substrait consumer example
#33956 closed
Jun 20, 2025 -
[C++] Update handling of UUID type in Substrait integration
#43856 closed
Jun 20, 2025 -
[R] SystemRequirements documentation incorrect for cmake version
#46831 closed
Jun 20, 2025 -
[Python] If numpy is available use it for normalizing slice
#46771 closed
Jun 20, 2025 -
[C++] Bundled AWS SDK for C++ can't be built with unity build
#46843 closed
Jun 20, 2025 -
[Ruby] Add tests to consider an empty chunked array for Arrow::Table#each_raw_records
#37561 closed
Jun 19, 2025 -
[C++] Missing half-float test for ArrayFromJSONString
#46864 closed
Jun 19, 2025 -
[C++] Pass shared_ptr<DataType> by value to parametric type constructors
#37891 closed
Jun 19, 2025 -
[C++][CI] test-debian-12-docs fails to build AWS SDK (Unity build)
#46861 closed
Jun 19, 2025 -
GH-45937: [C++][Parquet] Logical type definition for variant
#46104 closed
Jun 19, 2025 -
[CI][MATLAB][Packaging] Add support for MATLAB `R2025a` in CI and crossbow packaging workflows
#46854 closed
Jun 19, 2025
14 Issues opened by 10 people
-
[Format] Add a Canonical Variant Extension Type
#46908 opened
Jun 25, 2025 -
[Parquet][C++] Expose Statistics.is_{min/max}_value_exact if they are present
#46905 opened
Jun 25, 2025 -
[CI] Automatically flag stale issues
#46903 opened
Jun 25, 2025 -
[C++][Compute] Add modulo compute kernel
#46901 opened
Jun 25, 2025 -
[Docs][Python][C++] Asof join documentation is wrong/incomplete
#46897 opened
Jun 24, 2025 -
[C++] Add tests for HalfFloatScalar
#46893 opened
Jun 24, 2025 -
Move `protobuf_test_util.cc` into a common location for all tests that use protobuf
#46889 opened
Jun 23, 2025 -
Remove default_option to set release buildtype in Meson configuration
#46888 opened
Jun 23, 2025 -
[GLib][Ruby] undefined method 'compute_initialize' for module Arrow (NoMethodError)
#46875 opened
Jun 20, 2025 -
[C++] Move Arange utility function to an Arrow C++ utility function
#46872 opened
Jun 20, 2025 -
[C++][Parquet] Add `Result<shared_ptr>` versions of `parquet:arrow::FileReader::GetRecordBatchReader()`
#46869 opened
Jun 20, 2025 -
[CI][C++][R] R Sanitizer for M1 fails on S3FileSystem
#46863 opened
Jun 19, 2025 -
[C++] Half-float builder should accept Float16
#46860 opened
Jun 19, 2025 -
[C++] Half-float scalars should use Float16 for value type, not uint16_t
#46858 opened
Jun 19, 2025
291 Unresolved conversations
Sometimes conversations happen on old items that aren’t yet closed. Here is a list of all the Issues and Pull Requests with unresolved conversations.
-
GH-34785: [C++][Parquet] Parquet Bloom Filter Writer Implementation
#37400 commented on
Jun 25, 2025 • 32 new comments -
GH-17211: [C++] Add `hash32` and `hash64` scalar compute functions
#45001 commented on
Jun 25, 2025 • 13 new comments -
GH-46683: Add utf8_zfill kernel for sign-aware zero padding
#46815 commented on
Jun 24, 2025 • 12 new comments -
GH-46833:[Python][Azure] Adding ConfigureClientSecretCredential to AzureFileSystem
#46837 commented on
Jun 25, 2025 • 9 new comments -
GH-46818: [Docs][C++] Add missing method description in type.h
#46819 commented on
Jun 24, 2025 • 3 new comments -
GH-46791: [C++] Add `Status::OrElse`
#46792 commented on
Jun 24, 2025 • 3 new comments -
GH-46677: [C++] Expose an BinaryViewBuilder interface for append a binary and multiple subslice
#46730 commented on
Jun 23, 2025 • 3 new comments -
GH-45798: [C++] Add extension directory to Meson
#45799 commented on
Jun 25, 2025 • 1 new comment -
GH-46714: [C++] Use hidden symbol visibility in Meson configuration
#46715 commented on
Jun 25, 2025 • 1 new comment -
[Python] arrow_to_pandas.cc has separate code paths for populating list<T> values into an object array
#22840 commented on
Jun 21, 2025 • 0 new comments -
[C++] CSV reader accept schema
#16876 commented on
Jun 21, 2025 • 0 new comments -
[C++] Create "ARROW_LIBRARIES" argument to pass list of desired components to build
#22942 commented on
Jun 21, 2025 • 0 new comments -
[C++] -Dzlib_SOURCE=BUNDLED on Windows does not produce arrow.dll with zlib statically linked
#22944 commented on
Jun 21, 2025 • 0 new comments -
[Website] Transition to new .asf.yaml machinery for website publishing
#22836 commented on
Jun 21, 2025 • 0 new comments -
[Crossbow] Unify the version numbers generated by crossbow and rake
#22972 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Add accessors for ColumnChunk KeyValue metadata
#42904 commented on
Jun 21, 2025 • 0 new comments -
GH-46207: [C++] Rename arrow::util::StringBuilder and move to internal namespace
#46813 commented on
Jun 25, 2025 • 0 new comments -
[C++] Support dictionary unification on dictionaries having nulls
#22981 commented on
Jun 21, 2025 • 0 new comments -
[Python] Consider adding some user-friendly conveniences to Filesystem API
#23005 commented on
Jun 21, 2025 • 0 new comments -
[Python] Add Array ctor microbenchmarks
#23028 commented on
Jun 21, 2025 • 0 new comments -
[C++/Python] S3FileSystem.create_dir should raise for a nested directory with recursive keyword set to False
#23032 commented on
Jun 21, 2025 • 0 new comments -
[Release] Document environment configuration to run release verification on macOS
#23092 commented on
Jun 21, 2025 • 0 new comments -
[C++] Provide API for IPC roundtrip of RecordBatches not using the encapsulated message format
#23120 commented on
Jun 21, 2025 • 0 new comments -
[R] Explore roxygen2 R6 class documentation
#23127 commented on
Jun 21, 2025 • 0 new comments -
[Archery] Cleanup integration module to use companion classes
#23148 commented on
Jun 21, 2025 • 0 new comments -
[Archery] Benchmark diff should provide a TUI friendly output
#23159 commented on
Jun 21, 2025 • 0 new comments -
[R] Add col_select argument to read_ipc_stream
#16921 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Ability to concat parquet files
#42896 commented on
Jun 21, 2025 • 0 new comments -
[Release] Dockerize post-03-website.sh script
#22420 commented on
Jun 21, 2025 • 0 new comments -
[FlightRPC] Allow timeouts on all stream reads
#22464 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] RecordReader::Reset logic is inefficient for small reads
#16826 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Key rotation tool
#42899 commented on
Jun 21, 2025 • 0 new comments -
[C++] ExtensionType subclass for "unknown" types?
#22572 commented on
Jun 21, 2025 • 0 new comments -
[Python] Add pyarrow.read_schema to API documentation, add prose documentation for schema serialization workflow
#16841 commented on
Jun 21, 2025 • 0 new comments -
[Python] Serialising numpy array yields `pyarrow.lib.ArrowNotImplementedError: list<item: float>`
#22610 commented on
Jun 21, 2025 • 0 new comments -
[R] Deduplicate strings using Arrow hash tables instead of passing all values through R's global hash table
#22622 commented on
Jun 21, 2025 • 0 new comments -
[Python][Parquet] Add examples of using HDFS filesystem and Parquet files together
#22625 commented on
Jun 21, 2025 • 0 new comments -
[C++] File system API should expand paths
#22703 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Provide for readahead-buffering in column readers
#42903 commented on
Jun 21, 2025 • 0 new comments -
[Python] Conversion of numpy array to pyarrow.Tensor: Negative ndarray strides not supported
#22726 commented on
Jun 21, 2025 • 0 new comments -
[C++] Raw data equality in arrays vs. semantic value equality
#22734 commented on
Jun 21, 2025 • 0 new comments -
[C++] Extending STL API to support row-wise conversion
#22751 commented on
Jun 21, 2025 • 0 new comments -
[C++] Consider implementing BufferOuputStream using BufferBuilder internally
#22761 commented on
Jun 21, 2025 • 0 new comments -
[C++] More extensive attributes usage could improve debugging
#22772 commented on
Jun 21, 2025 • 0 new comments -
[C++][Gandiva] Remove gandiva pointer aliases
#23711 commented on
Jun 21, 2025 • 0 new comments -
[C++] Remove compute pointer aliases
#23712 commented on
Jun 21, 2025 • 0 new comments -
[Python] Allow HDFS FileSystem to be created without Hadoop present
#23752 commented on
Jun 21, 2025 • 0 new comments -
[Python][Parquet] Performance regression in pyarrow-0.15.1 vs pyarrow-0.12.1 when reading a "partitioned parquet table" ?
#23815 commented on
Jun 21, 2025 • 0 new comments -
[C++][Dataset] Ability to restrict Hive partitioning to certain fields
#23895 commented on
Jun 21, 2025 • 0 new comments -
[Python] Expose dataset PartitioningFactory.inspect ?
#23898 commented on
Jun 21, 2025 • 0 new comments -
[C++][Dataset] Handle DictType index mismatch better
#23901 commented on
Jun 21, 2025 • 0 new comments -
[Python] csv.ConvertOptions Do Not Pass Through/Retain Nullability from Schema
#23903 commented on
Jun 21, 2025 • 0 new comments -
[Python] csv.ConvertOptions Documentation Is Unclear Around Disabling Type Inference
#17032 commented on
Jun 21, 2025 • 0 new comments -
[R] Add option to preserve dictionary logical type rather than coerce to factor
#23904 commented on
Jun 21, 2025 • 0 new comments -
[Format][C++] Add tensor and sparse tensor supports in File metadata
#23941 commented on
Jun 21, 2025 • 0 new comments -
[C++][Dataset] Give more informative error message for mismatching schemas for FileSystemSources
#23946 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] 1.4.0+ reader ignore stats created by 1.3.* writer
#42929 commented on
Jun 21, 2025 • 0 new comments -
[C++] Add Result<T> to the Visitor pattern
#17047 commented on
Jun 21, 2025 • 0 new comments -
[C++] Add a facility to create a Bitmap buffer from an data pointer with a specified sentinel
#24004 commented on
Jun 21, 2025 • 0 new comments -
[C++] Simplify build-support/run-test.sh
#24045 commented on
Jun 21, 2025 • 0 new comments -
[C++] Add benchmarks for arrow/util/rle_encoder.h for non-dictionary encodings.
#24908 commented on
Jun 20, 2025 • 0 new comments -
[C++/Python] retrieve fd of open memory mapped file and Open() memory mapped file by fd
#23170 commented on
Jun 21, 2025 • 0 new comments -
[Dataset][C++] RecordBatchProjector is not thread safe
#23184 commented on
Jun 21, 2025 • 0 new comments -
[Python] Define PyObjectBuffer with Py_XDECREF logic in destructor for object array memory
#23233 commented on
Jun 21, 2025 • 0 new comments -
[Packaging][Crossbow] Always upload binary artifacts regardless of the test result
#23259 commented on
Jun 21, 2025 • 0 new comments -
[Developer] Add Windows utility script to use Dependencies.exe to dump DLL dependencies for diagnostic purposes
#23386 commented on
Jun 21, 2025 • 0 new comments -
[C++][Dataset] Implement ScalarAsStatisctics for non-primitive types
#23387 commented on
Jun 21, 2025 • 0 new comments -
[C++] In CMake output, list each enabled thirdparty toolchain dependency and the reason for its being enabled
#23396 commented on
Jun 21, 2025 • 0 new comments -
[C++][CI] Hiveserver2 instegration test fails to connect to impala container
#23433 commented on
Jun 21, 2025 • 0 new comments -
[C++][CMake] Automatically set ARROW_GANDIVA_PC_CXX_FLAGS for conda and OSX sdk
#23434 commented on
Jun 21, 2025 • 0 new comments -
[CI] Turn off unnecessary features in the integration tests (spark/turbodbc/dask/hdfs)
#23444 commented on
Jun 21, 2025 • 0 new comments -
[Archery] Create a wrapper script in archery for docker compose in order to run the containers with the host's user and group
#16959 commented on
Jun 21, 2025 • 0 new comments -
[C++][Gandiva] Implement regexp_matches, regexp_like functions
#23503 commented on
Jun 21, 2025 • 0 new comments -
[C++][Documentation] Document how to set installed location for individual toolchain components
#23517 commented on
Jun 21, 2025 • 0 new comments -
[C++] Gandiva exposes LLVM symbols
#23536 commented on
Jun 21, 2025 • 0 new comments -
[C++] clang-tidy diagnostics not emitted for most headers
#23591 commented on
Jun 21, 2025 • 0 new comments -
[Python] Writing partitions with NaNs silently drops data
#16990 commented on
Jun 21, 2025 • 0 new comments -
[Dev] "archery lint" outputs unapplicable patch file
#16998 commented on
Jun 21, 2025 • 0 new comments -
Write Dataset with file_visitor core dump
#46800 commented on
Jun 24, 2025 • 0 new comments -
[C++][Parquet] Integer dictionary bitwidth preservation breaks multi-file read behaviour in pyarrow 20
#46629 commented on
Jun 23, 2025 • 0 new comments -
[Python] Table.from_pandas creates duplicate column names if the dataframe already contains __index_level_i__ columns
#46179 commented on
Jun 23, 2025 • 0 new comments -
[C++] Remove deprecated ObjectType and FileStatistics
#45747 commented on
Jun 23, 2025 • 0 new comments -
[C++][Python] Remove deprecated functions in 21.0.0
#45968 commented on
Jun 23, 2025 • 0 new comments -
[Docs][Release][Website] Figure out why the version banner code changed in 19.0.0
#45290 commented on
Jun 23, 2025 • 0 new comments -
[Python] read_csv converts strings with leading zeros to integers
#46853 commented on
Jun 23, 2025 • 0 new comments -
[GLib] Add Duration type support
#23970 commented on
Jun 23, 2025 • 0 new comments -
[Python][Parquet] Expose ParquetWriter properties and arrow_properties
#45902 commented on
Jun 23, 2025 • 0 new comments -
Support translations of simple data.table functions
#45098 commented on
Jun 22, 2025 • 0 new comments -
[GLib] Add support for arrow::DictionaryBuilder
#21428 commented on
Jun 22, 2025 • 0 new comments -
[C++][Python] Set up testing for backwards compatibility of the parquet reader
#22325 commented on
Jun 22, 2025 • 0 new comments -
[C++] Support for textual, JSON schema representation
#25078 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Schema resolution: map encoding
#42288 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Develop external predicate pushdown API for column readers
#42363 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Add ability to parse nested schemas from text specification like parquet-mr
#42410 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Determine a good default page size
#42520 commented on
Jun 21, 2025 • 0 new comments -
GH-46767: [C++] Enable EqualOptions::use_atol_ for arrow::Array, arrow::Scalar, arrow::RecordBatch, and arrow::ChuckedArray
#46779 commented on
Jun 19, 2025 • 0 new comments -
GH-46728: [Python] Skip test_gdb.py tests if PyArrow wasn't built debug
#46755 commented on
Jun 24, 2025 • 0 new comments -
GH-46740: [C++] Update bundled Thrift to 0.22.0
#46741 commented on
Jun 25, 2025 • 0 new comments -
GH-46410: [C++] Add parquet options to Meson configuration
#46647 commented on
Jun 25, 2025 • 0 new comments -
GH-46336: [Release][Packaging] Add support for Reproducible Builds for source archive
#46342 commented on
Jun 23, 2025 • 0 new comments -
GH-31387: [C++] Check nullability when validating fields on batches or struct arrays
#46129 commented on
Jun 24, 2025 • 0 new comments -
GH-46734: [C++] Arrow Flight SQL ODBC layer in Windows
#46099 commented on
Jun 24, 2025 • 0 new comments -
GH-45747: [C++] Remove deprecated ObjectType and FileStatistics, refactor hdfs code
#45998 commented on
Jun 23, 2025 • 0 new comments -
GH-42971: [C++] Parquet stream writer: Allow writing BYTE_ARRAY with converted type NONE
#44739 commented on
Jun 24, 2025 • 0 new comments -
GH-38558: [C++] Add support for null sort option per sort key
#38584 commented on
Jun 25, 2025 • 0 new comments -
[MATLAB] Create utility functions for simplifying management of `Proxy` instances for `Array`s
#38369 commented on
Jun 25, 2025 • 0 new comments -
[Format] Consider adding an official variant type to Arrow
#42069 commented on
Jun 25, 2025 • 0 new comments -
[GLib] Add `GArrowArrayStatistics`
#44435 commented on
Jun 25, 2025 • 0 new comments -
[Benchmarking][R] conbench is failing
#46716 commented on
Jun 25, 2025 • 0 new comments -
[Python][Packaging] Stop uploading nightly wheels to gemfury
#43904 commented on
Jun 25, 2025 • 0 new comments -
[Python] Automatically support fsspec filesystem URIs
#44900 commented on
Jun 24, 2025 • 0 new comments -
[Python] pa.array raises for mixed scalar types (float16 + int)
#25859 commented on
Jun 24, 2025 • 0 new comments -
[C++] FlightDataStream should be higher-level
#16731 commented on
Jun 21, 2025 • 0 new comments -
[C++] See if possible to deduplicate dictionaries in IPC streams in some way
#21799 commented on
Jun 21, 2025 • 0 new comments -
[C++] DictionaryBuilder provides information to determine array builder type at run-time
#21814 commented on
Jun 21, 2025 • 0 new comments -
[Documentation] Migrate FAQ page to Sphinx / rst around release time
#21989 commented on
Jun 21, 2025 • 0 new comments -
[C++] Refactor Buffers method on concatenate to consolidate code.
#21996 commented on
Jun 21, 2025 • 0 new comments -
[Python] Allow parsing more general JSON formats
#22011 commented on
Jun 21, 2025 • 0 new comments -
[C++] Improve clang-tidy speed
#16763 commented on
Jun 21, 2025 • 0 new comments -
[C++][Gandiva] Expression cache should not use ToString on data type
#22073 commented on
Jun 21, 2025 • 0 new comments -
[Crossbow] Support GitLab runners
#16773 commented on
Jun 21, 2025 • 0 new comments -
[R] Improve autosplice cpp code
#22186 commented on
Jun 21, 2025 • 0 new comments -
[C++] CSV reader: Ability to not infer column types.
#22232 commented on
Jun 21, 2025 • 0 new comments -
[Python] Exceptions swallowed in ParquetManifest._visit_directories
#22244 commented on
Jun 21, 2025 • 0 new comments -
[Doc] Better document the Tensor classes in the prose documentation
#22275 commented on
Jun 21, 2025 • 0 new comments -
[Python] Support optional arrow components via extras_require
#22300 commented on
Jun 21, 2025 • 0 new comments -
[Python] conversion from datetime objects with mixed timezones should normalize to UTC
#16806 commented on
Jun 21, 2025 • 0 new comments -
[CI] Build bundled dependencies in docker build step
#22324 commented on
Jun 21, 2025 • 0 new comments -
[Developer][Documentation] Organize source and binary dependency licenses into directories
#22363 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Remove redundant total_byte_size calculation
#42542 commented on
Jun 21, 2025 • 0 new comments -
[Python] Implement conversion between integer coded as floating points with NaN to an Arrow integer type
#16131 commented on
Jun 21, 2025 • 0 new comments -
[C++] File and stream APIs for interacting with "large" schemas
#16203 commented on
Jun 21, 2025 • 0 new comments -
[C++] gandiva-micro_benchmarks is broken in MSVC build
#21374 commented on
Jun 21, 2025 • 0 new comments -
[C++/Python] Test build with conda's defaults channel
#21398 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] page level min / max written by parquet-cpp is not recognized by parquet-tools
#42361 commented on
Jun 21, 2025 • 0 new comments -
[C++][Parquet] Detect parquet-mr style dictionary_page
#42881 commented on
Jun 21, 2025 • 0 new comments -
[Doc][C++] Add examples on how to construct with ArrayData::Make instead of builder classes
#21499 commented on
Jun 21, 2025 • 0 new comments -
[Python] Failed to convert 'float' to 'double' with using pandas_udf and pyspark
#21514 commented on
Jun 21, 2025 • 0 new comments -
[C++] Implement direct support for shared memory arrow columns
#21559 commented on
Jun 21, 2025 • 0 new comments -
[C++] Build toolchain support for libcurl
#16051 commented on
Jun 21, 2025 • 0 new comments -
[Packaging] [C++/Python] Add conda package verification scripts
#21593 commented on
Jun 21, 2025 • 0 new comments -
[Integration] Update turbodbc integration test to install a pinned version in the Dockerfile
#21616 commented on
Jun 21, 2025 • 0 new comments -
[C++] Share more of the IPC testing utils with the rest of Arrow
#21623 commented on
Jun 21, 2025 • 0 new comments -
[C++] Test and benchmark libraries library search path subtly affected by installation
#21677 commented on
Jun 21, 2025 • 0 new comments -
[Crossbow] Send notifications only for failures
#21734 commented on
Jun 21, 2025 • 0 new comments -
[C++] allow construction of ArrayBuilders from existing arrays
#21788 commented on
Jun 21, 2025 • 0 new comments -
[C++] Unable to load libjvm on ppc64le architecture for hdfs.connect()
#17212 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add VectorFunction wrapping arrow::Concatenate
#25122 commented on
Jun 20, 2025 • 0 new comments -
[C++] Deprecate or remove Scalar::Parse and Scalar::CastTo
#25124 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Large decimal values don't roundtrip correctly
#42960 commented on
Jun 20, 2025 • 0 new comments -
[C++] jemalloc_set_decay_ms precedence
#25126 commented on
Jun 20, 2025 • 0 new comments -
[Python] hdfs fails to connect to for HDFS 3.x cluster
#25136 commented on
Jun 20, 2025 • 0 new comments -
[Python] Add tests to verify that one can build a C++ extension against the manylinux1 wheels
#25149 commented on
Jun 20, 2025 • 0 new comments -
[C++] Writing IPC messages with 64-byte buffer alignment vs. 8-byte default
#25151 commented on
Jun 20, 2025 • 0 new comments -
[C++] overloaded virtual function "arrow::io::Writable::Write" is only partially overridden in class
#25157 commented on
Jun 20, 2025 • 0 new comments -
[C++][Gandiva][MinGW] Enable crashed tests
#25187 commented on
Jun 20, 2025 • 0 new comments -
[Python] csv.read_csv progress bar
#17224 commented on
Jun 20, 2025 • 0 new comments -
[C++][Compute] Provide a kernel property testing API
#25246 commented on
Jun 20, 2025 • 0 new comments -
[Python] pandas index information gets lost when partition_cols are used
#17227 commented on
Jun 20, 2025 • 0 new comments -
[C++][Compute] Formalize Op functor concept
#25267 commented on
Jun 20, 2025 • 0 new comments -
[FlightRPC][C++][Python] Allow updating TLS certificate at runtime
#17233 commented on
Jun 20, 2025 • 0 new comments -
[Python][Dataset] Write a custom field to _metadata caching file size
#17242 commented on
Jun 20, 2025 • 0 new comments -
[C++][Developer] Implement tool to compile and run C++ benchmarks from master branch against older codebase revisions
#25336 commented on
Jun 20, 2025 • 0 new comments -
[Format] Add forward compatibility checks for Decimal::bitWidth to reference libraries
#25337 commented on
Jun 20, 2025 • 0 new comments -
[C++] Make ThreadPool task ordering configurable
#24915 commented on
Jun 20, 2025 • 0 new comments -
[Docs] Integration testing instructions for base docker image are incorrect
#17182 commented on
Jun 20, 2025 • 0 new comments -
[Python] Column names of type CategoricalIndex fails to convert back to pandas
#24955 commented on
Jun 20, 2025 • 0 new comments -
[R] Implementing tidyr interface
#24956 commented on
Jun 20, 2025 • 0 new comments -
[C++] Selective compression on the wire
#24984 commented on
Jun 20, 2025 • 0 new comments -
[Release] Website release notes count not strictly release associated patches
#24988 commented on
Jun 20, 2025 • 0 new comments -
[C++] Listing files with S3FileSystem is slow
#25019 commented on
Jun 20, 2025 • 0 new comments -
[C++] C++ array kernels framework and execution buildout (umbrella issue)
#25028 commented on
Jun 20, 2025 • 0 new comments -
[C++] Determine strategy for propagating failures in initializing built-in function registry in arrow/compute
#25031 commented on
Jun 20, 2025 • 0 new comments -
[C++] Determine desirable maximum length for ExecBatch in pipelined and parallel execution of kernels
#25032 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add "TypeResolver" class interface to replace current OutputType::Resolver pattern
#25052 commented on
Jun 20, 2025 • 0 new comments -
[Python] Add necessary plumbing to enable Numba-generated functions to be registered as functions in the global C++ function/kernels registry
#25064 commented on
Jun 20, 2025 • 0 new comments -
[C++] Parallelize execution of arrow::compute::ScalarFunction
#25065 commented on
Jun 20, 2025 • 0 new comments -
[C++] Arrow-native C++ Data Frame-style programming interface for analytics (umbrella issue)
#17201 commented on
Jun 20, 2025 • 0 new comments -
[Python] An independent Cython package for Cython-based projects that want to program against the C data interface
#25072 commented on
Jun 20, 2025 • 0 new comments -
[C++][Python] Make reading functions to return consistent exceptions
#25109 commented on
Jun 20, 2025 • 0 new comments -
[C++] Benchmark hash table against thirdparty options, possibly vendor a thirdparty hash table library
#25111 commented on
Jun 20, 2025 • 0 new comments -
[C++] Scalar formatting code used in array/diff.cc should be reusable
#25115 commented on
Jun 20, 2025 • 0 new comments -
[C++/Python] Kernel for SetItem(IntegerArray, values) ("replace_with_indices")
#25505 commented on
Jun 20, 2025 • 0 new comments -
[C++/Python] Add option to Take kernel to interpret negative indices as indexing from the right
#25506 commented on
Jun 20, 2025 • 0 new comments -
[C++/Python] Add option to Take kernel to interpret negative indices as NULL
#25507 commented on
Jun 20, 2025 • 0 new comments -
[C++] Optimize IPC stream reading
#25514 commented on
Jun 20, 2025 • 0 new comments -
[C++] TableReader support protobuf
#25530 commented on
Jun 20, 2025 • 0 new comments -
[C++] Scan Dataset in reverse
#17277 commented on
Jun 20, 2025 • 0 new comments -
[C++] Column type inference in read_csv vs. open_csv. CSV conversion error to null
#17278 commented on
Jun 20, 2025 • 0 new comments -
[C++] Fix test case TestSchemaMetadata.MetadataVersionForwardCompatibility
#25548 commented on
Jun 20, 2025 • 0 new comments -
[C++] Ensure private functions are static or in an anonymous namespace
#25563 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Consider revamping RleDecoder based on "upstream" changes in Apache Impala
#17282 commented on
Jun 20, 2025 • 0 new comments -
[C++] Improve clarity of MapBuilder API
#25578 commented on
Jun 20, 2025 • 0 new comments -
[Python] read_csv() case of user specified column_names AND include_columns
#25590 commented on
Jun 20, 2025 • 0 new comments -
[Python] Array.__str__ shows misleading output for timestamp types with time zone set
#25593 commented on
Jun 20, 2025 • 0 new comments -
[Python] Memorymapped arrow file conversion to parquet loads everything into RAM
#25594 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add option to disable jemalloc background thread on Linux
#17285 commented on
Jun 20, 2025 • 0 new comments -
[Packaging][wheel] Could NOT Find LLVM_DIR Error when Building with Gandiva
#25600 commented on
Jun 20, 2025 • 0 new comments -
[R] A "cannot handle Array of type <utf8_view>" error occurs when reading an IPC file that includes strings output by r-polars
#45523 commented on
Jun 19, 2025 • 0 new comments -
[Python] Severe performance regression in isin() filter After pyarrow v18
#46777 commented on
Jun 19, 2025 • 0 new comments -
[C++] Add vectorized "IntegersMultipleOf" to arrow/util/int_util.h
#25348 commented on
Jun 20, 2025 • 0 new comments -
[Python][Packaging] Enable S3 support in Windows wheels
#25362 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add crossbow job to capture build setup
#25368 commented on
Jun 20, 2025 • 0 new comments -
[C++] Implement PrettyPrint for Scalars
#25373 commented on
Jun 20, 2025 • 0 new comments -
[C++] Detect unauthorized memory allocations in function kernels
#17251 commented on
Jun 20, 2025 • 0 new comments -
[R] Add chunk_size to Table$create()
#25384 commented on
Jun 20, 2025 • 0 new comments -
[Python] Specifying columns in a dataset drops the index (pandas) metadata.
#25391 commented on
Jun 20, 2025 • 0 new comments -
[C++] Use feature enum
#25400 commented on
Jun 20, 2025 • 0 new comments -
[C++][Dataset][Python] ParquetDataset typecast on read
#25411 commented on
Jun 20, 2025 • 0 new comments -
[Python][Dataset] Support pickling of ParquetFileFragment's RowGroupInfo
#25417 commented on
Jun 20, 2025 • 0 new comments -
[C++] Replace usages of TestBase::MakeRandomArray in testing/gtest_util.h with RandomArrayGenerator
#25431 commented on
Jun 20, 2025 • 0 new comments -
[Python] parquet.read_table causes crashes on Windows Server 2016 w/ Xeon Processor
#25432 commented on
Jun 20, 2025 • 0 new comments -
[Java] Document how to set netty/unsafe allocators
#25440 commented on
Jun 20, 2025 • 0 new comments -
[FlightRPC][Integration] Add support for setting metadata version for integration tests
#25458 commented on
Jun 20, 2025 • 0 new comments -
[Python] Expose CpuInfo for informational / debugging purposes
#25476 commented on
Jun 20, 2025 • 0 new comments -
[C++/Python] Support necessary functionality to have an Arrow-string type in pandas
#25481 commented on
Jun 20, 2025 • 0 new comments -
[C#] Show API docs on arrow.apache.org
#25485 commented on
Jun 20, 2025 • 0 new comments -
[C++] SchemaFromJSON for testing deeply nested schemas
#17269 commented on
Jun 20, 2025 • 0 new comments -
[C++][Gandiva] Add CMake support for compiling LLVM's IR into a library
#24051 commented on
Jun 21, 2025 • 0 new comments -
[Developer][C++] IWYU fails on include-cycle in uriparser/Uri.h
#24206 commented on
Jun 20, 2025 • 0 new comments -
[Python] Define API for user-defined conversions of array cell values in pyarrow.array
#24220 commented on
Jun 20, 2025 • 0 new comments -
[Developer][Integration] Consolidate example JSON and test/validate uniformly
#24252 commented on
Jun 20, 2025 • 0 new comments -
[Release] Ensure that the JIRAs belonging to the release's commits have the proper version number
#24259 commented on
Jun 20, 2025 • 0 new comments -
[Python] Externalize option whether to bundle zlib DLL in Python packages
#24280 commented on
Jun 20, 2025 • 0 new comments -
[C++] Create CMake utility to streamline creating ADD_$COMPONENT_TEST helper functions
#24325 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add "ON only if system dependencies available" build mode for certain optional Arrow components
#24359 commented on
Jun 20, 2025 • 0 new comments -
[FlightRPC][C++] DoPutPayloadWriter doesn't always expose server error message
#24363 commented on
Jun 20, 2025 • 0 new comments -
[Python] Refactor context_choices in test_cuda_numba_interop to be a module level fixture
#24375 commented on
Jun 20, 2025 • 0 new comments -
[Python] Empty table creation from schema with nested dictionary type
#24395 commented on
Jun 20, 2025 • 0 new comments -
[C++] Sanitize hdfs host when creating HadoopFileSystem from endpoint
#24408 commented on
Jun 20, 2025 • 0 new comments -
[Python][Dataset] Consider adding Cast like operation
#24409 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add "random access" / slice read API to RecordBatchFileReader
#24446 commented on
Jun 20, 2025 • 0 new comments -
[C++] Simplify IPC tests by using BufferOutputStreams
#24472 commented on
Jun 20, 2025 • 0 new comments -
[Gandiva][UDF] Solutions to register new UDFs dynamically without checking it into arrow repo.
#24502 commented on
Jun 20, 2025 • 0 new comments -
[Archery] Comment bot should report any errors happening during crossbow submit
#24548 commented on
Jun 20, 2025 • 0 new comments -
[Python] Get Access to the type_to_type_id dictionary
#24549 commented on
Jun 20, 2025 • 0 new comments -
[Python][Parquet] Crash on parquet.read_table on windows python 3.82
#24569 commented on
Jun 20, 2025 • 0 new comments -
[C#] BinaryArray.Builder Reserve/Resize methods are broken
#25449 commented on
Jun 20, 2025 • 0 new comments -
[C#] Enhance TimeStampArray to support IANA/Olson TZ strings
#25583 commented on
Jun 20, 2025 • 0 new comments -
[C++][Compute] Add scalar_hash function
#17211 commented on
Jun 20, 2025 • 0 new comments -
[MATLAB] Create a superclass for `DateType`-related MATLAB tests
#37577 commented on
Jun 20, 2025 • 0 new comments -
[MATLAB] Add utility for hyperlinking text for use in display in the Command Window
#37958 commented on
Jun 20, 2025 • 0 new comments -
[MATLAB] Add utility for bolding text for use in display in the MATLAB Command Window
#37957 commented on
Jun 20, 2025 • 0 new comments -
[MATLAB] Add tests verifying `arrow.array.<Type>Array` throws an error when given arrays of the wrong type.
#35644 commented on
Jun 20, 2025 • 0 new comments -
[MATLAB] Report error when building the MATLAB Interface to Arrow in Debug mode on Windows
#35239 commented on
Jun 20, 2025 • 0 new comments -
[MATLAB] Add `NumNulls` property to `arrow.array.Array` class
#38422 commented on
Jun 20, 2025 • 0 new comments -
[MATLAB][Packaging] Add script for uploading Release Candidate (RC) MLTBX packages for the MATLAB bindings to the Apache Arrow GitHub Releases area
#40924 commented on
Jun 20, 2025 • 0 new comments -
[MATLAB] Update arrow/matlab/README.md with the latest change.
#38572 commented on
Jun 20, 2025 • 0 new comments -
[Gandiva] Add support for literal variables
#24052 commented on
Jun 20, 2025 • 0 new comments -
[C++][Documentation] Link to generated Doxygen docs from main Sphinx TOC tree
#24064 commented on
Jun 20, 2025 • 0 new comments -
[Python] Segfault in pandas version 1.0.1, read_parquet after creating a clickhouse odbc connection
#24098 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Examine Arrow-decoding perf regressions introduced by PARQUET-1797
#24161 commented on
Jun 20, 2025 • 0 new comments -
[C++] Deduplicate schema equivalence checks
#24165 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add short representation string to common classes
#24183 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add field to IpcReadOptions to include padding in Buffer metadata accounting
#24194 commented on
Jun 20, 2025 • 0 new comments -
[C++] Rearrange code in bit-util.h/.cc for AppendWord
#24760 commented on
Jun 20, 2025 • 0 new comments -
[C++][Dataset] Add test case to check if all essential properties are preserved after ScannerBuilder::Project is called
#24761 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Optional parallel processing when writing Parquet files
#24764 commented on
Jun 20, 2025 • 0 new comments -
[C++] Expose API for pushing down rep/def level comparison down to decoder
#24765 commented on
Jun 20, 2025 • 0 new comments -
[C++] Implement "round robin" scheduler interface to fixed-size ThreadPool
#24788 commented on
Jun 20, 2025 • 0 new comments -
[Python] Test error message when discovering dataset with invalid files
#24813 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add support for gflags version detection
#24814 commented on
Jun 20, 2025 • 0 new comments -
[C++][Gandiva] Reduce number of files and headers
#24822 commented on
Jun 20, 2025 • 0 new comments -
[C++] Add multi-consumer Scheduler API to sit one layer above ThreadPool
#24827 commented on
Jun 20, 2025 • 0 new comments -
[Format] Create reference implementations of IPC RecordBatch body compression from ARROW-300
#24829 commented on
Jun 20, 2025 • 0 new comments -
[Python] supporting pandas sparse series in pyarrow
#24837 commented on
Jun 20, 2025 • 0 new comments -
[Doc] General introduction to archery
#24846 commented on
Jun 20, 2025 • 0 new comments -
[C++] Don't re-initialize Minio in every s3fs benchmark
#17164 commented on
Jun 20, 2025 • 0 new comments -
[Dev] Use --password-stdin for docker login from archery
#24854 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Require error message when using ParquetException::EofException
#42957 commented on
Jun 20, 2025 • 0 new comments -
[C++] Implement Array to JSON function
#24864 commented on
Jun 20, 2025 • 0 new comments -
[Python][Documentation] Add column limit recommendations Parquet page
#24897 commented on
Jun 20, 2025 • 0 new comments -
[R] Add bindings to ConcatenateTables
#24898 commented on
Jun 20, 2025 • 0 new comments -
[Python][Dataset] Infer the filesystem from the first path if multiple paths are passed to dataset()
#24583 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Expose an API that surface RLE information for rep/def levels when reading parquet files
#42377 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Expose an API that allows direct writing of RLE information for rep/def levels when writing parquet files
#42949 commented on
Jun 20, 2025 • 0 new comments -
Read and write dataset description in both R and Python
#24587 commented on
Jun 20, 2025 • 0 new comments -
[Gandiva][UDF] Support complex datatype for UDF return type.
#24588 commented on
Jun 20, 2025 • 0 new comments -
[C++] gcc6 warning re: arrow::internal::ArgSort
#17127 commented on
Jun 20, 2025 • 0 new comments -
[C++] Default display for multi-choice define_option_string is misleading
#24601 commented on
Jun 20, 2025 • 0 new comments -
[Gandiva][UDF] Add a udf for gandiva to extract all named groups.
#24623 commented on
Jun 20, 2025 • 0 new comments -
[Python] Crash in lib.concat_tables on Windows
#24637 commented on
Jun 20, 2025 • 0 new comments -
[Python][R] Expose incremental write API for Feather files
#24644 commented on
Jun 20, 2025 • 0 new comments -
[C++] Create "libarrow_thrift" containing all code requiring the Thrift libraries
#24650 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Add benchmarks for rep/def level decoding at multiple levels
#24663 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Create randomized nested data generation round trip read/write unit tests
#24664 commented on
Jun 20, 2025 • 0 new comments -
[C++][Parquet] Implement non-vectorized array reconstruction logic.
#24667 commented on
Jun 20, 2025 • 0 new comments -
[Python] Create tools to enable optional components (like Gandiva, Flight) to be built and deployed as separate Python packages
#24688 commented on
Jun 20, 2025 • 0 new comments -
[C++][CSV] Issue building CSV component under GCC 6.1.0
#24703 commented on
Jun 20, 2025 • 0 new comments -
[Python] Allow fast writing of Decimal column to parquet
#24713 commented on
Jun 20, 2025 • 0 new comments -
[Packaging][Python] macOS wheels occasionally exceed travis build time limit
#24747 commented on
Jun 20, 2025 • 0 new comments