[NIGHTLY] v22.02.00

Pre-release

🔗 Links

🚨 Breaking Changes

  • Refactor TableTest assertion methods to a separate utility class (#9762) @jlowe
  • Use cuFile direct device reads/writes by default in cuIO (#9722) @vuule
  • Add parameters to control row group size in Parquet writer (#9677) @vuule

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

  • Copy Java native dependencies directly into classpath (#9787) @jlowe
  • Refactor TableTest assertion methods to a separate utility class (#9762) @jlowe
  • Use Java classloader to find test resources (#9760) @jlowe
  • Skip cufile tests in JNI build script (#9744) @pxLi
  • Enable string to decimal 128 cast (#9742) @razajafri
  • Use stop instead of stop_. (#9735) @bdice
  • Forward-merge branch-21.12 to branch-22.02 (#9730) @bdice
  • Improve cmake format script (#9723) @vyasr
  • Use cuFile direct device reads/writes by default in cuIO (#9722) @vuule
  • Update check for inf/nan strings in libcudf float conversion to ignore case (#9694) @davidwendt
  • Update cudf JNI to 22.02.0-SNAPSHOT (#9681) @pxLi
  • Some improvements to parse_decimal function and bindings for is_fixed_point (#9658) @razajafri
  • Use List of Columns as Input for drop_nulls, gather and drop_duplicates (#9558) @isVoid
  • Simplify merge internals and reduce overhead (#9516) @vyasr
  • Simplify write_csv by removing unnecessary writer/impl classes (#9089) @cwharris

🚨 Breaking Changes

  • Remove Cython APIs for table view generation (#9199) @vyasr
  • Upgrade pandas version in cudf (#9147) @galipremsagar
  • Make AST operators nullable (#9096) @vyasr
  • Remove the option to pass data types as strings to read_csv and read_json (#9079) @vuule
  • Update JNI java CSV APIs to not use deprecated API (#9066) @revans2
  • Support additional format specifiers in from_timestamps (#9047) @davidwendt
  • Expose expression base class publicly and simplify public AST API (#9045) @vyasr
  • Add support for struct type in ORC writer (#9025) @vuule
  • Remove aliases of various api.types APIs from utils.dtypes. (#9011) @vyasr
  • Java bindings for conditional join output sizes (#9002) @jlowe
  • Move compute_column API out of ast namespace (#8957) @vyasr
  • cudf.dtype function (#8949) @shwina
  • Refactor Frame reductions (#8944) @vyasr
  • Add nested column selection to parquet reader (#8933) @devavret
  • JNI Aggregation Type Changes (#8919) @revans2
  • Add groupby_aggregation and groupby_scan_aggregation classes and force their usage. (#8906) @nvdbaranec
  • Expand CSV and JSON reader APIs to accept dtypes as a vector or map of data_type objects (#8856) @vuule
  • Change cudf docs theme to pydata theme (#8746) @galipremsagar
  • Enable compiled binary ops in libcudf, python and java (#8741) @karthikeyann
  • Make groupby transform-like op order match original data order (#8720) @isVoid

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

[NIGHTLY] v21.12.00

Pre-release

🔗 Links

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

  • Followup to PR 9088 comments (#9659) @cwharris
  • Update cuCollections to version that supports installed libcudacxx (#9633) @robertmaynard
  • Add 11.5 dev.yml to cudf (#9617) @galipremsagar
  • Add xfail for parquet reader 11.5 issue (#9612) @galipremsagar
  • remove deprecated Rmm.initialize method (#9607) @rongou
  • Use HostColumnVectorCore for child columns in JCudfSerialization.unpackHostColumnVectors (#9596) @sperlingxx
  • Set RMM pool to a fixed size in JNI (#9583) @rongou
  • Use nvCOMP for Snappy compression/decompression (#9582) @vuule
  • Build CUDA version agnostic packages for dask-cudf (#9578) @Ethyling
  • Fixed tests warning: "TYPED_TEST_CASE is deprecated, please use TYPED_TEST_SUITE" (#9574) @ttnghia
  • Enable CMake format in CI and fix style (#9570) @vyasr
  • Add NVTX Start/End Ranges to JNI (#9563) @abellina
  • Add librdkafka and python-confluent-kafka to dev conda environments s… (#9562) @jdye64
  • Add offsets_begin/end() to strings_column_view (#9559) @davidwendt
  • remove alignment options for RMM jni (#9550) @rongou
  • Add axis parameter passthrough to DataFrame and Series take for pandas API compatibility (#9549) @dantegd
  • Remove sizeof and standardize on memory_usage (#9544) @vyasr
  • Adds cudaProfilerStart/cudaProfilerStop in JNI api (#9543) @abellina
  • Generalize comparison binary operations (#9542) @vyasr
  • Expose APIs to wrap CUDA or RMM allocations with a Java device buffer instance (#9538) @jlowe
  • Add scan sum support for duration types to libcudf (#9536) @davidwendt
  • Force inlining to improve AST performance (#9530) @vyasr
  • Generalize some more indexed frame methods (#9529) @vyasr
  • Add Java bindings for rolling window stddev aggregation (#9527) @razajafri
  • catch rmm::out_of_memory exceptions in jni (#9525) @rongou
  • Add an overload of make_empty_column with type_id parameter (#9524) @ttnghia
  • Accelerate conditional inner joins with larger right tables (#9523) @vyasr
  • Initial pass of generalizing decimal support in cudf python layer (#9517) @galipremsagar
  • Cleanup for flattening nested columns (#9509) @rwlee
  • Enable running tests using RMM arena and async memory resources (#9506) @rongou
  • Remove dependency on six. (#9495) @bdice
  • Cleanup some libcudf strings gtests (#9489) @davidwendt
  • Rename strings/array_tests.cu to strings/array_tests.cpp (#9480) @davidwendt
  • Refactor sorting APIs (#9464) @vyasr
  • Implement DataFrame.hash_values, deprecate DataFrame.hash_columns. (#9458) @bdice
  • Deprecate Series.hash_encode. (#9457) @bdice
  • Update conda recipes for Enhanced Compatibility effort (#9456) @ajschmidt8
  • Small clean up to simplify column selection code in ORC reader (#9444) @vuule
  • add missing stream to scalar.is_valid() wherever stream is available (#9436) @karthikeyann
  • Adds Deprecation Warnings to one_hot_encoding and Implement get_dummies with Cython API (#9435) @isVoid
  • Update pre-commit hook URLs. (#9433) @bdice
  • Remove pyarrow import in dask_cudf.io.parquet (#9429) @charlesbluca
  • Miscellaneous improvements for UDFs (#9422) @isVoid
  • Use pre-commit for CI (#9412) @vyasr
  • Update to UCX-Py 0.23 (#9407) @pentschev
  • Expose OutOfBoundsPolicy in JNI for Table.gather (#9406) @abellina
  • Improvements to tdigest aggregation code. (#9403) @nvdbaranec
  • Add Java API to deserialize a table to host columns (#9402) @jlowe
  • Frame copy to use class instead of type() (#9397) @madsbk
  • Change all DeprecationWarnings to FutureWarning. (#9392) @bdice
  • Update Java nvcomp JNI bindings to nvcomp 2.x API (#9384) @jbrennan333
  • Add IndexedFrame class and move SingleColumnFrame to a separate module (#9378) @vyasr
  • Support Arrow NativeFile and PythonFile for remote ORC storage (#9377) @rjzamora
  • Use Arrow PythonFile for remote CSV storage (#9376) @rjzamora
  • Add multi-threaded writing to GDS writes (#9372) @devavret
  • Miscellaneous column cleanup (#9370) @vyasr
  • Use single kernel to extract all groups in cudf::strings::extract (#9358) @davidwendt
  • Consolidate binary ops into Frame (#9357) @isVoid
  • Move rank scan implementations from scan_inclusive.cu to rank_scan.cu (#9351) @davidwendt
  • Remove usage of deprecated thrust::host_space_tag. (#9350) @bdice
  • Use Default Memory Resource for Temporaries in reduction.cpp (#9344) @isVoid
  • Fix Cython compilation warnings. (#9327) @bdice
  • Fix some unused variable warnings in libcudf (#9326) @davidwendt
  • Use optional-iterator for copy-if-else kernel (#9324) @davidwendt
  • Remove Table class (#9315) @vyasr
  • Unpin dask and distributed in CI (#9307) @galipremsagar
  • Add optional-iterator support to indexalator (#9306) @davidwendt
  • Consolidate more methods in Frame (#9305) @vyasr
  • Add Arrow-NativeFile and PythonFile support to read_parquet and read_csv in cudf (#9304) @rjzamora
  • Pin mypy in .pre-commit-config.yaml to match conda environment pinning. (#9300) @bdice
  • Use gather.hpp when gather-map exists in device memory (#9299) @davidwendt
  • Fix Automerger for Branch-21.12 from branch-21.10 (#9285) @galipremsagar
  • Refactor cuIO timestamp processing with cuda::std::chrono (#9278) @PointKernel
  • Change strings copy_if_else to use optional-iterator instead of pair-iterator (#9266) @davidwendt
  • Update cudf java bindings to 21.12.0-SNAPSHOT (#9248) @pxLi
  • Various internal MultiIndex improvements (#9243) @vyasr
  • Add detail interface for split and slice(table_view), refactors both function with host_span (#9226) @isVoid
  • Refactor MD5 implementation. (#9212) @bdice
  • Update groupby result_cache to allow sharing intermediate results based on column_view instead of requests. (#9195) @karthikeyann
  • Use nvcomp's snappy decompressor in avro reader (#9181) @devavret
  • Add isocalendar API support (#9169) @marlenezw
  • Simplify read_json by removing unnecessary reader/impl classes (#9088) @cwharris
  • Simplify read_csv by removing unnecessary reader/impl classes (#9041) @cwharris
  • Refactor hash join with cuCollections multimap (#8934) @PointKernel

🚨 Breaking Changes

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements

v21.06.01
101fc0f

🚨 Breaking Changes

  • Add support for make_meta_obj dispatch in dask-cudf (#8342) @galipremsagar
  • Add separator-on-null parameter to strings concatenate APIs (#8282) @davidwendt
  • Introduce a common parent class for NumericalColumn and DecimalColumn (#8278) @vyasr
  • Update ORC statistics API to use C++17 standard library (#8241) @vuule
  • Preserve column hierarchy when getting NULL row from LIST column (#8206) @isVoid
  • Groupby.shift c++ API refactor and python binding (#8131) @isVoid

🐛 Bug Fixes

📖 Documentation

🚀 New Features

🛠️ Improvements