Skip to content

v0.7.10

Choose a tag to compare

@github-actions github-actions released this 30 Apr 01:34
· 163 commits to refs/heads/main since this release
7f26424

What's Changed πŸš€

✨ Features

πŸ› Bug Fixes

  • fix(test): update test_local_full_ls to expect canonical file:// URIs @rohitkulshreshtha (#6824)
  • fix: ensure paimon is installed for integration tests @rchowell (#6827)
  • fix(udf): resolve use_process=True subprocess deadlocks @rohitkulshreshtha (#6793)
  • fix(udf): handle UDF expressions with no column references (#6805) @rohitkulshreshtha (#6814)
  • fix(io): emit canonical file:// URIs for Windows drive paths @rohitkulshreshtha (#6817)
  • fix(docs): fix many docs issues @colin-ho (#6811)
  • fix(checkpoint): use strip_file_uri_to_path in put_bytes for Windows @rohitkulshreshtha (#6796)
  • fix(checkpoint): normalize Windows tempdir paths in s3_store tests @rohitkulshreshtha (#6791)
  • fix: respect per-method max_retries/on_error overrides @BABTUNA (#6784)
  • fix(core): normalize FixedSizeListArray inner field name to 'item' @veinkr-bot (#6733)
  • fix(tests): deflake test_sharding_with_file_scan @veinkr-bot (#6787)
  • fix: preserve identity partition predicates when combined with ScalarFn siblings @gavin9402 (#6695)
  • fix(flight-shuffle): handle all-empty inputs in shuffle cache and shu… @ohbh (#6780)
  • fix(flotilla): overreporting of bytes.read @universalmind303 (#6774)
  • fix: add missing serde feature to uuid workspace dependency @chenghuichen (#6773)
  • fix: respect proxy env vars (HTTP_PROXY, HTTPS_PROXY, etc.) in S3 client @BABTUNA (#6679)
  • fix: fix column not found when using count_rows() for sparse data @caican00 (#6703)
  • fix(scheduler): include dispatched tasks in autoscaling ratio @desmondcheongzx (#6388)
  • fix(io): Write metrics in close for the last batch @kvthr (#6606)
  • fix(docs): replace deprecated .struct.get() with .get() expression in examples @everettVT (#6709)
  • fix: incrementally ramp up Ray autoscaler resource requests to avoid exceeding cluster capacity @ohbh (#6653)
  • fix: Fix document example, unnest is not a param on prompt @colin-ho (#6712)
  • fix(stats): Track source execution time per message in pipeline stats @samstokes (#6715)
  • fix: add tenacity retries to Google Sheets upload in benchmarking @jeevb (#6713)
  • fix: skip schema pruning on Source node when can_absorb_select is false @helmanofer (#6501)
  • fix(io): support writing to local fs via GravitinoGvfs local issue @qingfeng-occ (#6579)

πŸš€ Performance

♻️ Refactor

  • refactor(flight): implement into_partitions for flight shuffle @ohbh (#6764)
  • refactor(flight): implement distributed gather for flight shuffle @ohbh (#6751)
  • refactor(partition-refs): consolidate partition ref types and trim unused derives @ohbh (#6742)
  • refactor(distributed): Remove node_origin_id self-ref in distributed @cckellogg (#6738)
  • refactor(inline-agg): macro-generate accumulator dispatch @BABTUNA (#6642)
  • refactor: flight-server passes PartitionRefs directly to the scheduler @ohbh (#6627)
  • refactor(daft-distributed): cleanup of statistics_manager @universalmind303 (#6718)

πŸ“– Documentation

βœ… Tests

πŸ”§ Maintenance

Full Changelog: v0.7.9...v0.7.10