Skip to content

Releases: cowsdb/libcows

1.1.1

17 Jan 12:53
e2c234a
Compare
Choose a tag to compare
Update README.md

1.1.0

20 Dec 15:33
a7752f8
Compare
Choose a tag to compare
Update build_lib.yml

1.0.2

12 Dec 08:58
4500abd
Compare
Choose a tag to compare

1.0.1

20 Nov 14:30
4500abd
Compare
Choose a tag to compare
  • based on 1.0.0rc2

1.0.0

17 Nov 15:17
4500abd
Compare
Choose a tag to compare
Prepare for 0.16.0 arm64

0.16.0

09 Nov 19:16
4500abd
Compare
Choose a tag to compare

chdb 0.16.0

based on clickhouse 23.10

Query Enhancements

  • Vector Addition:

    • python3 -m chdb "SELECT [1, 2, 3] + [4, 5, 6]".
  • Omit file() Function:

    • python3 -m chdb "SELECT * from '/home/Clickhouse/bench/hits_0.parquet' limit 10".
  • NumPy as Input Format:

    • Support for NumPy as an input format with the query SELECT * FROM 'data.npy'.
  • Parquet Optimizations:

    • Writing parquet files is 10x faster, it's multi-threaded now. Almost the same speed as reading.
    • Parquet filter pushdown. I.e. when reading Parquet files, row groups (chunks of the file) are skipped based on the WHERE condition and the min/max values in each column.
    • Optimize reading small row groups by batching them together in Parquet.
  • Condition Pushdown for ORC:

    • Using data skipping indices in ORC, similarly to Parquet.
  • PRQL Support:

    • Added support for PRQL as a query language.
  • urlCluster Function:

    • Add urlCluster table function.

New Features

  • Introducing arrayFold for applying a lambda function to multiple arrays.
  • Extended support for asynchronous inserts with external data via the native protocol.
  • Introduced function jsonMergePatch for merging JSON strings.
  • Continued support for Kusto Query Language dialect with Phase 1 implementation.
    - Introduced a new SQL function arrayRandomSample for sampling elements from an input array.
    - Added support for dropping cache for Protobuf format with SYSTEM DROP SCHEMA FORMAT CACHE [FOR Protobuf].
  • Conditions on arguments of a table with a space-filling curve in its key can now be used for indexing.
  • New setting force_optimize_projection_name checks that a projection is used in the query.
  • Added aggregation function lttb using the Largest-Triangle-Three-Buckets algorithm for downsampling data.
  • CHECK TABLE query has better performance and usability, supporting checking particular parts.
    - Introduced function byteSwap for reversing the bytes of unsigned integers.
    - Added functions formatQuery and formatQuerySingleLine for formatted SQL query output.
    - Introduced DWARF input format for reading debug symbols from an ELF file.
    - Introduced SHOW SETTING setting_name as a simpler version of SHOW SETTINGS.
    - Added fields substreams and filenames to the system.parts_columns table.
    - Introduced a setting create_table_empty_primary_key_by_default for default ORDER BY ().

Performance Improvements

  • Fixed contention on Context lock, significantly improving performance for short-running concurrent queries.
  • Improved the performance of inverted index creation by 30%.
  • Optimized memory consumption for external aggregation with many temporary files.
  • Added option query_plan_preserve_num_streams_after_window_functions to preserve the number of streams after evaluating window functions.
  • Released more streams if data is small, optimizing resource usage.
  • Optimized RoaringBitmaps before serialization.
  • Optimized inverted index posting lists to use the smallest possible representation.
  • Set a reasonable size for the marks cache for secondary indices by default.
  • Avoided unnecessary reconstruction of index granules when reading skip indexes.
  • Cached CAST function in set during execution to improve the performance of function IN when set element type doesn't match column type.
  • Improved write performance to EmbeddedRocksDB tables.
  • Improved overall resilience for ClickHouse in case of many parts within a partition.
  • Reduced memory consumption during loading of hierarchical dictionaries.
  • All dictionaries now support the setting dictionary_use_async_executor.
  • Prevented excessive memory usage when deserializing AggregateFunctionTopKGenericData.
  • Reduced CPU consumption for AsyncMetrics threads on a Keeper with lots of watches.
  • Experimental inverted indexes now do not store tokens with too many matches, saving space.
  • Improved write performance to EmbeddedRocksDB tables.
  • Improved write performance to hierarchical dictionaries.

0.15.0

01 Nov 14:33
f604de5
Compare
Choose a tag to compare
Update README.md

0.14.2

08 Sep 07:43
ab6ba9d
Compare
Choose a tag to compare
Update build_lib.yml

0.13.0

17 Aug 16:25
aa96984
Compare
Choose a tag to compare
Update build_lib_arm64.yml

0.11.5

09 Aug 21:38
f59c5ee
Compare
Choose a tag to compare
Update build_lib_arm64.yml