v1.2.4
This release has several performance optimizations.
- The most important one is in the internal filter algorithm, which now uses vector instead of priority_queue which gives it around 30% speedup in some benchmarks
- Now, the number of threads used in the filter algorithm is automatically detected based on the actual threads of the machine.
The benchmark scripts are also improved to use better measurements and printing.