-
-
Notifications
You must be signed in to change notification settings - Fork 33.1k
Open
Labels
interpreter-core(Objects, Python, Grammar, and Parser dirs)(Objects, Python, Grammar, and Parser dirs)performancePerformance or resource usagePerformance or resource usagetype-featureA feature request or enhancementA feature request or enhancement
Description
Feature or enhancement
Proposal:
Adding adaptivity to binarysort
routine of list.sort
.
See PR for specifics.
Initial Post (Outdated)
Note:
- This is POC that this has a observable impact.
- This is subject to further calibration and optimizations, but high level concept is dicusable.
- Will issue PR shortly.
Rationale
Galloping provides adaptivity when merging runs.
However, underlying binarysort
always does O(nlogn)
.
Concept
The concept is simple:
binarysort
optionally does adaptive routine.- It has a mechanism to switch it off during the run and go to simple binarysort
- It returns 1 if it has completed full data with binary routine and 0 otherwise
timsort
callsbinarysort
- If it returns 1, then use it again next time
- If it returns 0, then use
binarysort
without adaptivity next time - increase number of simple
binarysort
runs before next attempt of adaptive run with every 0 returned
High level results
A0
is currentA1
is with adaptivityP
means performance / runtimeC
means comparison count- Aggregate numbers for all datasets combined are in the title
- Plain integers and floats

- Integers and floats wrapped in
list
, so comparisons are__lt__
calls, thus more expensive

So the benefit is higher when comparisons cost more.
But there is some benefit for optimized comparisons as well.
Has this already been discussed elsewhere?
I have already discussed this feature proposal on Discourse
Links to previous discussion of this feature:
https://discuss.python.org/t/sorting-adaptivity-improvement/103700
Linked PRs
Metadata
Metadata
Assignees
Labels
interpreter-core(Objects, Python, Grammar, and Parser dirs)(Objects, Python, Grammar, and Parser dirs)performancePerformance or resource usagePerformance or resource usagetype-featureA feature request or enhancementA feature request or enhancement