python-sorting-benchmarks

A companion repository for the blog post:
"I Implemented Every Sorting Algorithm in Python — The Results Nobody Talks About (Benchmarked on CPython)"
https://emitechlogic.com/sorting-algorithm-in-python/

This repo contains clean, from-scratch implementations of six classic sorting algorithms in pure Python, plus a robust benchmarking suite that measures their real-world performance on CPython.

The goal is to show why textbook Big-O analysis doesn't tell the full story in Python — constant factors, interpreter overhead, recursion costs, memory allocations, and garbage collection dominate practical performance.

Key Findings (from the blog post)

  • Bubble/Selection sort become unusable around 1,000–5,000 elements.
  • Insertion sort surprisingly wins on small (<100 elements) or nearly-sorted data.
  • Merge/Quick/Heap sort are decent but still 5–150× slower than Python's built-in sorted() (Timsort).
  • Python's built-in sort is untouchable — always use it in production.
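
As a quick illustration of the last two findings, a snippet along these lines compares one of the pure-Python implementations against sorted() using timeit. The import path and function name (sorts.insertion_sort.insertion_sort) are assumptions based on the repository layout shown below; adjust them to match the actual modules.

    # Rough timing comparison (assumed API: each sort returns a new sorted list).
    # The import path is an assumption based on the sorts/ package layout.
    import random
    import timeit

    from sorts.insertion_sort import insertion_sort

    data = [random.random() for _ in range(1_000)]

    t_custom = timeit.timeit(lambda: insertion_sort(data), number=10)
    t_builtin = timeit.timeit(lambda: sorted(data), number=10)

    print(f"insertion_sort: {t_custom:.4f}s  sorted(): {t_builtin:.4f}s  "
          f"ratio: {t_custom / t_builtin:.0f}x")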

Repository Structure

    python-sorting-benchmarks/
    ├── sorts/
    │   ├── __init__.py
    │   ├── bubble_sort.py
    │   ├── selection_sort.py
    │   ├── insertion_sort.py
    │   ├── merge_sort.py
    │   ├── quick_sort.py
    │   └── heap_sort.py
    ├── data_generator.py      # Functions to generate test datasets
    ├── benchmark.py           # Main benchmarking script
    ├── results_example.csv    # Sample output from my machine
    ├── README.md              # This file
    └── requirements.txt       # Empty — uses only stdlib

How to Run the Benchmarks

  1. Clone the repo:
    git clone https://github.com/Emmimal/python-sorting-benchmarks.git
    cd python-sorting-benchmarks
  2. Run the benchmarks:
    python benchmark.py

What This Benchmark Does

  • Tests all sorting algorithms on multiple dataset sizes and patterns
  • Results are:
    • Printed to the console
    • Saved to results.csv
  • Runtime: ~10–30 minutes on a standard laptop
    (Bubble sort on large datasets is intentionally slow)

You can adjust dataset sizes and input patterns in benchmark.py.
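
For reference, the core of such a benchmark loop looks roughly like the sketch below. The specific sizes, pattern names, generator logic, and sort registry are illustrative assumptions, not the exact contents of benchmark.py or data_generator.py.

    # Sketch of a benchmark loop over sizes and input patterns (not the actual
    # benchmark.py). Assumed helpers: a dict of sort functions and a simple
    # dataset generator; timings are written to results.csv.
    import csv
    import random
    import time

    from sorts.insertion_sort import insertion_sort  # assumed import path
    from sorts.merge_sort import merge_sort          # assumed import path

    SORTS = {"insertion": insertion_sort, "merge": merge_sort, "builtin": sorted}

    def make_data(size, pattern):
        base = list(range(size))
        if pattern == "random":
            random.shuffle(base)
        elif pattern == "reversed":
            base.reverse()
        return base  # "sorted" pattern: leave as-is

    with open("results.csv", "w", newline="") as f:
        writer = csv.writer(f)
        writer.writerow(["algorithm", "pattern", "size", "seconds"])
        for size in (100, 1_000, 5_000):
            for pattern in ("random", "sorted", "reversed"):
                data = make_data(size, pattern)
                for name, fn in SORTS.items():
                    start = time.perf_counter()
                    fn(list(data))  # copy so every sort sees identical input
                    elapsed = time.perf_counter() - start
                    writer.writerow([name, pattern, size, f"{elapsed:.6f}"])
                    print(f"{name:>10} {pattern:>8} n={size:<6} {elapsed:.6f}s")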


Environment (Used for Blog Post Results)

  • Python: CPython 3.11.4
  • OS: Ubuntu 22.04 LTS
  • CPU: Intel i5-1135G7
  • RAM: 16 GB

Your results may vary slightly due to hardware differences and garbage collection timing,
but relative performance trends should remain consistent.


Notes on Implementations

  • All sorting functions return a new sorted list
  • Input data is copied before sorting for fair benchmarking
  • In-place techniques are used internally where possible
  • Code is simple and readable
    • No micro-optimizations
    • Designed to expose real Python overhead
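
As an example of these conventions, a pure-Python insertion sort in this style might look like the sketch below. It is an illustration of the interface, not the repository's exact code: the input is copied, the copy is sorted in place, and the new list is returned.

    # Illustrative only: not the exact code in sorts/insertion_sort.py.
    def insertion_sort(data):
        """Return a new sorted list; the input sequence is left untouched."""
        result = list(data)  # copy so the caller's data is not mutated
        for i in range(1, len(result)):
            key = result[i]
            j = i - 1
            # Shift larger elements one slot right, then drop `key` into place.
            while j >= 0 and result[j] > key:
                result[j + 1] = result[j]
                j -= 1
            result[j + 1] = key
        return result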

License

MIT License — feel free to use, modify, or share.


Questions or Feedback?

Open an issue or leave a comment on the blog post.

Emmimal Alexander
@Emmimal on GitHub
