1 Billion Row Challenge in Python. No libs, just std.
This repository exists for a few reasons:
- I want to challenge myself to see how efficiently I can process large datasets using only the Python standard library.
- I want to practice profiling Python code.
- I just want to have fun with Python :)
I'll post my thoughts and findings in this README as I progress through the challenge. I think a cool blog post could come out of it at the end.
OK, we now have a straightforward implementation: read the file row by row and keep min, max, sum, and count per station in a dict. On 10 million rows it takes 7.19 seconds. Not great — at 1 billion rows it will be painfully slow. Let's get our hands dirty and find out where the time goes.
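The baseline described above can be sketched roughly like this. Note the assumptions: the standard 1BRC input format of one `station;temperature` pair per line, and a file named `measurements.txt` — neither name is fixed by this README.

```python
# Baseline sketch: stream the file line by line, keep per-station
# [min, max, sum, count] in a plain dict. Stdlib only.
# Assumes 1BRC-style input: "station;temperature" per line (an assumption).

def process(path: str) -> dict[str, tuple[float, float, float, int]]:
    stats: dict[str, list[float]] = {}  # station -> [min, max, sum, count]
    with open(path, "r", encoding="utf-8") as f:
        for line in f:
            station, _, value = line.partition(";")
            temp = float(value)
            entry = stats.get(station)
            if entry is None:
                stats[station] = [temp, temp, temp, 1]
            else:
                if temp < entry[0]:
                    entry[0] = temp  # new minimum
                if temp > entry[1]:
                    entry[1] = temp  # new maximum
                entry[2] += temp    # running sum (mean = sum / count)
                entry[3] += 1       # observation count

    return {s: (e[0], e[1], e[2], int(e[3])) for s, e in stats.items()}


if __name__ == "__main__":
    for station, (mn, mx, total, n) in sorted(process("measurements.txt").items()):
        print(f"{station}: min={mn:.1f} mean={total / n:.1f} max={mx:.1f}")
```

To see where the time goes, a first pass with the stdlib profiler is enough, e.g. `python -m cProfile -s cumulative main.py` (assuming the script is called `main.py`).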