Skip to content

platonoff-dev/1brc-py-std

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

1brc-py-std

1 Billion Row Challenge in Python. No libs, just std.

This repository follows few reasons:

  1. I want to challenge myself to see how efficiently I can process large datasets using only the Python standard library.
  2. I want to practice profiling python
  3. I want just to have fun with python :)

I'll post my thoughts and findings in the README as I progress through the challenge. I think cool blog post can be written at the end of the challenge.

Straight forward implementation

Ok, we got straight forward implementation now. Just read csv row by row, save min, max sum, count in dict. On 10 million items we got 7.19 seconds. Not very good, on 1 billion it will be painfully slow. Let's get our hands dirty and find what takes this time.

About

1 Billion Row Challenge in Python. No libs, just std.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages