Make PyArrow cpu_count to correspond NUMBA_NUM_THREADS #349

PokhodenkoSA · 2019-11-28T13:32:45Z

This PR makes PyArrow treads count equal to Numba threads count.
I do not know which threading layer is used by PyArrow. it is possible that there is no intersection with Numba layers.
By default PyArrow uses all available cpus for reading.

sdc/io/csv_ext.py

AlexanderKalistratov

LGTM

sdc/io/csv_ext.py

sdc/tests/tests_perf/test_perf_read_csv.py

pep8speaks · 2019-12-03T10:08:14Z

Hello @PokhodenkoSA! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

In the file sdc/tests/tests_perf/gen_csv.py:

Line 76:5: E722 do not use bare 'except'
Line 81:9: E128 continuation line under-indented for visual indent

Comment last updated at 2019-12-11 08:00:26 UTC

sdc/tests/tests_perf/test_perf_read_csv.py

PokhodenkoSA · 2019-12-03T14:21:51Z

This PR looks very big. I will split it to many smaller ones with dedicated improvements. Now I am using the PR for preparing benchmark report only.

Add pyarrow_cpu_count context manager which always returns cpu_count to previous value.

Use single config data in all places Add PyArrow benchmark record read_csv data size (1m,10) Show size [rows,cols] Implement data file caching and move functions for generating to gen_csv.py

Hardcode84 reviewed Nov 28, 2019

View reviewed changes

sdc/io/csv_ext.py Outdated Show resolved Hide resolved

AlexanderKalistratov approved these changes Nov 28, 2019

View reviewed changes

densmirn reviewed Nov 29, 2019

View reviewed changes

sdc/io/csv_ext.py Outdated Show resolved Hide resolved

sdc/io/csv_ext.py Show resolved Hide resolved

densmirn reviewed Nov 29, 2019

View reviewed changes

sdc/io/csv_ext.py Outdated Show resolved Hide resolved

densmirn added the Ready for Review label Nov 29, 2019

densmirn reviewed Dec 3, 2019

View reviewed changes

sdc/tests/tests_perf/test_perf_read_csv.py Outdated Show resolved Hide resolved

densmirn reviewed Dec 3, 2019

View reviewed changes

sdc/tests/tests_perf/test_perf_read_csv.py Outdated Show resolved Hide resolved

PokhodenkoSA added 5 commits December 10, 2019 17:26

Make PyArrow cpu_count to correspond NUMBA_NUM_THREADS

cf19919

Add pyarrow_cpu_count context manager which always returns cpu_count to previous value.

Improve benchmark test for read_csv()

d0dfc79

Use single config data in all places Add PyArrow benchmark record read_csv data size (1m,10) Show size [rows,cols] Implement data file caching and move functions for generating to gen_csv.py

Use functools.wraps

7e91ca2

Use contextlib.contextmanager

225cbec

Merge branch 'master' into pyarrow-cpu_count

063cc85

PokhodenkoSA merged commit 76d588d into IntelPython:master Dec 11, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Make PyArrow cpu_count to correspond NUMBA_NUM_THREADS #349

Make PyArrow cpu_count to correspond NUMBA_NUM_THREADS #349

Uh oh!

PokhodenkoSA commented Nov 28, 2019 •

edited

Loading

Uh oh!

Uh oh!

AlexanderKalistratov left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pep8speaks commented Dec 3, 2019 •

edited

Loading

Uh oh!

Uh oh!

PokhodenkoSA commented Dec 3, 2019 •

edited

Loading

Uh oh!

Uh oh!

Make PyArrow cpu_count to correspond NUMBA_NUM_THREADS #349

Make PyArrow cpu_count to correspond NUMBA_NUM_THREADS #349

Uh oh!

Conversation

PokhodenkoSA commented Nov 28, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

AlexanderKalistratov left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

pep8speaks commented Dec 3, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Comment last updated at 2019-12-11 08:00:26 UTC

Uh oh!

Uh oh!

PokhodenkoSA commented Dec 3, 2019 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

PokhodenkoSA commented Nov 28, 2019 •

edited

Loading

pep8speaks commented Dec 3, 2019 •

edited

Loading

PokhodenkoSA commented Dec 3, 2019 •

edited

Loading