DOC: Replace .values with .to_numpy() in enhancingperf #26313

huizew · 2019-05-08T02:07:47Z

As suggested in pandas-dev#24807 (comment), replace `.values` with `.to_numpy()` in the benchmark demonstration code.
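
For illustration, the substitution looks roughly like the sketch below. This is not the guide's actual code: the frame `df`, its column names, and the helper `sum_of_products` are hypothetical stand-ins for the benchmark functions that expect plain NumPy arrays.

```python
import numpy as np
import pandas as pd

# Hypothetical data; the column names are illustrative only.
df = pd.DataFrame(
    {
        "a": np.arange(5, dtype="float64"),
        "b": np.arange(5, dtype="float64") + 1.0,
    }
)


def sum_of_products(x, y):
    """Hypothetical stand-in for a function that takes NumPy arrays."""
    return float(np.dot(x, y))


# Before: attribute access
old = sum_of_products(df["a"].values, df["b"].values)

# After: explicit method call, which is the change this PR makes
new = sum_of_products(df["a"].to_numpy(), df["b"].to_numpy())
```

For a plain float column both spellings should hand the same `ndarray` to the function, so the surrounding benchmark code does not need to change.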

WillAyd · 2019-05-08T02:45:49Z

Thanks for the PR! How much effort do you think it would be to swap out all of these instances across the documentation?

codecov · 2019-05-08T02:46:30Z

Codecov Report

Merging #26313 into master will decrease coverage by <.01%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master   #26313      +/-   ##
==========================================
- Coverage   92.04%   92.03%   -0.01%     
==========================================
  Files         175      175              
  Lines       52302    52302              
==========================================
- Hits        48142    48137       -5     
- Misses       4160     4165       +5

| Flag | Coverage Δ | |
|---|---|---|
| #multiple | `90.59% <ø> (ø)` | ⬆️ |
| #single | `40.73% <ø> (-0.17%)` | ⬇️ |

| Impacted Files | Coverage Δ | |
|---|---|---|
| pandas/io/gbq.py | `78.94% <0%> (-10.53%)` | ⬇️ |
| pandas/core/frame.py | `97.01% <0%> (-0.12%)` | ⬇️ |
| pandas/util/testing.py | `90.6% <0%> (-0.11%)` | ⬇️ |

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 20fa58d...05789ca. Read the comment docs.


huizew · 2019-05-08T02:57:53Z

@WillAyd All the `.values` usages in this file have been replaced.

I took a look at the other files under pandas/doc/source. Apart from the whatsnew folder, no other file uses `.values` on pandas objects. I suppose the whatsnew files shouldn't be changed, right?

Fix: after replacing `.values` with `.to_numpy()`, some lines were too long to pass the line-length check, so they have been shortened.

WillAyd

Great, thanks for checking. I think this looks good. @gfyoung, any thoughts?

gfyoung · 2019-05-08T04:16:49Z

For our own edification, can those benchmark numbers be double-checked (i.e. the ones that follow the timeit commands)? Are those still approximately correct when we swap in to_numpy?

And how much better is it vs. using `.values`?

huizew · 2019-05-08T05:35:43Z

Thanks for the review.

I just checked that changing to .to_numpy() doesn’t change the benchmark time very much (less than 1%), and all the points that this guide tries to demonstrate still hold.

The performance comparison between `.to_numpy()` and `.values` is not discussed in this guide page. The difference seems negligible compared to the other things the guide covers, such as Cython and JIT compilation. Personally, I think the main reason behind this PR is to follow the documentation/community’s encouragement to use `.to_numpy()` rather than `.values`.
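
A check along these lines can be reproduced with a small script. The following is only a sketch under assumed conditions (a hypothetical 1000-row float column, timed with the standard-library `timeit`), not the benchmark from the guide, and the numbers will vary by machine and pandas version.

```python
import timeit

import numpy as np
import pandas as pd

# Hypothetical benchmark frame; size and column name are assumptions.
df = pd.DataFrame({"a": np.random.randn(1000)})
s = df["a"]

# For a plain float column both accessors should yield equal arrays.
same = np.array_equal(s.values, s.to_numpy())

# Time only the accessor itself, not any downstream computation.
n = 10_000
t_values = timeit.timeit(lambda: s.values, number=n)
t_to_numpy = timeit.timeit(lambda: s.to_numpy(), number=n)

print(f".values:     {t_values / n * 1e6:.2f} µs per call")
print(f".to_numpy(): {t_to_numpy / n * 1e6:.2f} µs per call")
```

Since the accessor is called once per benchmark run, any per-call difference should be small next to the cost of the function being timed in the guide.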

gfyoung · 2019-05-08T06:32:29Z

> Personally I think the main reason behind this PR is to follow the documentation/community’s encouragement to use .to_numpy() rather than .values

True, but always good to double check to make sure we aren't actually proposing a performance regression in our docs.

gfyoung · 2019-05-08T06:32:53Z

Thanks @huizew !

DOC: Replace .values with .to_numpy()

bd2b70e

As suggested in pandas-dev#24807 (comment) Replace `.values` with `.to_numpy()` in the benchmark demonstration code.

huizew changed the title ~~DOC: Replace .values with .to_numpy()~~ DOC: Replace .values with .to_numpy() in enhancingperf May 8, 2019

DOC: Make lines shorter after using .to_numpy()

05789ca

Fix: after replacing .values with .to_numpy(), some lines are too long to pass the line-length check.

WillAyd added the Docs label May 8, 2019

WillAyd approved these changes May 8, 2019


WillAyd added this to the 0.25.0 milestone May 8, 2019

gfyoung merged commit 7bfbd81 into pandas-dev:master May 8, 2019

WillAyd mentioned this pull request May 8, 2019

Enhancingperf documentation updates #24807

Closed
