Implement Series.item #1502

itholic · 2020-05-16T20:52:16Z

This PR proposes `Series.item`, which returns the only element of a one-element Series as a Python scalar.

>>> kser = ks.Series([10])
>>> kser.item()
10
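
The error message in the diff under review suggests that a Series whose size is not exactly one should raise rather than return a value. A minimal sketch of that contract in plain Python follows; the `item` helper over a list is hypothetical and only illustrates the expected behavior, it is not the Koalas implementation.

```python
def item(values):
    """Return the only element of `values`, or raise if its size is not 1.

    Hypothetical stand-in for the Series.item contract; not the Koalas code.
    """
    if len(values) != 1:
        raise ValueError("can only convert an array of size 1 to a Python scalar")
    return values[0]


print(item([10]))  # single element: returned as a scalar

try:
    item([10, 20])  # two elements: rejected
except ValueError as exc:
    print(exc)
```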

codecov-io · 2020-05-16T21:15:46Z

Codecov Report

Merging #1502 into master will decrease coverage by 0.03%.
The diff coverage is 100.00%.

@@            Coverage Diff             @@
##           master    #1502      +/-   ##
==========================================
- Coverage   93.93%   93.90%   -0.04%     
==========================================
  Files          36       36              
  Lines        8445     8448       +3     
==========================================
  Hits         7933     7933              
- Misses        512      515       +3

Impacted Files	Coverage Δ
databricks/koalas/missing/series.py	`100.00% <ø> (ø)`
databricks/koalas/base.py	`98.02% <100.00%> (ø)`
databricks/koalas/series.py	`96.98% <100.00%> (+0.01%)`	⬆️
databricks/koalas/frame.py	`95.43% <0.00%> (-0.15%)`	⬇️

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 6fbe6d0...17f9dfc.

ueshin · 2020-05-19T19:52:36Z

databricks/koalas/series.py

+        item_top_two = self[:2]
+        if len(item_top_two) != 1:
+            raise ValueError("can only convert an array of size 1 to a Python scalar")
+        return item_top_two[0]


Seems like this still runs Spark jobs twice? We should explicitly call to_pandas() or collect()?

itholic · May 19, 2020

Ah, I'll check and fix it. Thanks!

HyukjinKwon · May 20, 2020

@itholic, let's be very careful about this next time; this isn't a trivial mistake. This one-line mistake makes it two times slower. Thanks for pointing this out, @ueshin.

itholic · May 20, 2020

@HyukjinKwon
Yeah, I should have considered it much more carefully.
Thanks for reminding me once again! I'll keep that in mind.

Following the review comment on #1502 (comment), fixed `Series.item` to run a Spark job once instead of twice.
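
To illustrate why the reviewed snippet costs two jobs, here is a toy sketch in plain Python. `LazySeries`, its `head` and `collect` methods, and the job counter are all hypothetical; they only mimic the idea that every materialization of a lazy, distributed Series triggers a separate job. It is not the Koalas code and not necessarily how #1518 fixed it.

```python
class LazySeries:
    """Toy stand-in for a lazy Series: each materialization counts as one job."""

    def __init__(self, data, counter=None):
        self._data = list(data)
        # The counter is shared with derived series so every job is tallied.
        self.counter = counter if counter is not None else {"jobs": 0}

    def head(self, n):
        # Lazy: only describes the first n rows, runs nothing.
        return LazySeries(self._data[:n], self.counter)

    def collect(self):
        # Eager: every call counts as one job.
        self.counter["jobs"] += 1
        return list(self._data)

    def __len__(self):
        return len(self.collect())

    def __getitem__(self, index):
        return self.collect()[index]


def item_two_jobs(series):
    top_two = series.head(2)
    if len(top_two) != 1:  # first job
        raise ValueError("can only convert an array of size 1 to a Python scalar")
    return top_two[0]  # second job


def item_one_job(series):
    top_two = series.head(2).collect()  # the only job; a plain list from here on
    if len(top_two) != 1:
        raise ValueError("can only convert an array of size 1 to a Python scalar")
    return top_two[0]


slow, fast = LazySeries([10]), LazySeries([10])
print(item_two_jobs(slow), slow.counter["jobs"])
print(item_one_job(fast), fast.counter["jobs"])
```

Fetching at most two rows is enough to tell a one-element Series from a larger one, so collecting that small result once and doing both the length check and the lookup locally avoids the second job.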

Implemented Series.item

800edad

itholic changed the title ~~Implemented Series.item~~ Implement Series.item May 16, 2020

HyukjinKwon reviewed May 18, 2020

databricks/koalas/series.py (outdated)

HyukjinKwon reviewed May 18, 2020

databricks/koalas/series.py (outdated)

HyukjinKwon approved these changes May 18, 2020

itholic added 2 commits May 18, 2020 16:21

python -> Python

873eeca

Collect 2 items

17f9dfc

HyukjinKwon reviewed May 18, 2020

databricks/koalas/series.py (outdated)

Reuse self[:2]

56c71f0

HyukjinKwon merged commit 8c0f135 into databricks:master May 19, 2020

ueshin reviewed May 19, 2020

itholic mentioned this pull request May 20, 2020

Fix Series.item to run a Spark job once #1518

Merged

HyukjinKwon pushed a commit that referenced this pull request May 21, 2020

Fix Series.item to run a Spark job once (#1518)

bdd34ed

Following the review comment on #1502 (comment), fixed `Series.item` to run a Spark job once instead of twice.

itholic deleted the s_item branch May 29, 2020 00:50

rising-star92 added a commit to rising-star92/databricks-koalas that referenced this pull request Jan 27, 2023

Fix Series.item to run a Spark job once (#1518)

394c97d

Following the review comment on databricks/koalas#1502 (comment), fixed `Series.item` to run a Spark job once instead of twice.


Review thread on databricks/koalas/series.py: ueshin (May 19, 2020), itholic (May 19, 2020), HyukjinKwon (May 20, 2020, edited), itholic (May 20, 2020)
