Doc: Adds example of exploding lists into columns instead of storing in dataframe cells #19215

pdpark · 2018-01-12T23:56:49Z

closes DOC: section on caveats of storing lists inside DataFrame/Series #17027

codecov · 2018-01-13T05:08:57Z

Codecov Report

Merging #19215 into master will increase coverage by 0.02%.
The diff coverage is n/a.

@@            Coverage Diff             @@
##           master   #19215      +/-   ##
==========================================
+ Coverage   91.53%   91.55%   +0.02%     
==========================================
  Files         147      147              
  Lines       48797    48797              
==========================================
+ Hits        44664    44676      +12     
+ Misses       4133     4121      -12

| Flag | Coverage Δ | |
|---|---|---|
| #multiple | `89.92% <ø> (+0.02%)` | ⬆️ |
| #single | `41.6% <ø> (ø)` | ⬆️ |

| Impacted Files | Coverage Δ | |
|---|---|---|
| pandas/plotting/_converter.py | `66.95% <0%> (+1.73%)` | ⬆️ |

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 8347ff8...11ff8a7.

jreback · 2018-01-13T18:26:06Z

doc/source/gotchas.rst

+
+
+Alternative to storing lists in DataFrame Cells
+------------------------------------------------------


The underline needs to be the same length as the title.
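
One way to avoid a mismatched underline is to generate it from the title. This is only a sketch: `rst_heading` is a hypothetical helper, not part of pandas or Sphinx, and it assumes the section uses `-` as its underline character, as in the diff above.

```python
def rst_heading(title: str, char: str = "-") -> str:
    """Return the title followed by an underline of identical length."""
    return f"{title}\n{char * len(title)}"


title = "Alternative to storing lists in DataFrame Cells"
heading = rst_heading(title)
print(heading)
```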

jreback · 2018-01-13T18:26:57Z

doc/source/gotchas.rst

+
+   nearest_neighbors = [['Zach LaVine', 'Jeremy Lin', 'Nate Robinson', 'Isaia']]*3
+   nearest_neighbors
+


Split this into separate `ipython:: python` blocks rather than using comments. You can write the explanation as plain text between the blocks, without the `#`.

jreback · 2018-01-13T18:27:51Z

doc/source/gotchas.rst

+   nearest_neighbors
+
+   #. Create an index with the "parent" columns to be included in the final Dataframe
+   df2 = pd.concat([df[['name','opponent']], pd.DataFrame(nearest_neighbors)], axis=1)


you don't need to keep naming the dataframes, just use

df = ..... or whatever
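
A sketch of what that suggestion could look like for the `pd.concat` step in the diff. The `name` and `opponent` values are placeholders invented for illustration; the PR's actual parent frame is not shown in this thread. The `nearest_neighbors` list is copied from the diff as written.

```python
import pandas as pd

# Copied verbatim from the diff under review.
nearest_neighbors = [['Zach LaVine', 'Jeremy Lin', 'Nate Robinson', 'Isaia']] * 3

# Hypothetical parent frame; the column names come from the diff,
# the values are placeholders.
df = pd.DataFrame({
    'name': ['player_a', 'player_b', 'player_c'],
    'opponent': ['team_x', 'team_y', 'team_z'],
})

# Rebind ``df`` rather than introducing ``df2``: the "parent" columns are
# kept and the nested lists become columns 0..3 alongside them.
df = pd.concat([df[['name', 'opponent']], pd.DataFrame(nearest_neighbors)], axis=1)
print(df)
```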

jreback · 2018-01-13T18:28:25Z

doc/source/gotchas.rst

+------------------------------------------------------
+Storing nested lists/arrays inside a pandas object should be avoided for performance and memory use reasons. Instead they should be "exploded" into a flat ``DataFrame`` structure.
+
+Example of exploding nested lists into a DataFrame:


since you have 2 examples you can use another level of sub-section
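
For readers following the thread, here is a minimal sketch of what "exploding" nested lists into a flat `DataFrame` can look like. It is not the code from the PR: the frame, the column names `name` and `neighbors`, and the `neighbor_` prefix are assumptions made for illustration, and it assumes every list has the same length.

```python
import pandas as pd

# Hypothetical data: each cell of "neighbors" holds a Python list.
df = pd.DataFrame({
    "name": ["a", "b"],
    "neighbors": [["x", "y", "z"], ["p", "q", "r"]],
})

# Wide form: one column per list position, aligned on the original index.
exploded = pd.DataFrame(df["neighbors"].tolist(), index=df.index).add_prefix("neighbor_")
wide = pd.concat([df[["name"]], exploded], axis=1)

# Long form: one row per (name, list element) pair.
tidy = wide.melt(id_vars="name", var_name="position", value_name="neighbor")

print(wide)
print(tidy)
```

The wide form suits lists of fixed length; the long form is usually easier to filter and group.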

jreback · 2018-02-24T17:26:14Z

can you update

jreback · 2018-08-02T17:33:09Z

can you rebase and update

pdpark · 2018-08-31T01:15:06Z

Will do - have been absent due to starting new job, but plan to spend some time on this.

datapythonista · 2018-10-09T04:47:45Z

Closing as discontinued. Superseded by #23041

pdpark added 2 commits January 12, 2018 15:01

DOC: Adds example of alternative to storing lists in a Dataframe

e91444e

Restores: pandas-dev#17027

Doc: Fixes issues with code examples.

11ff8a7

pdpark mentioned this pull request Jan 12, 2018

Doc: Added warning to treat group chunks as immutable when using apply #19114

Closed

1 task

jreback added the Docs label Jan 13, 2018

jreback requested changes Jan 13, 2018


mgautam98 mentioned this pull request Oct 8, 2018

Doc: Adds example of exploding lists into columns instead of storing in dataframe cells #23041

Closed

1 task

datapythonista closed this Oct 9, 2018
