ARROW-4472: [Website][Python] Blog post about string memory use work in Arrow 0.12 #3553

wesm · 2019-02-04T17:49:49Z

This blog post shows how we were able to significantly improve performance and memory use in common cases when converting from the Arrow string memory layout to pandas's native memory model based on NumPy arrays of Python objects.

Change-Id: I6e87debdd41565bf8921ea05b997568aa4eb0fa3
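For readers of this thread, here is a rough illustration of why a memory model built on arrays of Python objects can be costly for string data. It is a minimal sketch and is not taken from the blog post or from Arrow: it uses only the standard library (`tracemalloc` instead of `memory_profiler` or `pyarrow`), the column contents are hypothetical, and reusing one object per distinct value is shown purely as an assumed example of the kind of saving that is possible.

```python
import tracemalloc


def measure(build):
    """Run build() and return (result, bytes still allocated afterwards)."""
    tracemalloc.start()
    before, _ = tracemalloc.get_traced_memory()
    result = build()
    after, _ = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return result, after - before


# Hypothetical column: 100,000 rows drawn from only 10 distinct values,
# stored as raw bytes (standing in for a compact string buffer).
N = 100_000
raw = [("category-%d" % (i % 10)).encode() for i in range(N)]


def naive():
    # Decoding creates one new Python str object per row, repeats included.
    return [b.decode() for b in raw]


def deduplicated():
    # Reuse a single str object for each distinct value.
    seen = {}
    out = []
    for b in raw:
        s = seen.get(b)
        if s is None:
            s = seen[b] = b.decode()
        out.append(s)
    return out


naive_values, naive_bytes = measure(naive)
dedup_values, dedup_bytes = measure(deduplicated)

print("naive:        %d bytes" % naive_bytes)
print("deduplicated: %d bytes" % dedup_bytes)
```

Both functions return equal lists of strings. The difference is only in how many distinct Python objects back them, which is what drives the gap in allocated bytes.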

wesm · 2019-02-04T18:03:45Z

Here's a published version: https://wesm.github.io/arrow-site-test/blog/2019/02/04/python-string-memory-0.12/

Test-publishing the website really isn't that easy, and a number of things are broken...

Change-Id: I6e8cd52878c06474ce45f4feb9713bb3f74cda53

fsaintjacques · 2019-02-05T02:45:13Z

LGTM

xhochy

Looks good, some small improvements but then this can go out tomorrow.

xhochy · 2019-02-05T06:43:42Z

site/_posts/2019-02-05-python-string-memory-0.12.md

+We can use the `memory_profiler` Python package to easily get process memory
+usage within a running Python application.
+
+```


Suggested change

```

```python

xhochy · 2019-02-05T06:45:01Z

site/_posts/2019-02-05-python-string-memory-0.12.md

+
+## Memory and Performance Benchmarks
+
+We can use the `memory_profiler` Python package to easily get process memory


Suggested change

We can use the `memory_profiler` Python package to easily get process memory

We can use the [`memory_profiler`][2] Python package to easily get process memory

xhochy · 2019-02-05T06:45:16Z

site/_posts/2019-02-05-python-string-memory-0.12.md

+provide fast and memory-efficient interoperability with pandas and other
+popular libraries.
+
+[1]: https://www.slideshare.net/xhochy/extending-pandas-using-apache-arrow-and-numba


Suggested change

[1]: https://www.slideshare.net/xhochy/extending-pandas-using-apache-arrow-and-numba

[1]: https://www.slideshare.net/xhochy/extending-pandas-using-apache-arrow-and-numba

[2]: https://pypi.org/project/memory-profiler/

Change-Id: I872f47f6ba079c6f06e1f520a0cb7747411c897a

wesm · 2019-02-05T15:07:44Z

Oops sorry I missed these edits @xhochy

…in Arrow 0.12

This blog post shows how we were able to significantly improve performance and memory use in common cases when converting from the Arrow string memory layout to pandas's native memory model based on NumPy arrays of Python objects.

Author: Wes McKinney <wesm+git@apache.org>

Closes #3553 from wesm/python-string-memory-0.12 and squashes the following commits:

f0d684d <Wes McKinney> Update publication date
2bbb92d <Wes McKinney> Fix some base urls
c624e55 <Wes McKinney> Draft blog post about string memory use work in Arrow 0.12

Draft blog post about string memory use work in Arrow 0.12

c624e55

Change-Id: I6e87debdd41565bf8921ea05b997568aa4eb0fa3

Fix some base urls

2bbb92d

Change-Id: I6e8cd52878c06474ce45f4feb9713bb3f74cda53

xhochy reviewed Feb 5, 2019

Update publication date

f0d684d

Change-Id: I872f47f6ba079c6f06e1f520a0cb7747411c897a

wesm closed this in 9af5a70 Feb 5, 2019

wesm deleted the python-string-memory-0.12 branch February 5, 2019 15:07

asfimport mentioned this pull request Feb 5, 2019

[Website][Python] Blog post about Python string memory use improvements in 0.12 #21029

Closed
