Docs dataframe joins #4569

Merged
merged 4 commits into from Mar 12, 2019

@mrocklin mrocklin commented Mar 8, 2019

  • Tests added / passed
  • Passes flake8 dask
Large to Small Joins
--------------------

Many join or merge computations combine a large tqble with one small one. If

@TomAugspurger TomAugspurger Mar 8, 2019

typo: "tqble"


.. code-block:: python
small.repartition(npartitions=1)

@TomAugspurger TomAugspurger Mar 8, 2019

small = small.repartition(npartitions=1)?


The Pandas merge API supports the ``left_index=`` and ``right_index=`` options
to perform joins on the index. For Dask DataFrames these keyword options hold
special significance if the index has known divisions (see :doc:`dataframe

@TomAugspurger TomAugspurger Mar 8, 2019

If you add a label above Partitions in dataframe-design like

.. _dataframe-design-partitions:

Partitions
----------

you'll be able to reference it with :ref:`dataframe-design-partitions`, I think, to link directly to the relevant section.


@jrbourbeau jrbourbeau left a comment

Couple of minor nitpicks, otherwise LGTM. Thank you for adding this @mrocklin, it reads well

Co-Authored-By: mrocklin <mrocklin@gmail.com>
@jrbourbeau jrbourbeau merged commit ebcc44e into dask:master Mar 12, 2019

@jrbourbeau jrbourbeau commented Mar 12, 2019

Thanks!

jorge-pessoa pushed a commit to jorge-pessoa/dask that referenced this issue May 14, 2019
* [skip ci] Add docs for random array creation (dask#4566)

* [skip ci] Move categoricals outside of design doc

* respond to feedback

* [skip ci] Apply suggestions from code review

Co-Authored-By: mrocklin <mrocklin@gmail.com>