Improve documentation for scheduler states #1498

Merged
merged 3 commits into from Oct 26, 2017

pitrou
Member

@pitrou pitrou commented Oct 24, 2017

This categorizes the state variables into three groups, adds a table showing where tasks appear depending on their state, adds another table listing the worker state changes in each task transition, and improves some wording choices, in my POV :-)

@@ -2,7 +2,7 @@
#

# You can set these variables from the command line.
-SPHINXOPTS =
+SPHINXOPTS = -j4
Member Author

This will build docs in parallel.


* ``{'task': (inc, 1)}``: a tuple satisfying the dask graph protocol. This
Member Author

Putting 'task' there seemed to be a bug.
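
For readers unfamiliar with the phrase "a tuple satisfying the dask graph protocol", here is a minimal sketch of the idea: a task is a tuple whose first element is a callable and whose remaining elements are its arguments, so `(inc, 1)` is a task while a dict wrapping it is not. The helpers `is_task` and `evaluate`, and the use of string keys only, are assumptions made for this illustration and are not dask's own implementation.

```python
from operator import add


def inc(x):
    return x + 1


def is_task(obj):
    """Return True if obj looks like a task: a tuple led by a callable."""
    return isinstance(obj, tuple) and len(obj) > 0 and callable(obj[0])


def evaluate(obj, graph):
    """Evaluate a task, a key, or a literal against a graph (hypothetical helper).

    Assumption for this sketch: keys are strings.
    """
    if is_task(obj):
        func, *args = obj
        return func(*(evaluate(arg, graph) for arg in args))
    if isinstance(obj, str) and obj in graph:
        return evaluate(graph[obj], graph)
    return obj  # a plain literal value


# A graph maps keys to literals or tasks.
graph = {"x": 1, "y": (inc, "x"), "z": (add, "y", 10)}
result = evaluate("z", graph)  # inc(1) + 10
```

In this sketch the bare tuple `(inc, 1)` passes `is_task`, whereas `{'task': (inc, 1)}` is a dict and does not.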


A key's priority is only used to break ties, when many keys are being
considered for execution. The priority does *not* determine running order,
Member Author

"The priority does not determine running order" sounds wrong to me, otherwise what would it be used for? The goal is to start computing some tasks earlier than others...

Member

Yes, I suspect that my intent here was to say that priority does not fully determine running order. Many other concerns can take precedence. This probably isn't necessary to say though.
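
The distinction discussed here (priority influences order but does not fully determine it) can be sketched with a toy scheduler. This is not the distributed scheduler's algorithm: `run_order` is a hypothetical function, dependencies are the only competing concern modelled, and the convention that a lower priority value runs first is an assumption of the sketch.

```python
import heapq


def run_order(dependencies, priority):
    """Return an execution order for a toy single-worker scheduler.

    Only tasks whose dependencies are finished are candidates; among
    candidates, the one with the lowest priority value runs first.
    """
    remaining = {key: set(deps) for key, deps in dependencies.items()}
    ready = [(priority[key], key) for key, deps in remaining.items() if not deps]
    heapq.heapify(ready)
    order = []
    while ready:
        _, key = heapq.heappop(ready)
        order.append(key)
        # Finishing `key` may make its dependents ready.
        for other, deps in remaining.items():
            if key in deps:
                deps.remove(key)
                if not deps:
                    heapq.heappush(ready, (priority[other], other))
    return order


dependencies = {"a": [], "b": [], "c": ["b"]}
priority = {"a": 1, "b": 2, "c": 0}  # "c" has the best priority
order = run_order(dependencies, priority)
```

Here `"c"` has the best priority yet runs last, because it cannot start until `"b"` has finished; priority only decides between `"a"` and `"b"`, which are ready at the same time.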

processing → erred       occupancy, idle, used_resources
processing → released    occupancy, idle, used_resources
memory → released        worker_bytes
memory → forgotten       worker_bytes
Member

The task's entry in ``tasks`` is itself also forgotten, if you want to include this.

Member

I am actually a bit surprised that the right column of this table is not more dense. I would have expected more worker and task state to be affected. Ah, I see that you are only listing worker state, never mind.
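
As a reading aid, the four rows quoted above can be written as a lookup from a task transition to the worker state it touches. This is only a sketch of the excerpt shown in this review: the names `WORKER_STATE_CHANGES` and `affected_worker_state` are hypothetical, and the full table in the documentation contains more transitions than are listed here.

```python
# Rows copied from the excerpt above: (start, finish) -> affected worker state.
WORKER_STATE_CHANGES = {
    ("processing", "erred"): ("occupancy", "idle", "used_resources"),
    ("processing", "released"): ("occupancy", "idle", "used_resources"),
    ("memory", "released"): ("worker_bytes",),
    ("memory", "forgotten"): ("worker_bytes",),
}


def affected_worker_state(start, finish):
    """Return the affected worker state, or None if the row is not in the excerpt."""
    return WORKER_STATE_CHANGES.get((start, finish))


leaving_memory = affected_worker_state("memory", "forgotten")
```

Returning `None` for a missing row keeps "not listed in this excerpt" distinct from "no worker state affected".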

@mrocklin
Member

In general this looks great to me. I agree with all of the wording changes. Thank you for going through this.

@pitrou pitrou added the enhancement and documentation labels and removed the enhancement label Oct 24, 2017
@pitrou pitrou changed the title from "[WIP] Improve documentation for scheduler states" to "Improve documentation for scheduler states" Oct 25, 2017
@mrocklin
Member

I'm +1 on this PR and happy to merge any time.

@pitrou
Member Author

pitrou commented Oct 26, 2017

Cool, thank you :-)

@pitrou pitrou merged commit 6b8dae3 into dask:master Oct 26, 2017
@pitrou pitrou deleted the scheduler_state_docs branch October 26, 2017 11:26
@mrocklin
Member

mrocklin commented Oct 26, 2017 via email

Labels
documentation Improve or add to documentation
2 participants