[tune] Exception raised when there is no more trials #3069

old-bear · 2018-10-16T09:32:52Z

System information

OS Platform and Distribution (e.g., Linux Ubuntu 16.04): MacOS 10.11
Ray installed from (source or binary): binary
Ray version: 0.5.3
Python version: 2.7
Exact command to reproduce:

Describe the problem

For SeachAlgorithm, if next_trials returns empty and is_finished yields True. The following happens in TrialRunner:

   def step(self):
        ...
        next_trial = self._get_next_trial()      <----- no more trial, which yields None here
        if next_trial is not None:
            self.trial_executor.start_trial(next_trial)
        elif self.trial_executor.get_running_trials():      <--- all trials completes, so no more running trials
            self._process_events()
        else:
            ....
            raise TuneError("Called step when all trials finished?")    <--- reach here

Source code / logs

See above

For the solution, I think we can add

if not self._search_alg.is_finished():
    raise(...)

The text was updated successfully, but these errors were encountered:

richardliaw · 2018-10-16T15:41:20Z

What is the situation where this occurs? Do you have an example for reproducing?

…

On Tue, Oct 16, 2018 at 2:33 AM old-bear ***@***.***> wrote: System information - *OS Platform and Distribution (e.g., Linux Ubuntu 16.04)*: MacOS 10.11 - *Ray installed from (source or binary)*: binary - *Ray version*: 0.5.3 - *Python version*: 2.7 - *Exact command to reproduce*: Describe the problem For SeachAlgorithm, if next_trials returns empty and is_finished yields True. The following happens in TrialRunner: def step(self): ... next_trial = self._get_next_trial() <----- no more trial, which yields None here if next_trial is not None: self.trial_executor.start_trial(next_trial) elif self.trial_executor.get_running_trials(): <--- all trials completes, so no more running trials self._process_events() else: .... raise TuneError("Called step when all trials finished?") <--- reach here Source code / logs See above For the solution, I think we can add if not self._search_alg.is_finished(): raise(...) — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#3069>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AEUc5TDMU6F7o93IkEQSC7JmnLn8uqGaks5ulafMgaJpZM4Xd883> .

ericl · 2018-10-16T18:47:29Z

We already check search_alg.is_finished() in runner.is_finished(), so I don't think we should even be entering step() if the search algo is finished.

old-bear · 2018-10-17T02:21:57Z

The situation may be a little tricky:

class MySearchAlgorithm(SearchAlgorithm):
    def next_trials(self):
        # doing some calcuation
        # here is_finished is still False
        ... 
        # and decide we should stop here
        self._is_finished = true
        return []

    def is_finished(self):
        return self._is_finished

When using this algorithm, the check in tune.py before will pass:

    while not runner.is_finished():
        runner.step()     <-- run step here

, and thus trigger this problem.

As for the search algorithm, although we can move all the calculation into on_trial_result and just return the calculated ones in next_trials to avoid this problem, I think that would be quite unnatural

ericl · 2018-10-17T04:42:30Z

I see, that makes sense!

ericl added the bug Something that is supposed to be working; but isn't label Oct 17, 2018

richardliaw mentioned this issue Oct 18, 2018

[tune] Fix SearchAlg finishing early #3081

Merged

old-bear closed this as completed Oct 23, 2018

richardliaw reopened this Nov 5, 2018

richardliaw closed this as completed Nov 5, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[tune] Exception raised when there is no more trials #3069

[tune] Exception raised when there is no more trials #3069

old-bear commented Oct 16, 2018

richardliaw commented Oct 16, 2018 via email

ericl commented Oct 16, 2018

old-bear commented Oct 17, 2018

ericl commented Oct 17, 2018

[tune] Exception raised when there is no more trials #3069

[tune] Exception raised when there is no more trials #3069

Comments

old-bear commented Oct 16, 2018

System information

Describe the problem

Source code / logs

richardliaw commented Oct 16, 2018 via email

ericl commented Oct 16, 2018

old-bear commented Oct 17, 2018

ericl commented Oct 17, 2018