Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add a parallel algorithm to improve the performance of TPE with large concurrency #1052

Closed
PurityFan opened this issue May 6, 2019 · 3 comments
Assignees
Labels

Comments

@PurityFan
Copy link
Contributor

What would you like to be added: When the number of concurrencies is large, the points selected by the TPE will be concentrated, we will add excellent parallel algorithms to solve this problem.

Why is this needed: Better usage of computing resource

Components that may involve changes: A new algorithm to handle parallel TPE and related doc

@Crysple
Copy link
Contributor

Crysple commented Jul 7, 2019

I've run experiments on some test functions with different experimental configuration ( trial concurrency, constant_liar_type etc.).

Experiment on Ackley Function

Ackley Function

  • There are only two args in the Ackley Function. Denote them as x and y, then the function is:
    image

  • Its global optimum point is:

    image

  • An image on it might be:

image

Experiment

Setup

Two contrast experiments: Exp 1 with Exp 2 and Exp 3 with Exp 4 to see if there's any improvement.

Max Trial Num Trial Concurrency Parallel_Optimize Constant_liar
Exp 1 160 20 True min
Exp 2 160 20 False (original TPE) /
Exp 3 400 50 True min
Exp 4 400 50 False (original TPE) /

Result

Exp 1 and Exp 2

Exp 3 and Exp 4

In this experiment on Ackley Function, TPE in parallel mode tends to explore more and find a better 'best result'.
Besides, It can be seen that in the new version of TPE, the distribution of the sampled points is more even

Strange Phenomenon

With parallel optimize mode on, starting from the second trial, TPE will generate several trial configurations that are totally the same. This might be a potential bug.

image

image

Experiment on Rastrigin Function

Rastrigin Function

  • On an n-dimensional domain it is defined by:

    image

    where A=10 and x_{i} in [-5.12,5.12]. It has a global minimum at X = 0 where f(X)=0.

  • Plot on this function

image

Experiment

Setup

Max Trial Num Trial Concurrency Parallel_Optimize Constant_liar
Exp 1 400 20 True min
Exp 2 400 20 False (original TPE) /
Exp 3 1000 20 True min
Exp 4 1000 20 False (original TPE) /
Exp 5 1000 50 True min
Exp 6 1000 50 False (original TPE) /

Result

Exp 1 and Exp 2

image
image

Exp 3 and Exp 4

image

image

Exp 5 and Exp 6

  • Exp 5
    image
    image

  • Exp 6
    image
    image

@scarlett2018
Copy link
Member

scarlett2018 commented Jul 10, 2019

  • Need an option to switch between parallel model and non-parallel model

@suiguoxin
Copy link
Member

solved in #1373

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

4 participants