[MRG] REPS / C-REPS analytical gradient benchmarks #81

RicardoDominguez · 2018-08-13T19:38:49Z

Benchmarks as suggested in #70

AlexanderFabisch · 2018-08-13T21:06:27Z

benchmarks/REPS_CREPS_gradients/creps.py

@@ -0,0 +1,344 @@
+# Authors: Jan Hendrik Metzen <jhm@informatik.uni-bremen.de>
+#          Alexander Fabisch <afabisch@informatik.uni-bremen.de>


You can add yourself here, also in the other files.

AlexanderFabisch · 2018-08-13T21:07:39Z

benchmarks/REPS_CREPS_gradients/creps.py

+from bolero.utils.log import get_logger
+
+
+def solve_dual_contextual_reps(S, R, epsilon, min_eta, approx_grad = True):


Please use PEP8:

def solve_dual_contextual_reps(S, R, epsilon, min_eta, approx_grad=True):

AlexanderFabisch · 2018-08-13T21:24:38Z

benchmarks/REPS_CREPS_gradients/creps.py

+    return d, eta, nu
+
+
+class CREPSOptimizer(ContextualOptimizer):


The problem with copying the whole file is that if there are any changes in (c)reps in the future, you have to copy them to the benchmark. You should make a subclass of CREPSOptimizer that overrides set_evaluation_feedback. The same should be done for the REPSOptimizer.

AlexanderFabisch · 2018-08-13T21:34:47Z

benchmarks/REPS_CREPS_gradients/reps_benchmark.py

+	for i in range(n_trials):
+		r = eval_loop(Rosenbrock, opt, n_dims, n_iter)
+	total_time = time.time() - start_time
+	print name, 'completed in average time of', round(total_time / n_trials, 2), 'seconds'


We try to be compatible to Python 3 soon. For example:

print("%s: %s %.2f %s" % (name, 'completed in average time of', round(total_time / n_trials, 2), 'seconds'))

AlexanderFabisch · 2018-08-13T21:35:44Z

benchmarks/REPS_CREPS_gradients/reps_benchmark.py

+	total_time = time.time() - start_time
+	print name, 'completed in average time of', round(total_time / n_trials, 2), 'seconds'
+	rwds = -np.maximum.accumulate(r)
+	print name, 'minimum found', rwds[-1]


Please use parentheses for prints.

AlexanderFabisch · 2018-08-13T21:36:50Z

benchmarks/REPS_CREPS_gradients/reps_benchmark.py

+x = np.zeros(n_dims)
+
+optimizers = {
+    "Numerical gradient": REPSOptimizer(x, random_state=0, approx_grad = True),


AlexanderFabisch · 2018-08-13T21:38:18Z

bolero/optimizer/reps.py

@@ -49,15 +49,21 @@ def solve_dual_reps(R, epsilon, min_eta):

    # Definition of the dual function
    def g(eta):  # Objective function
-        return eta * epsilon + eta * logsumexp(R / eta, b=1.0 / len(R))


These changes should already be in master, please rebase:

git fetch <remote> git rebase <remote>/master # resolve conflicts if there are any

AlexanderFabisch · 2018-08-13T21:40:39Z

benchmarks/REPS_CREPS_gradients/reps.py

+
+    # Perform the actual optimization of the dual function
+    if approx_grad:
+		r = fmin_l_bfgs_b(g, x0, approx_grad=True, bounds=bounds)


Please don't mix tabs and spaces.

AlexanderFabisch · 2018-08-13T21:43:26Z

benchmarks/REPS_CREPS_gradients/creps_benchmark.py

+
+def show_results(results):
+    """Display results."""
+    for objective_name, objective_results in results.items():


You cannot recognize any difference between both curves. Maybe there is a better way, for example, using a dashed and a solid line.

I have used a dashed line so that both lines can be seen, but there is no difference between both curves as the solutions found are the same.

Yes, looks better now!

RicardoDominguez · 2018-08-14T11:54:24Z

I am not sure if I have rebased properly, I am getting the feeling that I haven't.

RicardoDominguez · 2018-08-14T18:39:38Z

I have definitely messed up with the rebase, please ignore until I fix it.

AlexanderFabisch · 2018-08-14T19:03:29Z

I have definitely messed up with the rebase, please ignore until I fix it.

The diff looks good but it says I cannot merge. Let me know if you are not able to solve this problem. Maybe this helps?

RicardoDominguez · 2018-08-15T10:19:05Z

@AlexanderFabisch I think the rebase is correct now, let me know if I have missed anything :)

AlexanderFabisch · 2018-08-15T14:18:09Z

flake8 gave me the following output:

flake8 *.py
creps_benchmark.py:16:1: E122 continuation line missing indentation or outdented
creps_benchmark.py:23:1: E302 expected 2 blank lines, found 1
creps_benchmark.py:40:1: E305 expected 2 blank lines after class or function definition, found 1
creps_benchmark.py:41:13: E203 whitespace before ':'
creps_benchmark.py:74:1: E302 expected 2 blank lines, found 1
creps_benchmark.py:91:80: E501 line too long (86 > 79 characters)
creps_benchmark.py:94:1: E302 expected 2 blank lines, found 1
creps_benchmark.py:101:80: E501 line too long (85 > 79 characters)
creps_benchmark.py:126:1: E302 expected 2 blank lines, found 1
creps_numerical.py:51:21: E128 continuation line under-indented for visual indent
creps_numerical.py:59:5: E265 block comment should start with '# '
reps_benchmark.py:18:1: E302 expected 2 blank lines, found 1
reps_benchmark.py:29:1: E305 expected 2 blank lines after class or function definition, found 1
reps_numerical.py:47:1: E101 indentation contains mixed spaces and tabs
reps_numerical.py:47:1: W191 indentation contains tabs
reps_numerical.py:49:1: E101 indentation contains mixed spaces and tabs

Mixed tabs and spaces are critical. That should not go to the master.

AlexanderFabisch · 2018-08-15T14:18:55Z

Looks good to me apart from that!

AlexanderFabisch · 2018-08-15T14:20:42Z

If you zoom in you can really see that the analytical version is performing consistently better.

RicardoDominguez · 2018-08-15T15:01:59Z

Mixed tabs and spaces are critical. That should not go to the master.

Sorry about that, I thought my editor picked those things up. Now flake8 doesn't complain.

AlexanderFabisch · 2018-08-15T16:16:16Z

Great, thanks for the contribution!

RicardoDominguez · 2018-08-15T17:50:18Z

My pleasure!

AlexanderFabisch requested changes Aug 13, 2018

View reviewed changes

AlexanderFabisch changed the title ~~REPS / C-REPS analytical gradient benchmarks~~ [MRG] REPS / C-REPS analytical gradient benchmarks Aug 13, 2018

RicardoDominguez added 11 commits August 15, 2018 11:05

Fix REPS benchmark dependencies

867cf72

Merge CREPS tests

d83048f

Merged REPS tests

cb02031

Print min/max found

1645d25

Improved CREPS benchmark

9116598

Added readme and figures

5960489

Benchmark CREPS inherit from optimizer CREPS

af586e9

Benchmark REPS inherit from optimizer REPS

ed2bf03

Add myself as author

151595b

Fix string compatibility

9fe0db2

Improve figures

7e3867d

AlexanderFabisch approved these changes Aug 15, 2018

View reviewed changes

Fix flake8

c35a53c

AlexanderFabisch merged commit a52ce2b into rock-learning:master Aug 15, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MRG] REPS / C-REPS analytical gradient benchmarks #81

[MRG] REPS / C-REPS analytical gradient benchmarks #81

RicardoDominguez commented Aug 13, 2018

AlexanderFabisch Aug 13, 2018

AlexanderFabisch Aug 13, 2018

AlexanderFabisch Aug 13, 2018

AlexanderFabisch Aug 13, 2018

AlexanderFabisch Aug 13, 2018

AlexanderFabisch Aug 13, 2018

AlexanderFabisch Aug 13, 2018

AlexanderFabisch Aug 13, 2018

AlexanderFabisch Aug 13, 2018

RicardoDominguez Aug 14, 2018

AlexanderFabisch Aug 14, 2018

RicardoDominguez commented Aug 14, 2018

RicardoDominguez commented Aug 14, 2018

AlexanderFabisch commented Aug 14, 2018

RicardoDominguez commented Aug 15, 2018 •

edited

Loading

AlexanderFabisch commented Aug 15, 2018

AlexanderFabisch commented Aug 15, 2018

AlexanderFabisch commented Aug 15, 2018

RicardoDominguez commented Aug 15, 2018

AlexanderFabisch commented Aug 15, 2018

RicardoDominguez commented Aug 15, 2018

		@@ -0,0 +1,344 @@
		# Authors: Jan Hendrik Metzen <jhm@informatik.uni-bremen.de>
		# Alexander Fabisch <afabisch@informatik.uni-bremen.de>

		from bolero.utils.log import get_logger


		def solve_dual_contextual_reps(S, R, epsilon, min_eta, approx_grad = True):

[MRG] REPS / C-REPS analytical gradient benchmarks #81

[MRG] REPS / C-REPS analytical gradient benchmarks #81

Conversation

RicardoDominguez commented Aug 13, 2018

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

RicardoDominguez commented Aug 14, 2018

RicardoDominguez commented Aug 14, 2018

AlexanderFabisch commented Aug 14, 2018

RicardoDominguez commented Aug 15, 2018 • edited Loading

AlexanderFabisch commented Aug 15, 2018

AlexanderFabisch commented Aug 15, 2018

AlexanderFabisch commented Aug 15, 2018

RicardoDominguez commented Aug 15, 2018

AlexanderFabisch commented Aug 15, 2018

RicardoDominguez commented Aug 15, 2018

RicardoDominguez commented Aug 15, 2018 •

edited

Loading