INDY-1682: bugfix in logic for instance performance degraded #963

Toktar · 2018-10-31T13:29:02Z

The master replica can have throughput = 0, but before that the monitor is always reseted.
When we adding a new instance with adding a new node, other instances can have throughput> 0. In the old logic a new insance perfomance was degraded.
In the new logic added checking that a new replica received any requests.

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

Toktar · 2018-10-31T14:00:28Z

test this please

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

ashcherbakov · 2018-11-02T06:53:39Z

plenum/test/monitoring/test_revival_spike_resistant_ema_throughput_measurement.py

@@ -74,7 +74,7 @@ def test_rsr_ema_tm_past_windows_processed_on_add_request(tm):
    assert tm.reqs_in_window == 1



Please add tests that getThroughput is None in the start window

We test it in test_rsr_ema_tm_after_start_switches_to_revival_on_not_empty_window . Is it ok or we need more tests?

It makes sense to add a test which will specifically verify get_throughput value during the safe start to TESTS OF THROUGHPUT CALCULATION section of this file.

Added test_rsr_ema_tm_throughput_in_safe_start()

ashcherbakov · 2018-11-02T06:57:06Z

plenum/test/monitoring/test_revival_spike_resistant_ema_throughput_measurement.py

@@ -98,7 +98,35 @@ def test_rsr_ema_tm_past_windows_processed_on_get_throughput(tm):

    # [30, 45)
    tm.get_throughput(42)
-    assert tm.window_start_ts == 30
+    assert tm.window_start_ts == 15


Why 15? We are expecting that there is no info before window_size * min_cnt seconds pass, that is 15*16?

Because get_throughput recalculated throughput value early, but now in the safe start we do it only in add_request().
After tm.add_request(16) value of tm.window_start_ts will 15. And after tm.get_throughput(42) it will not be change.

ashcherbakov · 2018-11-02T07:03:17Z

plenum/test/view_change/conftest.py

@@ -44,7 +44,7 @@ def fake_view_changer(request, tconf):
    )
    monitor = FakeSomething(
        isMasterDegraded=lambda: False,
-        areBackupsDegraded=lambda: [],
+        areBackupsDegraded=lambda a: [],


Do we need a parameter in lambda here?

No, my mistake. Fixed it.

ashcherbakov · 2018-11-02T07:04:42Z

plenum/test/monitoring/test_throughput_based_master_degradation_detection.py

@@ -278,3 +285,15 @@ def test_master_not_degraded_on_revival_spike_on_one_backup_while_load_stopped(t
    throughput_ratio = get_throughput_ratio(inst_req_streams, tconf)

    assert_master_not_degraded(throughput_ratio, tconf)
+
+
+def test_master_not_degraded_on_new_instance(fake_monitor, tconf):


The test rather checks that newly added instance is not degraded.
I think we need two tests: that master is not degraded when a new instance is added, and that new instance is not degraded right after it's added.

May be I can add a new check for master instance in this test or we need two tests with different checkings for one case?

ashcherbakov · 2018-11-02T07:10:36Z

plenum/test/monitoring/test_revival_spike_resistant_ema_throughput_measurement.py


    assert tm.throughput_before_idle == 0
    assert tm.idle_start_ts == 0
-    assert tm.empty_windows_count == 4
+    assert tm.empty_windows_count == 0


Why it's 0 now? Should we have a new test or edit this one to get the expected behaviour?

Why is value "0" bad? We didn't order any transactions and empty_windows_count is 0.

Please add a comment to the code explaining why empty_windows_count must be 0 here.

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

spivachuk · 2018-11-02T09:28:19Z

plenum/test/monitoring/test_revival_spike_resistant_ema_throughput_measurement.py

@@ -74,7 +74,7 @@ def test_rsr_ema_tm_past_windows_processed_on_add_request(tm):
    assert tm.reqs_in_window == 1


-def test_rsr_ema_tm_past_windows_processed_on_get_throughput(tm):
+def test_rsr_ema_tm_past_windows_processed_on_get_throughput_after_start(tm):


The name is incorrect because windows are not processed. We would name this test test_rsr_ema_tm_past_windows_not_processed_on_get_throughput_during_safe_start.

Changed it.

refactoring test_revival_spike_resistant_ema_throughput_measurement.py Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

spivachuk · 2018-11-02T09:46:17Z

plenum/test/monitoring/test_revival_spike_resistant_ema_throughput_measurement.py

@@ -180,7 +210,8 @@ def tm_in_normal(tm_after_start):
        tm.add_request(ts)

    # [15, 30) - [225, 240) -- up to 16 not empty windows
-    tm.get_throughput(15)
+    tm.add_request(15)
+    assert tm.get_throughput(15) is None


This assert should not be a responsibility of this fixture.

spivachuk · 2018-11-02T10:03:14Z

plenum/test/monitoring/test_revival_spike_resistant_ema_throughput_measurement.py

@@ -137,9 +164,11 @@ def test_rsr_ema_tm_after_start_switches_to_revival_on_not_empty_window(tm_after

    # [60, 75)
    throughput = tm.get_throughput(62)
+    tm.add_request(62)


The last two lines should be swapped because the test should verify get_throughput return value in REVIVAL, not FADED state.

refactoring test_revival_spike_resistant_ema_throughput_measurement.py Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

spivachuk · 2018-11-02T12:28:07Z

plenum/test/monitoring/test_revival_spike_resistant_ema_throughput_measurement.py

@@ -180,7 +210,7 @@ def tm_in_normal(tm_after_start):
        tm.add_request(ts)

    # [15, 30) - [225, 240) -- up to 16 not empty windows
-    tm.get_throughput(15)
+    tm.add_request(15)
    assert tm.state == State.REVIVAL

    for ts in range(15, 240, 5):


Now there is one extra request at 15.

refactoring test_revival_spike_resistant_ema_throughput_measurement.py Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

Toktar added 2 commits October 31, 2018 16:01

INDY-1682: bugfix in getThroughputs()

7557019

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

INDY-1682: refactoring

64f8c43

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

Toktar added 5 commits November 1, 2018 12:05

INDY-1682: bugfix in checking areBackupsDegraded()

028c840

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

INDY-1682: change Replicas iterator

87b2212

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

INDY-1682: change logic for get_throughput() in instance start

1210359

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

INDY-1682: update test_master_not_degraded_on_new_instance

6f80e5b

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

INDY-1682: update test_send_IC_if_master_degraded

cc103f1

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

Toktar changed the title ~~INDY-1682: bugfix in getThroughputs()~~ INDY-1682: bugfix in logic for insurance performance degraded Nov 2, 2018

ashcherbakov reviewed Nov 2, 2018

View reviewed changes

Toktar added 2 commits November 2, 2018 11:21

INDY-1682: bugfix fixture fake_view_changer in tests

1c3a70e

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

INDY-1682: update test_instances_not_degraded_on_new_instance

bfb850f

Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

ashcherbakov previously approved these changes Nov 2, 2018

View reviewed changes

spivachuk reviewed Nov 2, 2018

View reviewed changes

Toktar dismissed ashcherbakov’s stale review via 8f128dd November 2, 2018 09:35

INDY-1682: refactoring tests for throughput measurement

8f128dd

refactoring test_revival_spike_resistant_ema_throughput_measurement.py Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

spivachuk reviewed Nov 2, 2018

View reviewed changes

INDY-1682: refactoring tests for throughput measurement

a3fa4ec

refactoring test_revival_spike_resistant_ema_throughput_measurement.py Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

spivachuk reviewed Nov 2, 2018

View reviewed changes

INDY-1682: refactoring tests for throughput measurement

e77e51b

refactoring test_revival_spike_resistant_ema_throughput_measurement.py Signed-off-by: toktar <renata.toktar@dsr-corporation.com>

spivachuk approved these changes Nov 2, 2018

View reviewed changes

ashcherbakov approved these changes Nov 2, 2018

View reviewed changes

Toktar changed the title ~~INDY-1682: bugfix in logic for insurance performance degraded~~ INDY-1682: bugfix in logic for instance performance degraded Nov 2, 2018

ashcherbakov merged commit b6d4335 into hyperledger:master Nov 2, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

INDY-1682: bugfix in logic for instance performance degraded #963

INDY-1682: bugfix in logic for instance performance degraded #963

Toktar commented Oct 31, 2018

Toktar commented Oct 31, 2018

ashcherbakov Nov 2, 2018

Toktar Nov 2, 2018 •

edited

spivachuk Nov 2, 2018

Toktar Nov 2, 2018

ashcherbakov Nov 2, 2018

Toktar Nov 2, 2018

ashcherbakov Nov 2, 2018

Toktar Nov 2, 2018

ashcherbakov Nov 2, 2018

Toktar Nov 2, 2018

ashcherbakov Nov 2, 2018

Toktar Nov 2, 2018

spivachuk Nov 2, 2018

Toktar Nov 2, 2018

spivachuk Nov 2, 2018

Toktar Nov 2, 2018

spivachuk Nov 2, 2018

Toktar Nov 2, 2018

spivachuk Nov 2, 2018

Toktar Nov 2, 2018

spivachuk Nov 2, 2018 •

edited

Toktar Nov 2, 2018

		@@ -74,7 +74,7 @@ def test_rsr_ema_tm_past_windows_processed_on_add_request(tm):
		assert tm.reqs_in_window == 1

INDY-1682: bugfix in logic for instance performance degraded #963

INDY-1682: bugfix in logic for instance performance degraded #963

Conversation

Toktar commented Oct 31, 2018

Toktar commented Oct 31, 2018

Choose a reason for hiding this comment

Toktar Nov 2, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

spivachuk Nov 2, 2018 • edited

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Toktar Nov 2, 2018 •

edited

spivachuk Nov 2, 2018 •

edited