Rogue node detection. #27

CdavM · 2016-07-25T04:42:32Z

This PR should be used to discuss an implementation of the rogue node algorithm implementation.

mitar · 2016-07-26T09:41:13Z

So, why are tests failing here?

mitar · 2016-07-26T09:42:09Z

nodewatcher/modules/analysis/rogue_nodes/tasks.py

+
+from nodewatcher import celery
+
+from nodewatcher.modules.monitor.http.survey.management.commands.export_survey_data import extract_survey_graph


I think it would be better if this particular function would be somewhere else, and then both the management command and this task would import it.

CdavM · 2016-07-27T21:50:37Z

It's no longer failing. Had to rename a couple of modules.

mitar · 2016-07-27T21:55:11Z

nodewatcher/modules/analysis/rogue_nodes/tests.py

+                        graph=input_graph['graph'],
+                        friendly_nodes=input_graph['friendly_nodes'],
+                    )
+                    # append -results to the end of the filename


Comment style.

mitar · 2016-07-27T21:57:00Z

Hm, do you think it is useful to store whole results for your tests? Maybe. But isn't the idea that you detect rogue nodes, but slight changes in values might not be so problematic? So maybe more important is that class rogue/not does not change?

mitar · 2016-07-27T21:57:38Z

nodewatcher/modules/analysis/rogue_nodes/algorithm.py

+    """
+
+    nx_graph = nx.Graph()
+    nx_graph.add_nodes_from([(node['i'], {'b': node['b']}) if 'b' in node else node['i'] for node in graph["v"]])


Don't use ".

CdavM · 2016-07-27T22:06:10Z

When you say "results", you mean the output of the detection algorithm? Meaning a bunch of nodes and the probability of each being rogue?
I think we need to store all the data because that should be the output of an algorithm. An alternative is to filter the results before the algorithm outputs them, but I prefer it the way it is now.
But it is true that I am assuming that there is only one rogue node in these test cases and we are not sensitive at all to perturbations.

So what should we do?

CdavM · 2016-07-31T18:28:15Z

Yes, I implemented a custom test suite. Please do a code review.

CdavM · 2016-07-31T18:30:35Z

We could also try to ensure that the numbers are in a certain range, as you proposed, but it is not within the scope of the algorithm to determine the "classification" of a rogue node. That is done by the maintainers. So we might be breaking abstraction barriers.

But more importantly, I don't foresee anyone tampering with this algorithm in the near future so I wouldn't spend too much time trying to predict how they will work on it.

…tream-rogue-node-detection

mitar · 2016-08-11T09:55:58Z

nodewatcher/modules/analysis/rogue_nodes/algorithm.py

+    """
+
+    nx_graph = nx.Graph()
+    nx_graph.add_nodes_from([(node['i'], {'b': node['b']}) if 'b' in node else node['i'] for node in graph['v']])


I do not get it here. Nodes are or tuple of (ID, {b: BSSID}) or just ID? This hurts my mental type-checking. :-)

Yes, that's how you create nodes in nx:
http://networkx.readthedocs.io/en/latest/reference/generated/networkx.Graph.add_node.html#networkx.Graph.add_node

It has to be a tuple in case you're storing additional information.

mitar · 2016-08-11T10:25:44Z

I reviewed the pull request. Please answer questions I made. Also see changes I made. See if everything still works.

CdavM · 2016-08-18T15:54:14Z

Hi, not all tests are running with your code. Django only reports running one test: comparing 'behind-couch-....' with its results. How do we fix this?

mitar · 2016-08-18T16:21:57Z

nodewatcher/modules/analysis/rogue_nodes/tests.py

+        for filename in files:
+            # Test every JSON file that does not contain "results" in the filename.
+            if os.path.splitext(filename)[1] == '.json' and 'results' not in filename:
+                test_cases.addTest(RogueNodeTestCase('run_test', os.path.join(path, filename)))


Can you check if this is called for all files?

Before you modified the code, I was able to use print statements to debug. It seems that stdout is now being redirected. I also can't use pdb, it seems to skip over the whole thing. How do I verify that this is called?

Print statements in load_tests should work, because that is called when tests are being loaded (the same as before in your code). So does for loop work correctly?

Inside tests stdout is collected and you get it to output only if there is an error. So one simple way is to throw an exception with your message. ;-)

OK, the for loop works correctly and finds all the test cases. test_cases is an array of 7 elements, one for every test we have.
But run_test only runs once in total (only with the first test_filename).

Not sure. Research. Maybe the issue is that we have the same name for tests all the time. So id returns the same. You should probably override id and get it to return module name + class name + test_filename or something.

See how it is officially defined. You could also just call super and append fest_filename to the id.

I decided it would be easier to construct a test_case with a directory and the test would go into the directory and check each entry. This also actually works.

Have you tried id method approach?

Are you sure it works? That tests don't now stop on first error? That you run all tests and allow some of them to fail?

Hi,

the overriding id doesn't work.
Tests probably do stop on the first error, but at least all of them run.

ok. I found out that I have to override the hashing function. It works now. will push new code shortly.

mitar · 2016-08-19T18:51:55Z

nodewatcher/modules/analysis/rogue_nodes/tests.py

+                    with io.open(os.path.join(path, results_filename), encoding='utf-8') as asserted_output_file:
+                        asserted_output = json.load(asserted_output_file)
+
+                    self.assertEqual(algorithm_output, asserted_output)


If any test fails now, other tests are not run.

CdavM · 2016-08-21T00:09:12Z

@mitar Can you do a code review of this module?

mitar · 2016-08-21T03:24:44Z

Perfect. Merging!

CdavM added 9 commits July 13, 2016 20:48

Added the NetworkX package.

59c82fb

Initial rogue node algorithm implementation.

8a99095

PEP8 Style fixes.

78d19c6

Send an email instead.

4f6b054

Email now using default from address.

be9cea0

Sending the email to admins.

1060de8

Style fix.

f9943c9

Initial implementation of the testing script.

b6fbb68

Added some test data sets along with their expected reports.

1aa8cae

Merge branch 'development' into rogue-node-detection

bec14c8

mitar reviewed Jul 26, 2016
View reviewed changes

CdavM added 8 commits July 27, 2016 12:57

Renamed an import command.

dc84b42

No longer importing a function directly.

b28e210

Moved the core algorithm to a different file.

c5136a4

Created a parameter for the extraction function.

a3b0416

Cleaned up imports.

60fbdf3

Removed unnecessary import statement.

015a18a

PEP8 love.

29ac60e

Added more test cases.

9f58dd4

mitar reviewed Jul 27, 2016
View reviewed changes

Comment style.

e8ef976

Implemented a custom test suite.

3f89bb3

mitar and others added 3 commits August 2, 2016 01:10

Merge branch 'development' into rogue-node-detection

e585290

Moved all_nodes_survey_graph to a separate file.

6eda39c

Merge remote-tracking branch 'upstream/rogue-node-detection' into ups…

0dfaf2f

…tream-rogue-node-detection

mitar reviewed Aug 11, 2016
View reviewed changes

Smaller stylistic and refactoring changes.

220cf75

Order of imports.

b1e3053

CdavM added 2 commits August 18, 2016 09:18

Changed type of a result to a float.

fd821f5

Removed broken test case.

61dedad

mitar reviewed Aug 18, 2016
View reviewed changes

CdavM added 2 commits August 19, 2016 09:19

Testing script now passes a directory.

3e9f76f

Removed buggy test case.

b4f85ae

mitar reviewed Aug 19, 2016
View reviewed changes

Overwrote the hashing function to distinguish test cases.

7837f29

Merge branch 'development' into rogue-node-detection

49d9037

mitar merged commit 672b8a2 into development Aug 21, 2016

mitar deleted the rogue-node-detection branch August 21, 2016 03:25

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Rogue node detection. #27

Rogue node detection. #27

CdavM commented Jul 25, 2016

mitar commented Jul 26, 2016

mitar Jul 26, 2016

CdavM commented Jul 27, 2016

mitar Jul 27, 2016

mitar commented Jul 27, 2016

mitar Jul 27, 2016

CdavM commented Jul 27, 2016

CdavM commented Jul 31, 2016

CdavM commented Jul 31, 2016

mitar Aug 11, 2016

CdavM Aug 11, 2016

mitar commented Aug 11, 2016

CdavM commented Aug 18, 2016

mitar Aug 18, 2016

CdavM Aug 18, 2016

mitar Aug 18, 2016

CdavM Aug 18, 2016

mitar Aug 18, 2016

mitar Aug 18, 2016

CdavM Aug 19, 2016

mitar Aug 19, 2016

CdavM Aug 19, 2016

CdavM Aug 19, 2016

mitar Aug 19, 2016

CdavM commented Aug 21, 2016

mitar commented Aug 21, 2016


		from nodewatcher import celery

		from nodewatcher.modules.monitor.http.survey.management.commands.export_survey_data import extract_survey_graph

Rogue node detection. #27

Rogue node detection. #27

Conversation

CdavM commented Jul 25, 2016

mitar commented Jul 26, 2016

Choose a reason for hiding this comment

CdavM commented Jul 27, 2016

Choose a reason for hiding this comment

mitar commented Jul 27, 2016

Choose a reason for hiding this comment

CdavM commented Jul 27, 2016

CdavM commented Jul 31, 2016

CdavM commented Jul 31, 2016

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mitar commented Aug 11, 2016

CdavM commented Aug 18, 2016

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

CdavM commented Aug 21, 2016

mitar commented Aug 21, 2016