[WIP] 441/bugfix/slow report generation with plenty of machines #447

ShayNehmad · 2019-10-02T06:53:15Z

Feature / Fixes

Fixes #441.

Have you added an explanation of what your changes do and why you'd like to include them?
Have you successfully tested your changes locally?

Screenshots:

From:

To (Middle of progress):

To (final):

Changes

Improved runtime of the get_displayed_node_by_id function via caching
Improved runtime of the get_scanned function via algorithmic improvments
Other small optimisations in EdgeService and NodeService via caching mostly.

…rogress...

Lowers amount of deps

…n every run to check if something is a node or a monkey.

danielguardicore · 2019-10-02T08:09:04Z

monkey/monkey_island/cc/services/attack/technique_reports/technique_report_tools.py

@@ -16,7 +16,8 @@ def parse_creds(attempt):
        if attempt[key]:
            return '%s ; %s : %s' % (username,
                                     cred['type'],
-                                     cred['output'])
+                                     # TODO Figure out why this is causing an exception with Vakaris


monkey/monkey_island/cc/models/monkey.py

danielguardicore · 2019-10-02T13:38:46Z

monkey/monkey_island/cc/services/edge.py

-            to_id = NodeService.get_monkey_by_id(edge["to"])
-            if to_id is None:
-                to_label = NodeService.get_node_label(NodeService.get_node_by_id(edge["to"]))
+            if Monkey.is_monkey(to_id):


Not sure if this needs to be changed now, but this is hacky. We should have the Monkey check inside the NodeService

If we create a Node model and think about the data structure - this should be changed (I think Monkey should be an extension of Node, instead of 2 different models). Until then this patch can remain IMO

danielguardicore · 2019-10-02T13:43:47Z

monkey/monkey_island/cc/services/reporting/report.py

-               for node in mongo.db.node.find({'exploited': True}, {'_id': 1})]
+             mongo.db.monkey.find({}, {'_id': 1}) if
+             not NodeService.get_monkey_manual_run(NodeService.get_monkey_by_id(monkey['_id']))]
+


This entire section should probably be rewritten (not now?)
it's basically iterating twice over mongo.db.monkey.find({}, {'_id': 1} and then asking different questions.
I'd rather we pull it once (meaning)
nodes_with_monkeys = [NodeService.get_displayed_node_by_id(monkey['_id'], True) for monkey in mongo.db.monkey.find({}, {'_id': 1})]
And then filter.
This requires an ugly rewrite to use an inner function or have the report not be static :/

Or a different "exploited_nodes_fetcher" which can be stateful. Again - I agree, but this is not for now. Added to long-term planning on the board.

danielguardicore · 2019-10-02T13:44:12Z

monkey/monkey_island/cc/services/reporting/report.py

+        nodes_with_monkeys = [NodeService.get_displayed_node_by_id(monkey['_id'], True) for monkey in
+                              mongo.db.monkey.find({}, {'_id': 1})]
+        nodes = nodes_without_monkeys + nodes_with_monkeys
+        return nodes
+
    @staticmethod
    def get_exploited():
-        exploited = \
+        exploited_with_monkeys = \
            [NodeService.get_displayed_node_by_id(monkey['_id'], True) for monkey in
-             mongo.db.monkey.find({}, {'_id': 1})
-             if not NodeService.get_monkey_manual_run(NodeService.get_monkey_by_id(monkey['_id']))] \
-            + [NodeService.get_displayed_node_by_id(node['_id'], True)
-               for node in mongo.db.node.find({'exploited': True}, {'_id': 1})]
+             mongo.db.monkey.find({}, {'_id': 1}) if
+             not NodeService.get_monkey_manual_run(NodeService.get_monkey_by_id(monkey['_id']))]
+
+        exploited_without_monkeys = [NodeService.get_displayed_node_by_id(node['_id'], True) for node in
+                                     mongo.db.node.find({'exploited': True}, {'_id': 1})]


Same comment just on node find.

danielguardicore · 2019-10-02T13:44:42Z

monkey/monkey_island/cc/services/reporting/report.py

@@ -699,6 +706,8 @@ def generate_report():
        cross_segment_issues = ReportService.get_cross_segment_issues()
        monkey_latest_modify_time = Monkey.get_latest_modifytime()

+        scanned_nodes = ReportService.get_scanned()


Why moved outside? Readability?

Yes. Not critical and can be undone

ShayNehmad added 3 commits September 22, 2019 19:59

WIP commit, added caches, found place which is n*n

bea4140

Started improving and researching the performence issues - still in p…

1060c00

…rogress...

Fixed decorator order, now caching works

93c9aaa

ShayNehmad added Bug An error, flaw, misbehavior or failure in the Monkey or Monkey Island. Island labels Oct 2, 2019

ShayNehmad added this to the Infection Monkey for Zero Trust milestone Oct 2, 2019

ShayNehmad self-assigned this Oct 2, 2019

ShayNehmad added this to In progress in Monkey Dev Board via automation Oct 2, 2019

ShayNehmad added 6 commits October 2, 2019 09:54

Using ring as the primary caching library, no functools.

4d9467b

Lowers amount of deps

Updated docs and TODO (we won't get to it this PR)

628ebc0

Added monkey island logic to get label by id

6327f6e

Added cache test to test_monkey.py

122919d

Optimised monkey_to_net_node

264e740

get_edge_label is a little quicker - uses cache. Still calls the DB o…

e3b93f1

…n every run to check if something is a node or a monkey.

danielguardicore reviewed Oct 2, 2019

View reviewed changes

ShayNehmad added 2 commits October 2, 2019 12:18

Added cached checking of is_monkey to optimise runtime of EdgeService

d02e349

Deleted unused function

656184e

danielguardicore reviewed Oct 2, 2019

View reviewed changes

monkey/monkey_island/cc/models/monkey.py Show resolved Hide resolved

danielguardicore reviewed Oct 2, 2019

View reviewed changes

monkey/monkey_island/cc/models/monkey.py Show resolved Hide resolved

danielguardicore reviewed Oct 2, 2019

View reviewed changes

ShayNehmad added 3 commits October 2, 2019 16:48

Removed TODO - seems like an edge case that won't reproduce for clients.

063a136

Fixed label cache logic and added to UTs

70daf4b

Formatting fix

2cabcb6

danielguardicore approved these changes Oct 3, 2019

View reviewed changes

ShayNehmad merged commit 3b6714e into develop Oct 3, 2019

Monkey Dev Board automation moved this from In progress to Done Oct 3, 2019

ShayNehmad deleted the 441/bugfix/slow-report-generation-with-plenty-of-machines branch October 3, 2019 15:50

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] 441/bugfix/slow report generation with plenty of machines #447

[WIP] 441/bugfix/slow report generation with plenty of machines #447

ShayNehmad commented Oct 2, 2019 •

edited

danielguardicore Oct 2, 2019

danielguardicore Oct 2, 2019

ShayNehmad Oct 2, 2019

danielguardicore Oct 2, 2019

ShayNehmad Oct 2, 2019

danielguardicore Oct 2, 2019

danielguardicore Oct 2, 2019

ShayNehmad Oct 2, 2019

[WIP] 441/bugfix/slow report generation with plenty of machines #447

[WIP] 441/bugfix/slow report generation with plenty of machines #447

Conversation

ShayNehmad commented Oct 2, 2019 • edited

Feature / Fixes

Screenshots:

Changes

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ShayNehmad commented Oct 2, 2019 •

edited