
Conversation

pyth0n1c
Contributor

Respect manual_testing flag.
Previously, there was a bug where the validator was not always working as expected because:

  1. If tests were not defined, then the validator was not called. This has been fixed by setting always=True on the validator so that it runs even when the field is absent (see the sketch below).
  2. Because of the way tests are run and results are generated, a detection marked manual_test was showing as a failure. We now show a manual test as a "successful" test with a special message indicating that it was marked as manual_test in the test_results/summary.yml output. Please see the attached image as an example. This should make it easy to review results.

Note that test 'name' and 'message' fields have been dynamically generated at run time due to the presence of the manual_test field:
[image: example test_results/summary.yml output showing the dynamically generated test name and message fields]
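A minimal sketch of the kind of validator fix described in point 1 (pydantic v1 style, using always=True so the validator runs even when tests is undefined; the class, field, and message names here are illustrative assumptions, not the actual contentctl code):

from typing import Optional

from pydantic import BaseModel, validator


class DetectionSketch(BaseModel):
    # Hypothetical fields for illustration only
    name: str
    manual_test: Optional[str] = None
    tests: Optional[list[dict]] = None

    # always=True ensures the validator runs even when `tests` was never provided,
    # which was the gap described in point 1 above.
    @validator("tests", always=True)
    def validate_tests(cls, v, values):
        if values.get("manual_test") is not None:
            # Manual-test detections may omit tests; they are reported with a special message.
            return v
        if not v:
            raise ValueError("At least one test is required unless manual_test is set.")
        return v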

Contributor

@cmcginley-splunk left a comment

Have a few comments for you Eric, but looks good!

slightly unexpected check for MANUAL_TEST when actually executing test cases.
@pyth0n1c pyth0n1c added the Draft label Feb 17, 2024
@pyth0n1c
Contributor Author

Merged the latest contentctl updates into this branch. Now that a test SKIPPED state has been introduced, detections marked with MANUAL_TEST will be SKIPPED instead of simply logged as successful.

pyth0n1c and others added 5 commits March 20, 2024 09:27
functionality to take advantage of the new SKIP state.
accounting for skipped unit tests (due to the presence of the manual_test flag).
# SKIPPED tests still show a success in this field, but we want to count them differently
pass_increment = 1
for test in summary.get("tests"):
    if test.get("test_type") == "unit" and test.get("status") == "skip":
Contributor

This doesn't need to be blocking, but I'd like to make detection state resolution (e.g. whether it is considered a skip/fail/error/pass) a property of the detection itself.

E.g. define a property function status which iterates over the tests:

  • Pass: If at least one test passes and the rest pass/skip
  • Skip: If all tests skip
  • Fail: If at least one test fails and the rest pass/skip/fail
  • Error: If at least one test errors (regardless of other test states)

Contributor Author

💯
I think we should make this a pydantic computed_property. On top of that, I would add an additional option to this set of values (which would be the default) along the lines of NOT_YET_TESTED.
This way, when we are rendering the current state of all tests, e.g. in a user interface, we can easily differentiate/sort the different test states.
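A minimal sketch of what that computed status property could look like (pydantic v2 computed_field; the class, field, and enum names are hypothetical, not existing contentctl code):

from enum import Enum

from pydantic import BaseModel, computed_field


class TestResultStatus(str, Enum):
    # Hypothetical states mirroring the resolution rules discussed above
    NOT_YET_TESTED = "not_yet_tested"
    PASS = "pass"
    SKIP = "skip"
    FAIL = "fail"
    ERROR = "error"


class DetectionSketch(BaseModel):
    name: str
    test_statuses: list[TestResultStatus] = []

    @computed_field
    @property
    def status(self) -> TestResultStatus:
        statuses = self.test_statuses
        if not statuses:
            # Default before any tests have run
            return TestResultStatus.NOT_YET_TESTED
        if any(s is TestResultStatus.ERROR for s in statuses):
            return TestResultStatus.ERROR
        if any(s is TestResultStatus.FAIL for s in statuses):
            return TestResultStatus.FAIL
        if all(s is TestResultStatus.SKIP for s in statuses):
            return TestResultStatus.SKIP
        # At least one pass and the rest pass/skip
        return TestResultStatus.PASS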


# Sort s.t. all failures appear first (then by name)
tested_detections.sort(key=lambda x: (x["success"], x["name"]))
# Second sort condition is a hack to get detections with skipped unit tests to appear above pass tests
Contributor

Can I ask why you want skip above pass?

Contributor Author

The results file can be, and often is, REALLY large. I think it makes sense for errors/failures to be at the top, as these are the ones that deserve the most attention (and what people are likely to be looking for when they open this file).

I personally think "pass" is the least interesting since it means the detections are working as expected.
I think SKIP falls somewhere in between a failure and a pass. For example, someone might want to manually evaluate these OR note how many tests have been skipped.

Definitely a personal opinion.
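One way to express that ordering explicitly is with a status rank rather than overloading the success boolean; a small sketch (the status strings and dict keys here are assumptions for illustration, not the actual summary schema):

# Lower rank sorts first: errors/failures, then skips, then passes; ties broken by name.
STATUS_RANK = {"error": 0, "fail": 1, "skip": 2, "pass": 3}

tested_detections = [
    {"name": "Detection B", "status": "pass"},
    {"name": "Detection A", "status": "skip"},
    {"name": "Detection C", "status": "fail"},
]

tested_detections.sort(key=lambda d: (STATUS_RANK.get(d["status"], 0), d["name"]))
# Resulting order: Detection C (fail), Detection A (skip), Detection B (pass)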

 )
 success_rate = Utils.getPercent(
-    total_pass, total_detections, 1
+    total_pass, total_detections - total_skipped, 1
Contributor

This is a non-blocking comment and a reiteration of a conversation in slack; just including it for good book-keeping

I think future work could break out another metric, tested_detections, which is the value you are calculating inline here. It would provide clarity to someone reading the YAML as to how the success rate is derived. Likewise, total_detections should become a count of all detections within the scope of this test run (e.g. all, selected, changes) and, if all, should include experimental and deprecated in its count, with separate counts for total_production, total_experimental, etc.

I think this is just good data to have, but it also serves a request from Javier around having a metric for test coverage of all detections delivered in ESCU (which would include experimental and deprecated).

Contributor Author

Yes, I think this type of additional breakdown is a good idea.
I also think it should be abstracted from this particular view into a Model so that it can be used in different UI components (a rough sketch follows the list below), such as:

  1. The file, like we use it today
  2. Command line output
  3. Other User Interfaces we may support in the future.
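A rough sketch of what such a summary model might look like (the field names are illustrative assumptions, not the actual contentctl schema):

from pydantic import BaseModel, computed_field


class TestSummaryMetrics(BaseModel):
    # Hypothetical counts gathered from a test run
    total_detections: int = 0
    total_production: int = 0
    total_experimental: int = 0
    total_deprecated: int = 0
    total_pass: int = 0
    total_skipped: int = 0

    @computed_field
    @property
    def tested_detections(self) -> int:
        # Detections whose tests actually ran (skips excluded)
        return self.total_detections - self.total_skipped

    @computed_field
    @property
    def success_rate(self) -> float:
        if self.tested_detections == 0:
            return 0.0
        return round(100 * self.total_pass / self.tested_detections, 1)

The same model could then back the YAML summary file, command line output, or any future UI.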



# A list of groups of tests, relying on the same data
test_groups: Union[list[TestGroup], None] = None
Contributor

I see you migrated this code from Detection.

Can you explain the intention behind having separate classes here (Detection and the abstract)? I'm fine with this code living in either class, but I chose Detection because I thought the spirit of the abstract class was to be a more faithful representation of the model as it exists in YAML, and since the test_groups attribute is an inferred field, I thought maybe it made sense for it to live elsewhere.

So tl;dr: I'm totally fine with this and no change is required; I'm just curious, and wondering whether, if we don't have a need for two separate classes, we could perhaps collapse them into one in the future.

Contributor Author

Great question! Many contentctl users want to change content somehow (and detections are the content they are most often interested in changing). These changes could be, but are not limited to:

  1. Adding new fields (and possibly building validators for those fields)
  2. Setting more or less strict validations on fields
  3. Setting values for fields already defined in that object (but which may be missing from their YMLs)
  4. Disabling some of the validations we do.

However, if they change Detection_Abstract and WE change Detection_Abstract, it has a high probability of creating painful merge conflict issues. As such, the plan was to retain control of Detection_Abstract and ship all our out-of-the-box functionality in that file. Then if customers want their own functionality, they can add it to Detection.py.
This way, pulling the latest contentctl updates should be painless and without merge conflicts. We are hoping to do this with all types, but have started with Detection since it is the one that receives user changes most often.
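A simplified sketch of that split (the module layout and fields are illustrative, not the exact contentctl code):

# detection_abstract.py -- shipped and maintained by contentctl; holds all
# out-of-the-box fields and validators, and is not meant to be edited by users.
from pydantic import BaseModel


class Detection_Abstract(BaseModel):
    name: str
    search: str
    # ... remaining built-in fields and validators ...


# detection.py -- thin subclass reserved for user customizations, so that
# pulling the latest contentctl updates does not create merge conflicts.
class Detection(Detection_Abstract):
    # Example of a user-added field (hypothetical):
    # internal_ticket_id: str | None = None
    pass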

Contributor

@cmcginley-splunk left a comment

Left some comments regarding philosophy and future work, but LGTM :)

@pyth0n1c pyth0n1c removed the Draft label Apr 5, 2024
@pyth0n1c pyth0n1c merged commit a6ab65e into main Apr 5, 2024