Skip to content

GenyTest is a powerful tool that automates the generation of test cases, aiming to achieve a perfect `1.0` mutation score.

Notifications You must be signed in to change notification settings

Samet-MohamedAmin/genytest

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GenyTest

Automatic Test Case Generator for 1.0 Mutation Score mutation score.

GenyTest is a powerful tool that automates the generation of test cases, aiming to achieve a perfect 1.0 mutation score. Designed to work with tests that have incremental scores, such as coverage tests, Geny Test relies on two core technologies:

Note that this tool can be used with any incremental score test such as coverage tests.

Why ?

When dealing with complex legacy code, it's crucial to have comprehensive tests in place to prevent regressions. Coverage tests alone are insufficient. A 1.0 mutation score allows for code modification without the fear of hidden regressions in unfamiliar code. Automatically generating test cases can be challenging due to the high number of possible test scenarios and the computational resources required. Geny Test tackles these challenges with an efficient parallel strategy and a powerful algorithm.

A function with 4 parameters and 5 possible values each parameter will have 5 ** 4 = 225 testcases. And the number of tests can reach thounds easilly if possible values are more than 5.

Guessing the test cases is hard if the function is complex. Given the example of GildedRose-Refactoring-Kata, I spent more then hour to find the suitable testcases and I failed.

In a perfect world mutation score is 1.0 :)

The challenge in this tool that the execution of mutation tests consume a lot CPU and memory and take some time (sometimes 60 seconds). And the more the testcases are, the more the execution consumes ressources and time. Plus it require some kind of sandboxed environment because the approval test file will change for each test. So a good parallel strategy and a good algorithm should be used.

Example

GildedRose possible_values.yaml

Given the example of GildedRose-Refactoring-Kata

Given the possible_values.yaml config file:

name:
  type: string
  values:
    static:
      - Aged Brie
      - Sulfuras, Hand of Ragnaros
      - Backstage passes to a TAFKAL80ETC concert
      - Some title
    ranges: []
sellIn:
  type: int
  values:
    static: []
    ranges:
      -
        min: -2
        max: 2
      -
        min: 9
        max: 11
      -
        min: 5
        max: 7

quality:
  type: int
  values:
    static: []
    ranges:
      -
        min: -2
        max: 2
      -
        min: 47
        max: 52

Execution

That generates 484 test cases

Phase 1: Eliminate useless scores with parallel execution of go routinse This process is executed in parallel.

At the end the number of test cases are reduced from 484 to 23 With basic maths it eliminates 95.25% of test cases

[test]$ go run main.go
---> Mutation
2023/04/17 19:46:10 scores[0] = 14.47
2023/04/17 19:46:35 scores[483] = 100.00
2023/04/17 19:46:58 scores[241] = 86.84
2023/04/17 19:47:24 scores[120] = 44.74
2023/04/17 19:47:26 scores[362] = 97.37
2023/04/17 19:47:57 scores[60] = 44.74
2023/04/17 19:47:57 ---------> marked usesless form 61 to 120
2023/04/17 19:47:57 --------------------------------------------> calculated 13.43%
2023/04/17 19:47:59 scores[180] = 84.21
2023/04/17 19:48:04 scores[301] = 93.42
2023/04/17 19:48:05 scores[422] = 98.68
2023/04/17 19:48:38 scores[30] = 44.74
2023/04/17 19:48:38 ---------> marked usesless form 31 to 60
2023/04/17 19:48:38 --------------------------------------------> calculated 20.25%
2023/04/17 19:48:41 scores[150] = 73.68
2023/04/17 19:48:42 scores[210] = 86.84
2023/04/17 19:48:48 scores[271] = 90.79
2023/04/17 19:48:48 ---------> marked usesless form 211 to 241
2023/04/17 19:48:48 --------------------------------------------> calculated 27.07%
2023/04/17 19:48:48 scores[331] = 97.37
2023/04/17 19:49:23 scores[392] = 97.37
2023/04/17 19:49:23 ---------> marked usesless form 363 to 392
2023/04/17 19:49:23 --------------------------------------------> calculated 33.47%
2023/04/17 19:49:23 scores[15] = 26.32

....


2023/04/17 20:00:26 --------------------------------------------> calculated 98.97%
2023/04/17 20:00:29 scores[313] = 94.74
2023/04/17 20:00:29 --------------------------------------------> calculated 99.17%
2023/04/17 20:00:29 ---------> marked usesless form 313 to 313
2023/04/17 20:00:29 --------------------------------------------> calculated 99.17%
2023/04/17 20:01:04 scores[398] = 97.37
2023/04/17 20:01:04 ---------> marked usesless form 398 to 398
2023/04/17 20:01:04 --------------------------------------------> calculated 99.38%
2023/04/17 20:01:05 scores[445] = 98.68
2023/04/17 20:01:05 ---------> marked usesless form 447 to 483
2023/04/17 20:01:05 --------------------------------------------> calculated 99.59%
2023/04/17 20:01:05 ---------> marked usesless form 445 to 445
2023/04/17 20:01:05 --------------------------------------------> calculated 99.59%
2023/04/17 20:01:05 scores[179] = 84.21
2023/04/17 20:01:05 ---------> marked usesless form 180 to 180
2023/04/17 20:01:05 --------------------------------------------> calculated 99.79%
2023/04/17 20:01:07 scores[270] = 90.79
2023/04/17 20:01:07 ---------> marked usesless form 271 to 271
2023/04/17 20:01:07 --------------------------------------------> calculated 100.00%

Phase 2: Fine tune the end results and elimnates testcases that does not increment mutation score with an other order of testcases.

This phase is optional

At the end the number of test cases are reduced from 23 to 20 With basic maths it eliminates 13% of test cases

2023/04/17 20:01:07 ------------------------------> start sequentialFilter
--- individual socres
2023/04/17 20:02:18 scores[1] = 14.47
2023/04/17 20:02:18 scores[3] = 17.11
2023/04/17 20:02:18 scores[0] = 14.47
2023/04/17 20:02:19 scores[5] = 18.42
2023/04/17 20:02:19 scores[4] = 15.79
2023/04/17 20:02:19 scores[7] = 19.74
2023/04/17 20:02:19 scores[2] = 18.42
2023/04/17 20:02:20 scores[6] = 30.26
2023/04/17 20:03:28 scores[9] = 35.53
2023/04/17 20:03:29 scores[13] = 13.16
2023/04/17 20:03:29 scores[11] = 28.95
2023/04/17 20:03:29 scores[8] = 22.37
2023/04/17 20:03:29 scores[10] = 30.26
2023/04/17 20:03:29 scores[15] = 30.26
2023/04/17 20:03:30 scores[14] = 15.79
2023/04/17 20:03:30 scores[12] = 28.95
2023/04/17 20:04:29 scores[16] = 22.37
2023/04/17 20:04:30 scores[20] = 22.37
2023/04/17 20:04:30 scores[17] = 27.63
2023/04/17 20:04:30 scores[21] = 19.74
2023/04/17 20:04:30 scores[19] = 21.05
2023/04/17 20:04:30 scores[18] = 28.95
2023/04/17 20:04:30 scores[22] = 34.21
--- sort test cases and recalculate scores in parallel for each subcombo of testcases
2023/04/17 20:05:41 scores[4] = 57.89
2023/04/17 20:05:41 scores[5] = 63.16
2023/04/17 20:05:41 scores[0] = 28.95
2023/04/17 20:05:41 scores[6] = 65.79
2023/04/17 20:05:41 scores[1] = 36.84
2023/04/17 20:05:41 scores[7] = 73.68
2023/04/17 20:05:42 scores[3] = 52.63
2023/04/17 20:05:42 scores[2] = 47.37
2023/04/17 20:06:50 scores[9] = 75.00
2023/04/17 20:06:50 scores[10] = 77.63
2023/04/17 20:06:51 scores[11] = 85.53
2023/04/17 20:06:51 scores[8] = 75.00
2023/04/17 20:06:52 scores[12] = 88.16
2023/04/17 20:06:52 scores[14] = 94.74
2023/04/17 20:06:52 scores[13] = 92.11
2023/04/17 20:06:53 scores[15] = 96.05
2023/04/17 20:07:50 scores[18] = 97.37
2023/04/17 20:07:50 scores[16] = 96.05
2023/04/17 20:07:50 scores[17] = 96.05
2023/04/17 20:07:51 scores[19] = 98.68
2023/04/17 20:07:51 scores[20] = 100.00
2023/04/17 20:07:51 scores[21] = 100.00
2023/04/17 20:07:51 scores[22] = 100.00
2023/04/17 20:07:51 ------------------------------> end sequentialFilter

Results

Example of Results:
-
  comboHash: "323b0843c996705319c9645513f9617483cac124"
  input:
    -
        name: Aged Brie
        sellIn: 0
        quality: -1

-
  comboHash: "18196499088da78902887f2b47ae2dcf25e464ec"
  input:
    -
        name: Some title
        sellIn: 11
        quality: 50

-
  comboHash: "6c7d85b57aeea6ba99aa3e561d23b849c4ef6f4e"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 10
        quality: -1

-
  comboHash: "9bc6b38ba6bfb2d9905428adff6ca05194096bb6"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 1
        quality: 50

-
  comboHash: "5a1d09011c5a7012f208dd420f89b6fa1aa90d3a"
  input:
    -
        name: Some title
        sellIn: 0
        quality: 0

-
  comboHash: "1a420cee1ce21fe5a12d402b47b11f78793c3073"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 0
        quality: 50

-
  comboHash: "6f3db4d50ef034fda030ac8b811549889bffdc30"
  input:
    -
        name: Aged Brie
        sellIn: 0
        quality: 50

-
  comboHash: "c8ce3f73efaa9cd1963a3ca52173ab414f03da08"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 5
        quality: -1

-
  comboHash: "88dc5dd974a57b84141dfa17123f0181d134b288"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 6
        quality: 48

-
  comboHash: "551d1a04e58ee41f969f120fbd41e2df34b8096f"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 6
        quality: -1

-
  comboHash: "f2fbb545ce825406d4bdc57e7150c5678eeb92cd"
  input:
    -
        name: Some title
        sellIn: 0
        quality: 50

-
  comboHash: "8a982d7e3253df2a307878374711fd69d56b8ea7"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 5
        quality: 48

-
  comboHash: "339e331552c9bd8bbfd0389ea2d1be49043d6dad"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 5
        quality: 49

-
  comboHash: "2732f9769bb026ef5a49d8cc6316b2d362c82592"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 11
        quality: -1

-
  comboHash: "f2dac14ea6977115803ccf2ec33626849162b8cc"
  input:
    -
        name: Aged Brie
        sellIn: 0
        quality: 48

-
  comboHash: "58068af86be0110470da14a6de97ee52d8ee684f"
  input:
    -
        name: Some title
        sellIn: 11
        quality: 1

-
  comboHash: "518c17d4042a49327768ef2d9423a5a6f2f53e0f"
  input:
    -
        name: Backstage passes to a TAFKAL80ETC concert
        sellIn: 5
        quality: 47

-
  comboHash: "5b3d16808ee26e70555edbe8bd189ff3fd6f4ec5"
  input:
    -
        name: Some title
        sellIn: 0
        quality: 2

Todo

todo.md

About

GenyTest is a powerful tool that automates the generation of test cases, aiming to achieve a perfect `1.0` mutation score.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages