The idea is that the nature of the tests we carry out are such that they are imperfect and the user should be able to take a look at each test case to make sure it is testing what it claims to be testing. If the user spots something a problematic test case (transformed sentence), they should be able to delete that row. This would happen before running the tests basically, and if they only notice it after running the tests, they should be able to come back to the test generation phase and make that edit.
The approach to do this is up for discussion - it could be either integrated in the save/load workflow or elsewhere