[EXPERIMENTAL] Syntactic and Semantic Mutations #3

FedeLoch · 2024-09-24T10:03:48Z

Syntactic and Semantic Mutations

After analysing some ideas for constraint mutations against how the Concolic algorithm works, we realized there is a high risk in mutating the path currently being explored. Our current Concolic algorithm works iteratively: it takes the previously explored path and its constraints, solves those constraints, and the solution leads to the next path to explore. If we mutate the current path, the next one could be corrupted by our mutation.

With this impact in mind, we decided to take another approach. We propose to let the Concolic algorithm complete its exploration, finding and solving all the paths, and to mutate those paths only just before using them to test the compiler.
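The explore-first, mutate-later idea above can be sketched as follows. This is a minimal Python illustration, not the project's Pharo code: the `Constraint` tuple shape, the `explore_then_mutate` function, and the `mutate` callback are all hypothetical names introduced here.

```python
from typing import Callable, Iterable, List, Tuple

# Hypothetical shape of a constraint: (variable, operator, constant).
Constraint = Tuple[str, str, int]
Path = List[Constraint]


def explore_then_mutate(
    explored_paths: Iterable[Path],
    mutate: Callable[[Path], List[Path]],
) -> List[Path]:
    """Return every explored path followed by its mutants.

    The exploration result is only read, never modified, so the iterative
    exploration that produced it cannot be corrupted by a mutation.
    """
    test_inputs: List[Path] = []
    for path in explored_paths:
        test_inputs.append(list(path))           # keep the original path
        test_inputs.extend(mutate(list(path)))   # mutants are built from a copy
    return test_inputs


# Example: one explored path and a mutator that rewrites "a > 3" as "a >= 4".
paths = [[("a", ">", 3)]]
inputs = explore_then_mutate(paths, lambda p: [[("a", ">=", 4)]])
```

Because mutation happens after exploration has finished, `paths` is left untouched and each mutant is simply an extra test input for the compiler.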

The problem here is that our mutations must be made carefully. If we want to ensure the same behaviour between the interpreter and our compiler, we need to design and implement mutations that don't change the semantics of the explored path.

This approach led us to consider two possible scenarios. In the first, we want to check whether the compiler's behaviour differs from that of the bytecode interpreter, so the mutations must not change the behaviour of the explored path. In the second, we deliberately mutate the semantics of the generated paths and observe how often the compiler's behaviour differs from that of our bytecode interpreter.

Methodology/Workflow

We implemented two methods:

RABytecodeAutoTest solutionsFor: aBytecode (returns all the possible solutions for a given bytecode)
RAPrimitiveAutoTest solutionsFor: aPrimitive (returns all the possible solutions for a given primitive)

With these two methods, we intend to capture all the solutions for a specific bytecode or primitive. This lets us understand the constraint paths commonly generated for a given bytecode or primitive and use them to design mutations.

Our main idea is to add constraint mutations that force the constraint solver to explore new values for the variables involved, either keeping the semantics to avoid inconsistencies or changing them when we want to confirm behaviour differences.

Compiler's Coverage

Part of our goal is to guarantee that we are testing the semantic equivalence between our interpreter and the compiler. We also want to make sure that these new path mutations increase code coverage.

Before our mutation experiments, the Concolic exploration covered 13% of the compiler code using RABytecodeAutoTest.

Using RAPrimitiveAutoTest, it covered 18%.

Running both test suites, we get 22% compiler coverage.

This means that we need another way to explore more compiler paths.

Syntactic Mutations

The idea of these mutations is to add, remove, or update constraints in the current path while keeping its semantics. We mutate the constraints in the path expecting to see differences that we are not able to see with the deterministic algorithm.

We start with simple, semantically equivalent EDITION constraints (the n+1 and n-1 rewrites hold for integer-valued variables):

Var(A) > n -> Update to Var(A) >= n+1
operand1 OP operand2 -> Update to operand2 OP operand1 [ as long as OP is commutative ]
Var(A) < n -> Update to Var(A) <= n-1
n < Var(A) -> Update to Var(A) >= n+1
Var(A) >= n -> Update to Var(A) = n OR Var(A) > n
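The strict-to-non-strict rewrites in this list can be checked by brute force. The sketch below is a Python illustration under stated assumptions: a constraint `Var(A) OP n` is modelled as a pair `(OP, n)`, and `edit_constraint`, `holds`, and `equivalent_on_integers` are hypothetical helper names, not part of the project's code.

```python
from typing import Callable, Dict, Tuple

# A constraint "Var(A) OP n" is modelled as the pair (OP, n).
Constraint = Tuple[str, int]

OPS: Dict[str, Callable[[float, float], bool]] = {
    ">": lambda a, n: a > n,
    ">=": lambda a, n: a >= n,
    "<": lambda a, n: a < n,
    "<=": lambda a, n: a <= n,
}


def edit_constraint(constraint: Constraint) -> Constraint:
    """Apply the strict-to-non-strict rewrites; leave other operators as is."""
    op, n = constraint
    if op == ">":
        return (">=", n + 1)
    if op == "<":
        return ("<=", n - 1)
    return constraint


def holds(constraint: Constraint, a: float) -> bool:
    """Evaluate the constraint for a concrete value of Var(A)."""
    op, n = constraint
    return OPS[op](a, n)


def equivalent_on_integers(before: Constraint, after: Constraint,
                           lo: int = -50, hi: int = 50) -> bool:
    """Check that both constraints agree for every integer in [lo, hi]."""
    return all(holds(before, a) == holds(after, a) for a in range(lo, hi + 1))


original = (">", 3)
mutant = edit_constraint(original)
print(mutant, equivalent_on_integers(original, mutant))
print(holds(original, 3.5), holds(mutant, 3.5))
```

The first line printed shows that the mutant agrees with the original on every integer tested. The second shows that they disagree at 3.5, so these rewrites keep the semantics only when the variable is integer-valued.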

To force the constraint solver to assign more interesting values to our variables, we propose the following INSERTION constraints as mutations:

isX(A) -> Add isNotY(A), for all Y != X
Var(A) >= n -> Add Var(B) such that Var(A) + Var(B) > Var(B) + n
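The two insertion rules can be sketched as follows. This is a Python illustration under assumptions: the `TYPE_TAGS` set is invented for the example (the real set of type predicates is not given here), and both function names are hypothetical.

```python
from itertools import product
from typing import List, Sequence

# Hypothetical set of type tags; the real predicates may differ.
TYPE_TAGS: Sequence[str] = ("Int", "Float", "Object")


def insert_negated_type_checks(var: str, tag: str,
                               tags: Sequence[str] = TYPE_TAGS) -> List[str]:
    """For isX(var), add isNotY(var) for every other tag Y."""
    added = [f"isNot{other}({var})" for other in tags if other != tag]
    return [f"is{tag}({var})"] + added


def sum_insertion_stays_on_path(lo: int = -5, hi: int = 5) -> bool:
    """Check that a + b > b + n implies a >= n for all small integers."""
    values = range(lo, hi + 1)
    return all(a >= n
               for a, b, n in product(values, repeat=3)
               if a + b > b + n)


print(insert_negated_type_checks("A", "Int"))
print(sum_insertion_stays_on_path())
```

One point worth noting about the second rule: over unbounded integers, the inserted inequality is strict, so it reduces to Var(A) > n. Every solution of the mutated path still satisfies the original Var(A) >= n, but the boundary value Var(A) = n is no longer reachable through the mutant.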

Considering randomness

IsNotInt(A) -> Add A != rand(-inf, +inf)
Var(A) AOP n -> Add Var(A) + X AOP n + X, where X = Random(-inf, +inf)
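The random-shift rule can be sketched as follows. This Python illustration makes several assumptions: AOP is read as a comparison operator, the unbounded range (-inf, +inf) is replaced by a finite `bound` because a random integer cannot be drawn from an unbounded range, and `shift_constraint` and `holds` are hypothetical names.

```python
import operator
import random
from typing import Callable, Dict, Tuple

COMPARISONS: Dict[str, Callable[[int, int], bool]] = {
    "<": operator.lt,
    "<=": operator.le,
    ">": operator.gt,
    ">=": operator.ge,
    "==": operator.eq,
    "!=": operator.ne,
}


def shift_constraint(op: str, n: int, rng: random.Random,
                     bound: int = 1000) -> Tuple[str, int, int]:
    """Return (op, n + x, x), read as "Var(A) + x OP n + x".

    x is drawn from [-bound, bound]; the finite bound is an assumption.
    """
    x = rng.randint(-bound, bound)
    return op, n + x, x


def holds(op: str, left: int, right: int) -> bool:
    """Evaluate "left OP right" for concrete integers."""
    return COMPARISONS[op](left, right)


rng = random.Random(0)  # seeded so that a run can be reproduced
op, shifted_n, x = shift_constraint(">=", 7, rng)
same = all(holds(">=", a, 7) == holds(op, a + x, shifted_n)
           for a in range(-20, 21))
print(same)
```

Python integers are unbounded, so the shifted constraint always agrees with the original here. With fixed-width machine integers, adding X could overflow, so a real implementation would probably need to bound X accordingly.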

The syntactic mutations added are:

// TODO

The semantic mutations added are:

// TODO

Design decisions

// TODO

Tests and observations

// TODO

Conclusions

// TODO


FedeLoch added 5 commits September 24, 2024 12:02

SyntacticMutation model

83a29aa

Fixing inject into performace problem

d5c431e

RAPathMutatorTest class

9893db1

Empty test cases to complete

52f93b5

Fixing wrong predecnce and replacing method call to use the mutations one

fcac41b

FedeLoch changed the title ~~[EXPERIMENTAL] SyntacticMutation model~~ [EXPERIMENTAL] Syntactic and Semantic Mutations Sep 26, 2024

Forcing tests to fail

d479544

FedeLoch requested a review from guillep September 26, 2024 14:35

FedeLoch self-assigned this Sep 26, 2024

FedeLoch added 21 commits September 27, 2024 10:05

Fixing primiteTest

700a5c9

Creating RAPrimitiveAutoTest solutionsFor

9905cb4

Fixing primitive cases

2ee2696

ToString method to faclitate the readibility on solutions and compare them to next analusis

f3696ec

Deleting unnecesary reimplementation

5bb760b

Tunning printing

6596673

CompilerCoverageCollector class

5b1e891

Deleting all mutations and adding some tests fto RASyntacticVarGreaterThanMutatorTest

900e607

Testing

84ed811

Commutative constraint mutator

dc05880

Less than mutator

71f7e8f

Fix error

3be2d3c

n less than other constraint mutation

623bf71

Var(A) >= n -> ADD Var(A) = n OR Var(A) > n Mutation

a30d454

adding RASyntacticNLessThanVarMutator to mutator

3c9c486

Fixing bug

d9b1458

Fixing error

9225ed0

Considering bitAnd and bitOr

be1cc6b

Using exploreAndMutate

0715ac2

Implementing mutation recursively, in order to generate a much richer set of mutations

159f8ed

Identation

04b0822

FedeLoch and others added 30 commits December 17, 2024 10:47

Fixing error

fabe34a

Merge 6e1e059

9d15f3a

Permissions +x

ae2b155

printing size

3dba734

Merge ae2b155

1bde845

Fix bug

50ba906

indentation

939c16b

Sorting the code, prepare refactoring

f9052fa

Fix path toString

4e085bb

Merge f9052fa

9b61099

fix printing path bug

0e15e63

second experiment

9ac4eef

Caching

1e8845d

Merge 9ac4eef

6f76c79

Add caching

a912432

Prefix all files by type

d703835

Fix typo

d08c930

do not keep full stack trace

7ff3903

cleaning test after execution

a83ecd0

Do not map stack traces at serialization time, already done at build time

1b84247

print debug stack instead of short stack

9ec8d18

Print the case and not the test object

6c21c8a

Fix

ae9e907

fix error

f3fa8c2

new experiments

556b604

change experiment's name

aaeb0f0

maximumReachableCoverageWithoutMutations

c4dbb37

Add deserializer

2363813

diff support

acb8568

Bigger differ

698aac5
