Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add mutation testing #66

Merged
merged 15 commits into from
Feb 5, 2024
4 changes: 4 additions & 0 deletions .gitignore
Original file line number Diff line number Diff line change
Expand Up @@ -19,3 +19,7 @@ test/storage/*-old

# local deployment files
network.json

# Gambit (mutation test) files
gambit_out/
test-mutation/mutant_test_env/
140 changes: 140 additions & 0 deletions docs/mutation_testing.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,140 @@
# Mutation testing using Certora's Gambit framework

[Gambit tool](https://docs.certora.com/en/latest/docs/gambit/gambit.html) takes Solidity file or list of files as input and it will generate "mutants" as output. Those are copies of original file, but each with an atomic modification of original file, ie. flipped operator, mutated 'if' condition etc. Next step is running the test suite against the mutant. If all tests still pass, mutant survived, that's bad - either there's faulty test or there is missing coverage. If some test(s) fail, mutant's been killed, that's good.

## How to install Gambit
To build from source (assumes Rust and Cargo are installed), clone the Gambit repo
```
git clone [email protected]:Certora/gambit.git
```

Then run
```
cargo install --path .
```

Alternatively, prebuilt binaries can be used - more info [here](https://docs.certora.com/en/latest/docs/gambit/gambit.html#installation).


## Using Gambit
CLI command `gambit mutate` is used to generated mutants. It can take single file as an input, ie. let's say we want to generate mutants for file `L1ERC20Gateway.sol`:
```
gambit mutate --solc_remappings "@openzeppelin=node_modules/@openzeppelin" "@arbitrum=node_modules/@arbitrum" -f contracts/tokenbridge/ethereum/gateway/L1ERC20Gateway.sol
```

Command will output `gambit_out` folder containing list of mutants (modified Solidity files) and `gambit_results.json` which contains info about all the generated mutants. Another way to view the mutant info is to run:
```
gambit summary
```

Gambit can also take set of Solidity files as input in JSON format, ie:
```
gambit mutate --json test-mutation/config.json
```

List of all configuration options for the JSON file can be found [here](https://docs.certora.com/en/latest/docs/gambit/gambit.html#configuration-files).

Gambit only generates the mutants, it will not execute any tests. To check if mutant gets killed or survives, we need to copy modified Solidity file over the original one, re-compile the project and re-run the test suite. This process has been automated and described in next chapter.

## Gambit integration with Foundry
To get the most benefits from Gambit, we have integrated it with Foundry and automated testing and reporting process. This is how `gambitTester` script works:
- generates mutants for files specified in test-mutation/config.json
- for each mutant do in parallel (in batches):
- replace original file with mutant
- re-compile and run foundry suite
- track results
- report results

Here are practical steps to run mutation test over the set of Arbitrum token bridge contracts. First we need to update `test-mutation/config.json` with the list of solidity files to be tested. In this case we use config file that was prepared in advance:
```
cp test-mutation/all-configs/config.tokenbridge-ethereum.json test-mutation/config.json
```

Now run the script, it will generate mutants and start compiling/testing them in parallel:
```
yarn run test:mutation
```

It will take some time for script to execute (depends on the underlying HW).

Script output looks like this:
```
❯ yarn run test:mutation

yarn run v1.22.19
$ ts-node test-mutation/gambitTester.ts
====== Generating mutants
Generated 209 mutants in gambit_out/

====== Test mutants
Testing mutant batch 0..7
Testing mutant batch 7..14
Testing mutant batch 14..21
Testing mutant batch 21..28
Testing mutant batch 28..35
Testing mutant batch 35..42
Testing mutant batch 42..49
Testing mutant batch 49..56
Testing mutant batch 56..63
Testing mutant batch 63..70
Testing mutant batch 70..77
Testing mutant batch 77..84
Testing mutant batch 84..91
Testing mutant batch 91..98
Testing mutant batch 98..105
Testing mutant batch 105..112
Testing mutant batch 112..119
Testing mutant batch 119..126
Testing mutant batch 126..133
Testing mutant batch 133..140
Testing mutant batch 140..147
Testing mutant batch 147..154
Testing mutant batch 154..161
Testing mutant batch 161..168
Testing mutant batch 168..175
Testing mutant batch 175..182
Testing mutant batch 182..189
Testing mutant batch 189..196
Testing mutant batch 196..203
Testing mutant batch 203..210

====== Results

Mutant ID | File Name | Status
----------------------------------------------
----------------------------------------------
1 | L1ArbitrumMessenger.sol | SURVIVED
2 | L1ArbitrumMessenger.sol | SURVIVED
3 | L1ArbitrumMessenger.sol | KILLED
----------------------------------------------
4 | L1ArbitrumExtendedGateway.sol | KILLED
5 | L1ArbitrumExtendedGateway.sol | KILLED
6 | L1ArbitrumExtendedGateway.sol | KILLED
7 | L1ArbitrumExtendedGateway.sol | KILLED
8 | L1ArbitrumExtendedGateway.sol | KILLED
9 | L1ArbitrumExtendedGateway.sol | KILLED
10 | L1ArbitrumExtendedGateway.sol | KILLED
11 | L1ArbitrumExtendedGateway.sol | KILLED

...

205 | L1WethGateway.sol | SURVIVED
206 | L1WethGateway.sol | SURVIVED
207 | L1WethGateway.sol | SURVIVED
208 | L1WethGateway.sol | SURVIVED
209 | L1WethGateway.sol | SURVIVED
----------------------------------------------
Total Mutants: 209
Killed: 133 (63.64%)
Survived: 76 (36.36%)

====== Done in 20.91 min
```

We're insterested to analyze the mutants which survived. The 1st column in output, `Mutant ID`, can be used to find the exact mutation that was applied by looking into the matching entry in the `gambit_results.json` file.

## Other considerations
Mutation testing script is time-intensive due to all the re-compiling work. For that reason, list of input files should be optimized to give the most benefit for the limited time period available for testing. Ie. single Solidity files can be targeted and tested manually, and the broader scope of files can be tested overnight.

Other params that can be adjusted are type of mutations to use, number of mutants to be generated, randomness seed to use in generation, etc.

3 changes: 3 additions & 0 deletions package.json
Original file line number Diff line number Diff line change
Expand Up @@ -20,6 +20,7 @@
"test:unit": "forge test",
"test:e2e:local-env": "yarn hardhat test test-e2e/*",
"test:storage": "./scripts/storage_layout_test.bash",
"test:mutation": "ts-node test-mutation/gambitTester.ts",
"deploy:local:token-bridge": "ts-node ./scripts/local-deployment/deployCreatorAndCreateTokenBridge.ts",
"deploy:token-bridge-creator": "ts-node ./scripts/deployment/deployTokenBridgeCreator.ts",
"create:token-bridge": "ts-node ./scripts/deployment/createTokenBridge.ts",
Expand Down Expand Up @@ -51,6 +52,7 @@
"@typechain/ethers-v5": "^7.0.1",
"@typechain/hardhat": "^2.3.0",
"@types/chai": "^4.2.15",
"@types/fs-extra": "^11.0.4",
"@types/mocha": "^9.0.0",
"@types/node": "^14.14.28",
"@types/prompts": "^2.0.14",
Expand All @@ -66,6 +68,7 @@
"eslint-plugin-prettier": "^4.0.0",
"ethereum-waffle": "^3.2.0",
"ethers": "^5.4.5",
"fs-extra": "^11.2.0",
"hardhat": "2.17.3",
"hardhat-contract-sizer": "^2.10.0",
"hardhat-deploy": "^0.9.1",
Expand Down
5 changes: 2 additions & 3 deletions remappings.txt
Original file line number Diff line number Diff line change
Expand Up @@ -2,6 +2,5 @@ ds-test/=lib/forge-std/lib/ds-test/src/
forge-std/=lib/forge-std/src/
@openzeppelin/contracts-upgradeable/=node_modules/@openzeppelin/contracts-upgradeable/
@openzeppelin/contracts/=node_modules/@openzeppelin/contracts/



@arbitrum=node_modules/@arbitrum
@offchainlabs=node_modules/@offchainlabs
10 changes: 10 additions & 0 deletions test-mutation/all-configs/config.single.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,10 @@
[
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1ERC20Gateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
}
]
106 changes: 106 additions & 0 deletions test-mutation/all-configs/config.tokenbridge-ethereum.json
Original file line number Diff line number Diff line change
@@ -0,0 +1,106 @@
[
{
"filename": "../contracts/tokenbridge/ethereum/L1ArbitrumMessenger.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1ArbitrumExtendedGateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1ArbitrumGateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1CustomGateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1ERC20Gateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1ForceOnlyReverseCustomGateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1GatewayRouter.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1OrbitCustomGateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1OrbitERC20Gateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1OrbitGatewayRouter.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1OrbitReverseCustomGateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1ReverseCustomGateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
},
{
"filename": "../contracts/tokenbridge/ethereum/gateway/L1WethGateway.sol",
"sourceroot": "..",
"solc_remappings": [
"@openzeppelin=../node_modules/@openzeppelin",
"@arbitrum=../node_modules/@arbitrum"
]
}
]
Loading
Loading