feat: execution traces #199

karmacoma-eth · 2023-09-28T01:08:57Z

(draft, will need to rebase and fix ci probably)

the traces in test_symbolic_create() now look like this: Traces: this_address::Concat(2987291460, p_x_uint256) new contract @ 0xaaaa0002::0x60a060405234801561001057600080fd5b506040516101073803806101078339818101604052810190610032919061008c565b602a8103610043576100426100b9565b5b8060808181525050506100e8565b600080fd5b6000819050919050565b61006981610056565b811461007457600080fd5b50565b60008151905061008681610060565b92915050565b6000602082840312156100a2576100a1610051565b5b60006100b084828501610077565b91505092915050565b7f4e487b7100000000000000000000000000000000000000000000000000000000600052600160045260246000fd5b60805160096100fe6000396000505060096000f3fe6080604052600080fd(p_x_uint256) ← 6080604052600080fd ← 0000000000000000000000000000000000000000000000000000000000000001 Traces: this_address::Concat(2987291460, p_x_uint256) new contract @ 0xaaaa0002::0x60a060405234801561001057600080fd5b506040516101073803806101078339818101604052810190610032919061008c565b602a8103610043576100426100b9565b5b8060808181525050506100e8565b600080fd5b6000819050919050565b61006981610056565b811461007457600080fd5b50565b60008151905061008681610060565b92915050565b6000602082840312156100a2576100a1610051565b5b60006100b084828501610077565b91505092915050565b7f4e487b7100000000000000000000000000000000000000000000000000000000600052600160045260246000fd5b60805160096100fe6000396000505060096000f3fe6080604052600080fd(p_x_uint256) ← 4e487b710000000000000000000000000000000000000000000000000000000000000001 (error: <class 'halmos.exceptions.Revert'>) ← 0000000000000000000000000000000000000000000000000000000000000000

and call_frame to context

+ return_scheme in message output + new color scheme

Every test has a revert case where value > 0, before splitting and actually executing the test logic with value == 0

EvmExceptions: halt the current context, but execution continues normally HalmosExceptions: halt the current path, but the test continues normally + add stuck() method to CallContext (returns internal errors encountered by itself or any of its subcalls) + better trace rendering for stuck executions + change subcalls() and logs() to be iterators instead of lists

fixes test_deploy_symbolic_bytecode

daejunpark · 2023-09-29T00:14:17Z

btw, not sure why the ci tests aren't triggered. would it be because of the conflicts?

src/halmos/sevm.py

daejunpark · 2023-09-29T20:09:59Z

src/halmos/sevm.py

+                            f"Unsupported console function: selector = 0x{funsig:0>8x}, "
+                            f"calldata = {hexify(arg)}"
+                        )
+                    )


no HalmosException here?

no we should continue execution, we don't really care if a console call is missing IMO

daejunpark · 2023-09-29T20:28:01Z

src/halmos/sevm.py


-            ex.calls.append((exit_code_var, exit_code, ex.output))
+            # TODO: check if still needed
+            ex.calls.append((exit_code_var, exit_code, ex.context.output.data))


currently call_id is generated using calls: call_id = len(ex.calls). but perhaps we can consider removing them in a separate pr.

src/halmos/sevm.py

daejunpark · 2023-09-29T20:52:38Z

src/halmos/sevm.py

                                    offset : offset + 32
                                ]
                            )
                        )
                elif opcode == EVM.CALLDATASIZE:
-                    if ex.calldata is None:
+                    calldata = None if ex.message().is_create() else ex.message().data
+                    # TODO: is optional calldata necessary?


if i remember correctly, we discussed this before, and concluded that we require calldata to be given explicitly. but let's deal with this in a separate pr.

src/halmos/sevm.py

daejunpark · 2023-09-29T21:01:02Z

src/halmos/sevm.py

@@ -2464,8 +2634,15 @@ def run(self, ex0: Exec) -> Tuple[List[Exec], Steps]:
                    offset: int = int_of(ex.st.pop(), "symbolic RETURNDATACOPY offset")
                    # size (in bytes)
                    size: int = int_of(ex.st.pop(), "symbolic RETURNDATACOPY size")
+
+                    # TODO: do we need to pass returndatasize here?


yes, RETURNDATACOPY doesn't zero-pad, but revert for out-of-bounds access.
https://github.com/ethereum/execution-specs/blob/master/src/ethereum/shanghai/vm/instructions/environment.py#L444-L447

note that all the other copy opcodes (CODECOPY, EXTCODECOPY, CALLDATALOAD, CALLDATACOPY) do zero-pad, lol

let's comment this non-intuitive behavior here for future contributors.

ah ok I didn't realize this was the purpose. Made some related improvements in 623a464

src/halmos/sevm.py

daejunpark · 2023-09-29T21:15:44Z

src/halmos/sevm.py

    # vm state
+    this: Address  # current account address


do we still need to keep this separately? it should be always equal to context.message.target, right?

if keeping this is only for convenient purpose, we can remove this, and write a helper function this() like caller(). this way, we can avoid making inconsistency by mistake in the future.

src/halmos/sevm.py

giving explicit contract and test function names makes it easier to invoke a specific test from the command line

also: - changes the system recursion limit so that we can in fact process up to 1024 messages deep - introduces a MAX_CALL_DEPTH global constant in halmos.sevm - sevm.run() now checks the current execution context and throws if it is over MAX_CALL_DEPTH - splits CallContext.stuck() into: - CallContext.is_stuck(), O(1) check that returns a bool - CallContext.get_stuck_reason(), O(n) traversal to fetch the internal exception - we use None as a sentinel value for context.output.data, meaning that if execution finishes with no output (as opposed to empty output), then it encountered an error and is stuck. As a result, we must be careful to: - not set context.output.data=None on normal halting and EvmException (hence the change to mloc() to return empty bytes instead of None) - set context.output.data=None on HalmosException (and really any other unexpected issue) TODO: - processing calls 1024 is quite slow (20+s), will investigate and fix later

daejunpark

did a second pass of review, and found a couple more semantics issues.

i think the outstanding semantics issues need to be fixed before merging this pr. the performance issues could be handled in a separate pr, though.

in the meantime, today, i will write more tests for semantics correctness, and include them in this pr. that might also help for fixing the merge conflicts with the current main.

daejunpark · 2023-10-02T18:11:52Z

src/halmos/__main__.py

+    hevm_fail = isinstance(context.output.error, FailCheatcode)
+    return hevm_fail or any(is_global_fail_set(x) for x in context.subcalls())
+
+


todo: evaluate performance and make it O(1) if needed.

daejunpark · 2023-10-02T18:24:32Z

src/halmos/__main__.py

            execs_to_model.append((idx, ex))
-        else:
-            stuck.append((opcode, idx, ex))
+            continue


since is_global_fail_set is not O(1), we may want to reorg this if-statements sequence, so that it isn't executed for not error or non-assert-fail reverts at all. or, we can just make it O(1).

not necessarily to be done in this pr, but at least commenting it as todo.

src/halmos/sevm.py

daejunpark · 2023-10-02T19:17:57Z

src/halmos/sevm.py

+
+            # add to worklist
+            new_ex.next_pc()
+            stack.append((new_ex, step_id))


todo: now that the create failure semantics is supported, this for-loop can be made consistent with the one for call_known(). further, their common logic can be factored out to avoid duplication.

agreed, do you mind if we do a cleanups as a subsequent PR though in the interest of getting this merged faster?

daejunpark · 2023-10-02T19:22:27Z

src/halmos/sevm.py

-                    "max_width" in self.options
-                    and len(out) >= self.options["max_width"]
-                ):
+                if len(out) >= self.options.get("max_width", 2**64):


2**64 is nice. can we just have it as the default value of max_width, so that it can be also displayed in the --help message?

- type annotations - missing assert - remove unused output field from Exec - in calldata(): return empty list instead of None of create calls

daejunpark · 2023-10-03T06:38:46Z

src/halmos/sevm.py

+            target=new_addr,
+            caller=caller,
+            value=value,
+            data=create_hexcode,


why is the calldata set to the creation code here?

according to the evm spec, calldata should be set to empty:
https://github.com/ethereum/execution-specs/blob/master/src/ethereum/shanghai/vm/instructions/system.py#L116-L117

if it's only for the tracing purpose, i think it is better to create a new field for its own, rather than repurposing the calldata field. i know that there is the calldata() helper function, but given that the calldata is crucial for correctness, i'd like to avoid any potential mistakes caused by this discrepancy in the future.

here is how I ended up this design. We need to represent either a regular message (has a target and calldata but no code) or a create message (technically has no target address and no calldata but has code)

a. we could represent them both in the same class using disjoint fields (ie. separate code and calldata fields)
b. we could represent them both in the same class with overloaded fields
c. we could represent them using separate classes like MessageCall and CreateCall

I went with (b.) because the semantic difference is pretty small and easily handled with a helper function to know if message.data corresponds to code or calldata. I similarly "hijacked" the target field because it's convenient to track the address being executed in that context.

I'm not a fan of option (a.) because the semantics of that union object is weird and you could end up in an inconsistent state (having both code and calldata), I think if we're going to refactor going with option (c.) would be cleaner:

it would be very clear at creation time if you're making a MessageCall or CreateCall

checking if something is a message call or create call is done simply via a type check rather than a helper function

they could easily be rendered/printed/traced differently

it's impossible to end up in an inconsistent state

ok, i think we need more time to discuss this further. let's keep this as it is for the scope of this pr.

daejunpark · 2023-10-03T07:39:56Z

in the meantime, today, i will write more tests for semantics correctness, and include them in this pr. that might also help for fixing the merge conflicts with the current main.

i added new comprehensive tests for the returndata and calldata manipulation, and reverting behaviors: 2fa17b4

- returndata is reset after successful contract creation - reading out of bounds of returndata is a regular EvmException - revert state changes after failed contract creation - catch internal errors arbitrarily deep after contract creation, not just one call deep

This reverts commit ffb708b.

karmacoma-eth added 30 commits August 23, 2023 14:37

fix: don't abort on unsupported console functions

c812c60

chore: add black commit to .git-blame-ignore-revs

2448961

WIP: calldata refactor

5297c36

WIP: message calls and event logs abstraction

23618cb

[WIP] basic trace render

7f3f193

WIP refactor messages and call frames

1346eef

WIP add unit tests for traces

55c8daa

WIP fix event test

dbf6493

add simple subcall tests

5d9b63d

fix: throw when running SSTORE in a static context

54527b4

Add support for symbolic subcalls

1c1f4d9

add log tests + revert when executing LOGx in static context

a7a12f8

add defensive check in Contract constructor

fdddbba

refactor: rename CallFrame to CallContext

c7774c1

and call_frame to context

fix: wire context output correctly

c594dac

display traces at different verbosity levels

f2de9b3

fix: display "0x" for empty output

013304a

fix handling of vm.fail() cheatcode

fd91dbf

record call scheme and fix static context propagation

881eb6f

tweak trace output

d119520

add virtual subcontext to trace for failed CREATE2

2f3be31

+ return_scheme in message output + new color scheme

add static contexts test

08bf7bd

add virtual trace element for cheatcodes

f711276

Call setUp() and test functions with 0 value

70b7e14

Every test has a revert case where value > 0, before splitting and actually executing the test logic with value == 0

fix: abort current path on HalmosException in create subcalls

e88166f

fixes test_deploy_symbolic_bytecode

fix: exit with error code when setUp() fails

eadd615

tests: add StaticContextsTest to all.json

6772f4d

tests: update create2 and getter tests

e1321b2

daejunpark reviewed Sep 29, 2023

View reviewed changes

src/halmos/sevm.py Show resolved Hide resolved

daejunpark reviewed Sep 29, 2023

View reviewed changes

karmacoma-eth added 2 commits September 29, 2023 15:08

rename CTest to AssertTest

a94a833

giving explicit contract and test function names makes it easier to invoke a specific test from the command line

daejunpark reviewed Oct 2, 2023

View reviewed changes

karmacoma-eth added 2 commits October 2, 2023 16:33

address PR feedback

0cd1847

- type annotations - missing assert - remove unused output field from Exec - in calldata(): return empty list instead of None of create calls

improve support for out of bounds index slicing

623a464

daejunpark reviewed Oct 3, 2023

View reviewed changes

test: add context test

2fa17b4

karmacoma-eth and others added 15 commits October 3, 2023 18:21

minor cosmetic changes

bba3fe8

Merge branch 'main' into exec-traces

0846368

fix merge aftermath

2146d2a

fix imports

54d33b6

temporary fix: ignore test_traces.py for now

8de72a0

fix: submodule merge conflicts

91fc32e

test: update expected

b13a785

feat: print setup failure traces for only -vv or higher

eef7175

test: rename test file

bb136a3

test: update expected

1eeb604

test: fix forge remappings ds-test for windows

09ae40f

tmp: ci debugging

ffb708b

Revert "tmp: ci debugging"

b10c904

This reverts commit ffb708b.

tmp fix: BrokenProcessPool with reused thread pools

88f6cfa

daejunpark approved these changes Oct 9, 2023

View reviewed changes

karmacoma-eth merged commit d5be133 into main Oct 10, 2023
109 checks passed

karmacoma-eth deleted the exec-traces branch October 10, 2023 00:31

daejunpark mentioned this pull request Oct 18, 2023

missing CREATE failure semantics #145

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: execution traces #199

feat: execution traces #199

karmacoma-eth commented Sep 28, 2023

daejunpark commented Sep 29, 2023

daejunpark Sep 29, 2023

karmacoma-eth Sep 29, 2023

daejunpark Sep 29, 2023

daejunpark Sep 29, 2023

daejunpark Sep 29, 2023 •

edited

Loading

karmacoma-eth Oct 3, 2023

daejunpark Sep 29, 2023

daejunpark left a comment

daejunpark Oct 2, 2023

daejunpark Oct 2, 2023

daejunpark Oct 2, 2023

karmacoma-eth Oct 4, 2023

daejunpark Oct 2, 2023

daejunpark Oct 3, 2023

karmacoma-eth Oct 4, 2023

daejunpark Oct 4, 2023

daejunpark commented Oct 3, 2023 •

edited

Loading

		hevm_fail = isinstance(context.output.error, FailCheatcode)
		return hevm_fail or any(is_global_fail_set(x) for x in context.subcalls())

feat: execution traces #199

feat: execution traces #199

Conversation

karmacoma-eth commented Sep 28, 2023

daejunpark commented Sep 29, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

daejunpark Sep 29, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

daejunpark left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

daejunpark commented Oct 3, 2023 • edited Loading

daejunpark Sep 29, 2023 •

edited

Loading

daejunpark commented Oct 3, 2023 •

edited

Loading