
Define Execution Run Method to Compute Machine Hashes With Step Size for BOLD #2392

Merged
27 commits merged into master from get-machine-hashes-with-step on Jul 11, 2024

Conversation

@rauljordan (Contributor):

Background

As part of Arbitrum BOLD, validators will need to ask a stateless execution server for a list of hashes produced by executing an Arbitrator machine with some configuration. Validators can specify the start index at which to load the machine, how many times to step over it, and the step size of its execution. For instance:

"I'm in a BOLD challenge, and I need the hashes of executing the machine for message number 70, starting at program counter 10, and stepping through it in steps of size 1M. Give me 1024 hashes from that computation."

This PR keeps the implementation simple and adds unit tests to check the invariants of the function. The function is unused until BOLD is merged into Nitro.
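The request shape described above can be sketched as a small Go program. Everything here (toyMachine, machineHashesWithStepSize, the toy hash) is illustrative stand-in code, not the PR's actual implementation:

```go
package main

import (
	"crypto/sha256"
	"encoding/binary"
	"fmt"
)

// toyMachine stands in for an Arbitrator machine: its "hash" here is just
// a digest of its current step counter. Names are illustrative, not Nitro's.
type toyMachine struct{ position uint64 }

func (m *toyMachine) Step(n uint64) { m.position += n }

func (m *toyMachine) Hash() [32]byte {
	var buf [8]byte
	binary.BigEndian.PutUint64(buf[:], m.position)
	return sha256.Sum256(buf[:])
}

// machineHashesWithStepSize mirrors the request described above: load the
// machine at startIndex, then take numIterations steps of stepSize opcodes,
// recording the machine hash at each position along the way.
func machineHashesWithStepSize(startIndex, stepSize, numIterations uint64) [][32]byte {
	m := &toyMachine{}
	m.Step(startIndex) // advance to the requested starting position
	hashes := make([][32]byte, 0, numIterations)
	for i := uint64(0); i < numIterations; i++ {
		hashes = append(hashes, m.Hash())
		m.Step(stepSize)
	}
	return hashes
}

func main() {
	// "Start at program counter 10, steps of size 1M, give me 4 hashes."
	hashes := machineHashesWithStepSize(10, 1_000_000, 4)
	fmt.Println(len(hashes)) // prints 4
}
```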

@cla-bot added the s label (automatically added by the CLA bot if the creator of a PR is registered as having signed the CLA) on Jun 13, 2024
4 review threads on validator/server_arb/execution_run.go (resolved)
@rauljordan (Contributor, Author):

Ready again @tsahee, thanks for the great review!

@rauljordan rauljordan requested a review from tsahee June 20, 2024 12:51
@rauljordan rauljordan requested a review from eljobe June 20, 2024 16:24
@eljobe (Member) left a comment:

This is probably mostly due to my ignorance of exactly why we're trying to do what we're trying to do. Nonetheless, I find myself with questions as I read the code, so I'm commenting on them.

@@ -126,6 +126,10 @@ func (r *mockExecRun) GetStepAt(position uint64) containers.PromiseInterface[*va
}, nil)
}

func (r *mockExecRun) GetMachineHashesWithStepSize(machineStartIndex, stepSize, numRequiredHashes uint64) containers.PromiseInterface[[]common.Hash] {
@eljobe (Member):

Is numRequiredHashes essentially numIterations?

That is, you're telling the machine to start at position x, and then perform y iterations of s opcodes per iteration.
The fact that this also produces y hashcodes (one for each iteration) is more related to the return value than the arguments. Right?

The reason I'm making a big deal of the naming, is that I actually was confused when I first read the signature. numRequiredHashes sounds like it might be independent from how far through the machine execution progresses. But, I don't think it actually is. Without a doc comment, or something explicit, it isn't clear that these "required hashes" are the machine hash after each iteration of size stepSize.

Also, "required" sort of makes me think that it's possible that the function will error out because it wasn't able to provide enough hashes, or that it should fill in the returned slice with some "filler" hash if the machine finishes before enough hashes are generated.

@rauljordan (Contributor, Author):

Great feedback! Renamed to numIterations.

Review thread on validator/server_arb/execution_run.go (resolved)
return machineHashes, nil
}

logInterval := requiredNumHashes / 20 // Log every 5% progress
@eljobe (Member):

Optional: Maybe this should be configurable? It seems like something we might want to be more noisy in the beginning, but then turn way down low later?

2 review threads on validator/server_arb/execution_run.go (resolved)
e := &executionRun{}
ctx := context.Background()

t.Run("basic argument checks", func(t *testing.T) {
@eljobe (Member):

Why are all these different test cases being run using t.Run instead of just being separate top-level test functions?

@rauljordan (Contributor, Author):

Personal preference here. I find it more expressive to write subcases in plain English instead of having to wrangle the Go function naming convention. It also groups related functionality in one place, but I'm happy to change it if the alternative is strongly preferred.

@tsahee (Contributor):

I looked in the CI and saw that it does display these runs separately and nicely.
The advantage of separate functions is that you can run one by itself, and our CI will re-execute every failed test once to see if it first failed due to system flakiness.
I think for short low-footprint tests like these using t.Run is fine, and for anything in system_test I would still prefer separate functions.

Review thread on validator/server_arb/execution_run_test.go (resolved)
}
}
})
t.Run("if finishes execution early, simply pads the remaining desired hashes with the machine finished hash", func(t *testing.T) {
@eljobe (Member):

Well, this answers my earlier question. Why is this the protocol? Why not just return fewer hashes if the machine finishes? Seems like it could reduce the size of the data passed back to the callers?

@rauljordan (Contributor, Author):

Agreed... will let the caller pad if they need to.
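The behavior agreed on here (stop hashing when the machine halts and return a shorter slice rather than padding with a filler hash) can be sketched as follows. haltingMachine and hashesUntilHalt are hypothetical stand-ins, not code from this PR:

```go
package main

import "fmt"

// haltingMachine halts once it reaches haltAt steps; a stand-in for an
// Arbitrator machine reaching its final state mid-run. Names are illustrative.
type haltingMachine struct{ position, haltAt uint64 }

func (m *haltingMachine) Step(n uint64) {
	m.position += n
	if m.position > m.haltAt {
		m.position = m.haltAt
	}
}
func (m *haltingMachine) IsRunning() bool { return m.position < m.haltAt }
func (m *haltingMachine) Hash() uint64    { return m.position * 31 } // toy hash

// hashesUntilHalt records one hash per iteration but stops early if the
// machine halts, leaving any padding decision to the caller.
func hashesUntilHalt(m *haltingMachine, stepSize, numIterations uint64) []uint64 {
	hashes := make([]uint64, 0, numIterations)
	for i := uint64(0); i < numIterations; i++ {
		hashes = append(hashes, m.Hash())
		if !m.IsRunning() {
			break // machine finished: return fewer hashes, no filler
		}
		m.Step(stepSize)
	}
	return hashes
}

func main() {
	m := &haltingMachine{haltAt: 25}
	// 8 iterations requested, but the machine halts at step 25, so only the
	// hashes at positions 0, 10, 20, and 25 are returned.
	fmt.Println(len(hashesUntilHalt(m, 10, 8))) // prints 4
}
```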

@rauljordan (Contributor, Author):

Ready again @eljobe, thanks for the review!

eljobe
eljobe previously approved these changes Jun 24, 2024
@eljobe (Member) left a comment:

LGTM

Review thread on validator/valnode/validation_api.go (resolved)
@rauljordan (Contributor, Author):

Hi @eljobe thanks again! Needs reapproval after the last naming change

@tsahee (Contributor) left a comment:

The only comment that I think of as blocking is creating a machine / execution run per internal test instead of sharing them.
Great work.

Review thread on system_tests/validation_mock_test.go (resolved)
if err := machine.Step(ctx, stepSize); err != nil {
return nil, fmt.Errorf("failed to step machine to position %d: %w", absoluteMachineIndex, err)
}
if i%logInterval == 0 || i == maxIterations-1 {
@tsahee (Contributor):

Not a must: I found that logging every X seconds (remembering when you last logged, and logging again once enough time has passed) is better than logging every X iterations.


5 review threads on validator/server_arb/execution_run_test.go (resolved)
@rauljordan rauljordan requested a review from tsahee July 8, 2024 15:20
@rauljordan (Contributor, Author):

Ready again @tsahee, thank you!

@tsahee (Contributor) left a comment:

LGTM

@tsahee tsahee enabled auto-merge July 10, 2024 23:51
@tsahee tsahee merged commit 95255eb into master Jul 11, 2024
12 checks passed
@tsahee tsahee deleted the get-machine-hashes-with-step branch July 11, 2024 00:16
Labels: design-approved, s (automatically added by the CLA bot if the creator of a PR is registered as having signed the CLA)
4 participants