RISCV bootloader types #1935

pacheco · 2024-10-22T01:07:48Z

Refactor to support multiple implementations of the bootloader and executor.
It introduces a BootloaderImpl, with types/functions that must be given by the concrete bootloader implementation (large/small).
It also creates a BootloaderInputs type to encapsulate all the indexing logic for manipulating bootloader inputs.

adds a BootloaderImpl trait that should be implemented by the small/large field variants of the bootloader

pacheco · 2024-10-22T01:17:23Z

I'm open for suggestions on the design here.
My idea was to extract the "generic" parts into the trait, so as to avoid code duplication, since the code in these parts is already quite complex/fragile.

pacheco · 2024-10-22T12:09:54Z

riscv-executor/src/large_field/mod.rs

+//!
+//! TODO: perform determinism verification for each instruction independently
+//! from execution.
+


Most of the code here didn't change, was just moved to the large_field module.
There's one small change ill point out.

pacheco · 2024-10-22T12:16:55Z

riscv-executor/src/large_field/mod.rs

+}
+
+impl<F: FieldElement> RegisterMemory<F> {
+    pub fn for_bootloader(&self) -> HashMap<u32, Vec<F>> {


The return type here was HashMap<u32, F>. It became Vec<F> because the small_field machine needs two F for a register value.
Ideally, this would be u32 as is the MemoryState, but the large_field machine actually uses the full F for register values during its execution (e.g., for to_signed).

Wouldn't be possible to use an array instead of a Vec (the length can be a generic parameter)? Feels better that way.

I couldn't use the BootloaderImpl trait because it makes the crate dependencies circular, but maybe just a const parameter might work... ill try

pacheco · 2024-10-22T12:22:50Z

riscv-executor/src/lib.rs

 }

 pub type MemoryState = HashMap<u32, u32>;
-pub type RegisterMemoryState<F> = HashMap<u32, F>;
+/// Value of registers is Vec<F> to unify the output for different field sizes
+pub type RegisterMemoryState<F> = HashMap<u32, Vec<F>>;


see my comment above

see my response above

pacheco · 2024-10-22T12:57:57Z

riscv/src/continuations/bootloader.rs

+/// This trait provides all the field specific types and implementations that the bootloader needs.
+///
+/// For now, this trait is implemented directly by each `FieldElement` type, because the hash functions (i.e., poseidon) are field-specific.
+pub(crate) trait BootloaderImpl<F> {


main reason for this trait here are the Page and Hash types, which are just arrays of F but with different sizes for the small/large machines.
These are used in the merkle tree and bootloader code, and this is the way I found to make it generic.

pacheco · 2024-10-22T13:56:37Z

riscv/src/large_field/code_gen.rs

@@ -197,7 +197,7 @@ machine Main with min_degree: {}, max_degree: {} {{

 {}

-let initial_memory: (fe, fe)[] = [
+let initial_memory: (int, int)[] = [


this needs to be (int, int) because we cant represent u32 values using fe in small field

pacheco · 2024-10-22T13:58:09Z

@lvella

riscv/src/continuations/bootloader.rs

riscv/src/continuations.rs

riscv/src/small_field/bootloader.rs

leonardoalt · 2024-10-22T16:19:32Z

riscv/src/small_field/bootloader.rs

+
+    fn update_page(page: &mut Self::Page, idx: usize, word: u32) {
+        let (hi, lo) = split_word(word);
+        // TODO: check proper endianess here!


I think RISCV memory is [low, high], so this should fail

yes, riscv is little endian, but i was not sure here (we use hi,lo in our interface to the memory machine), so I left the todo

actually yea, it's tricky. We use big-endian in the machine, but Rust uses little endian when representing u64 as 2 words in RISCV32, for example. So this is probably correct actually

Whatever order chosen here, it has nothing to do with RISCV (our implementation always loads 32 bits and then simulates little-endian).

It must, however, match the bootloader implementation. It seems that in gl case it does some preprocessing to fit 1 field element every two words, and it uses little-endian there. Similarly, as poseidon_bb is implemented, it expects one field element per word, so the order which the limbs are padded matter.

leonardoalt · 2024-10-23T10:15:29Z

lgtm, but I'll let @lvella take a look too

lvella · 2024-10-23T14:45:52Z

riscv-executor/src/large_field/mod.rs

+}
+
+impl<F: FieldElement> RegisterMemory<F> {
+    pub fn for_bootloader(&self) -> HashMap<u32, Vec<F>> {


Wouldn't be possible to use an array instead of a Vec (the length can be a generic parameter)? Feels better that way.

lvella · 2024-10-23T14:46:51Z

riscv-executor/src/lib.rs

 }

 pub type MemoryState = HashMap<u32, u32>;
-pub type RegisterMemoryState<F> = HashMap<u32, F>;
+/// Value of registers is Vec<F> to unify the output for different field sizes
+pub type RegisterMemoryState<F> = HashMap<u32, Vec<F>>;


see my response above

lvella · 2024-10-23T14:52:04Z

riscv-executor/src/lib.rs

+    match F::known_field() {
+        Some(KnownField::BabyBearField | KnownField::Mersenne31Field) => small_field::execute_ast(


I feel like that the FieldElement trait should have a static method telling if it is a small or large field. This whole KnownField is kinda hackey. I think I used it before I knew of std::any::TypeId.

Suggested change

match F::known_field() {

Some(KnownField::BabyBearField | KnownField::Mersenne31Field) => small_field::execute_ast(

match F::size_category() {

FieldSize::Small => small_field::execute_ast(

it's already there as known_field().unwrap().field_size(), this PR predates that so I forgot to update here, will change

But the problem is the known_field(). Field size should be a property of the FieldElement trait itself. But maybe this is out of scope for this PR.

yea there are other places that could benefit from that, but we could do it after

lvella · 2024-10-23T14:54:35Z

riscv-executor/src/small_field/mod.rs

+//! A specialized executor for our RISC-V assembly that can speedup witgen and
+//! help with making partition decisions.
+//!
+//! WARNING: the general witness generation/execution code over the polynomial
+//! constraints try to ensure the determinism of the instructions. If we bypass
+//! much of witness generation using the present module, we lose the
+//! non-determinism verification.
+//!
+//! TODO: perform determinism verification for each instruction independently
+//! from execution.


I think this docstring belongs in the toplevel lib.rs file.

lvella · 2024-10-23T14:55:08Z

riscv-executor/src/large_field/mod.rs

+//! A specialized executor for our RISC-V assembly that can speedup witgen and
+//! help with making partition decisions.
+//!
+//! WARNING: the general witness generation/execution code over the polynomial
+//! constraints try to ensure the determinism of the instructions. If we bypass
+//! much of witness generation using the present module, we lose the
+//! non-determinism verification.
+//!
+//! TODO: perform determinism verification for each instruction independently
+//! from execution.


I think this docstring belongs in the toplevel lib.rs file.

lvella · 2024-10-23T17:39:16Z

riscv/src/small_field/bootloader.rs

+                todo!("call rust implememtation of poseidon bb")
+                // poseidon_bb(&buffer)


Or poseidon2.

lvella · 2024-10-23T17:56:40Z

riscv/src/small_field/bootloader.rs

+
+    fn iter_word_as_fe(v: u32) -> impl Iterator<Item = F> {
+        let (hi, lo) = split_word(v);
+        // TODO: check proper endianess here!


Doesn't matter, as long as it is consistent with the bootloader implementation.

Which I think could be improved, if the poseidon machine assumes that each memory limb can contain a full field element: it saves some preprocessing to prepare the input on the bootloader, and multiple calls to split machine on the output.

Currently, both poseidon_memory_gl and poseidon_bb assume that a memory position will never contains more than 32-bits, so they need the double of field elements (in the memory machine) to represent each field element.

lvella · 2024-10-23T17:58:10Z

riscv/src/large_field/bootloader.rs

+use powdr_number::{FieldElement, KnownField, LargeInt};
+use powdr_riscv_executor::large_field::poseidon_gl::poseidon_gl;
+
+pub fn split_fe<F: FieldElement>(v: &F) -> [F; 2] {


This one places the low bits first. Like the split machines, that returns (low, high).

lvella · 2024-10-23T18:25:28Z

riscv/src/small_field/bootloader.rs

+
+    fn update_page(page: &mut Self::Page, idx: usize, word: u32) {
+        let (hi, lo) = split_word(word);
+        // TODO: check proper endianess here!


Whatever order chosen here, it has nothing to do with RISCV (our implementation always loads 32 bits and then simulates little-endian).

It must, however, match the bootloader implementation. It seems that in gl case it does some preprocessing to fit 1 field element every two words, and it uses little-endian there. Similarly, as poseidon_bb is implemented, it expects one field element per word, so the order which the limbs are padded matter.

lvella · 2024-10-23T19:52:29Z

riscv/src/continuations/bootloader.rs

+// It's only used by this project's benchmarks (`benches` folder), which are
+// users of the crate and not part of it...
+pub fn default_input_witness<F: FieldElement>(accessed_pages: &[u64]) -> Vec<(String, Vec<F>)> {
+    match F::known_field() {


I would use std::any::TypeId instead of KnownField here. It is more standard.

pacheco added 3 commits October 21, 2024 18:33

refactor to support small field executor and bootloader

5fd4669

adds a BootloaderImpl trait that should be implemented by the small/large field variants of the bootloader

restore merkle tree tests

14def01

comment

d35a6ce

lint

48b6bd8

pacheco commented Oct 22, 2024

View reviewed changes

small change

9650c7c

pacheco commented Oct 22, 2024

View reviewed changes

outdated comment

d536971

pacheco commented Oct 22, 2024

View reviewed changes

pacheco marked this pull request as ready for review October 22, 2024 13:57

pacheco requested review from leonardoalt and lvella October 22, 2024 13:57

leonardoalt reviewed Oct 22, 2024

View reviewed changes

lvella reviewed Oct 23, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

RISCV bootloader types #1935

RISCV bootloader types #1935

pacheco commented Oct 22, 2024

pacheco commented Oct 22, 2024

pacheco Oct 22, 2024 •

edited

Loading

pacheco Oct 22, 2024 •

edited

Loading

lvella Oct 23, 2024

pacheco Oct 24, 2024 •

edited

Loading

pacheco Oct 22, 2024 •

edited

Loading

lvella Oct 23, 2024

pacheco Oct 22, 2024

pacheco Oct 22, 2024

pacheco commented Oct 22, 2024

leonardoalt Oct 22, 2024

pacheco Oct 22, 2024

leonardoalt Oct 23, 2024

lvella Oct 23, 2024

leonardoalt commented Oct 23, 2024

lvella Oct 23, 2024

lvella Oct 23, 2024

lvella Oct 23, 2024

pacheco Oct 24, 2024

lvella Oct 24, 2024

leonardoalt Oct 24, 2024

lvella Oct 23, 2024

lvella Oct 23, 2024

lvella Oct 23, 2024

lvella Oct 23, 2024

lvella Oct 23, 2024

lvella Oct 23, 2024

lvella Oct 23, 2024

		match F::known_field() {
		Some(KnownField::BabyBearField \| KnownField::Mersenne31Field) => small_field::execute_ast(

		todo!("call rust implememtation of poseidon bb")
		// poseidon_bb(&buffer)

RISCV bootloader types #1935

Are you sure you want to change the base?

RISCV bootloader types #1935

Conversation

pacheco commented Oct 22, 2024

pacheco commented Oct 22, 2024

pacheco Oct 22, 2024 • edited Loading

Choose a reason for hiding this comment

pacheco Oct 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pacheco Oct 24, 2024 • edited Loading

Choose a reason for hiding this comment

pacheco Oct 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pacheco commented Oct 22, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

leonardoalt commented Oct 23, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

pacheco Oct 22, 2024 •

edited

Loading

pacheco Oct 22, 2024 •

edited

Loading

pacheco Oct 24, 2024 •

edited

Loading

pacheco Oct 22, 2024 •

edited

Loading