Unique selector bytes for History, Beacon, and State networks #233

mrferris · 2023-10-02T18:17:43Z

Content keys on different portal networks can and do have overlapping selectors. For example the first byte of a history header content key is 0x00 and the first byte of a beacon bootstrap content key is 0x00.

I'm proposing that, as convention, we un-overlap these selectors for the sake of clarity. I'm not proposing that we make it required that these not overlap, which would put a limit on the number of auxillary networks (eg ethstorage) that can be built in the future.

The motivation for doing this is the ambiguity that it removes. With overlapping selectors a given content key needs to always be accompanied by information about which network it belongs to, otherwise it means nothing to the parser/reader. I think that this ambiguity will become a frustration while developing/debugging/discussing multi-subnet portal clients and the software that uses them.

History network keeps its selectors, so not much live data would need to be dumped to do this. Only beacon which is young enough for this to be a good time to make this change if we choose to.

pipermerriam · 2023-10-02T20:18:49Z

This is good to merge as-is...

When Jason and I chatted about this we talked about making the selector LEB128 encoded, which I believe changes nothing until we hit a selector value >=128. We could then segment off the range between 0-127 by nibbles which would let us do:

0x00-0x0f: history
0x10-0x1f: beacon
0x20-0x2f: state
etc

This has the nice property of being able to have the high nibbles tell us which network a key belongs to and giving us 8 blocks of keys to dole out before we have to expand the LEB encoded value to a 2-byte length value.

pipermerriam · 2023-10-02T20:20:22Z

beacon-chain/beacon-network.md

@@ -155,7 +155,7 @@ HISTORICAL_ROOTS_LIMIT = 2**24  # = 16,777,216

 ```
 light_client_bootstrap_key = Container(block_hash: Bytes32)
-selector                   = 0x00
+selector                   = 0x03


Updating this to 0x10 (and 0x11, 0x12, etc) would give us a nice namespace and give us room to expand the history network set of keys without awkward gaps.

pipermerriam · 2023-10-02T20:20:46Z

state-network.md

@@ -140,7 +140,7 @@ A leaf node from the main account trie and accompanying merkle proof against a r

 ```
 account_trie_proof_key := Container(address: Bytes20, state_root: Bytes32)
-selector               := 0x00
+selector               := 0x08


Same here, 0x20, 0x21 and so on would give us some nice namespacing.

kdeme

I think I'd like to understand a bit better (in the form of examples?) where/when this is deemed needed or useful?
Wouldn't the content always be together with its network context?

e.g. Any of the portal JSON RPC APIs is already per network prefix separated. The networks themselves are separated.

I guess test vectors could be parsed and tested in a more modular sense with this change? But that is also achievable by adding little network metadata to them (besides that you just might want to keep it seperate per network anyhow)

Side note: It also breaks compatibility with using these Prefixes as SSZ Union. And yes, the SSZ Union is not explicitly specified in the specs anymore, but Fluffy does still use it as it originally was in the spec, considering it is the same currently.
I'm not fully bound to it and it is not a huge change, but if I don't see a real benefit, I can't say I'm pro for it either :)

carver · 2023-10-05T21:26:28Z

I think I'd like to understand a bit better (in the form of examples?) where/when this is deemed needed or useful?
Wouldn't the content always be together with its network context?

I see this as most useful when the content keys start to escape the network. Like this link:
http://glados.ethportal.net/content/key/0x002d5460dbc4dfe734c7f00667e0cd678767c35f8ec8b08f033c2a299879bdb333/

Using the current url, there is no reliable way to determine what kind of content is at that key. In fact, nothing is stopping keys like this from being duplicated on multiple networks.

Modifying the ID byte could solve this problem for us.

There are other solutions for this particular scenario, of course. The URL format could be updated to include the network in the path, for example. More broadly, any app anywhere that wants to tell you something about a content key will always have to ask you: what is the key, and also which network did it come from?

So it seems like a simple solution to modify the byte in order to can clean up 3rd party interfaces that accept content keys, without requiring a second input.

Sure, this is a nice-to-have, but it's also a nearly-impossible-to-change-later, and a not-too-hard-to-change-now. :)
(though I admit ignorance into how hard it is for you to make the SSZ Union change in Fluffy)

ScottyPoi

👍 from me.

Will open a PR on Ultralight and merge it when this is final

kdeme · 2023-11-17T13:07:16Z

I personally still don't see that much use for this change. But I'm not specifically against it either, so if everyone else agrees with this, good to go for me too.

Sure, this is a nice-to-have, but it's also a nearly-impossible-to-change-later, and a not-too-hard-to-change-now. :)

This is probably the most compelling reason I've heard :-)

pipermerriam · 2023-11-18T01:57:47Z

I'm a strong +1 that we should do this.

pipermerriam · 2023-11-28T17:48:10Z

I think there's a follow up PR that needs to be done to update the main spec documents to denote which networks use which namespaces.

mrferris requested review from carver, pipermerriam and kdeme October 2, 2023 18:20

pipermerriam reviewed Oct 2, 2023

View reviewed changes

kdeme reviewed Oct 3, 2023

View reviewed changes

Gives beacon and state networks unique content key selector bytes

1ec1c8d

mrferris force-pushed the sequential-selectors branch from b4713bb to 1ec1c8d Compare November 3, 2023 21:25

mrferris requested a review from kdeme November 3, 2023 21:30

ScottyPoi approved these changes Nov 15, 2023

View reviewed changes

ScottyPoi mentioned this pull request Nov 15, 2023

Unique-content-selectors ethereumjs/ultralight#520

Merged

mrferris merged commit 72327da into master Nov 27, 2023
3 checks passed

ogenev mentioned this pull request Nov 28, 2023

Update content key selector bytes for Beacon network ethereum/trin#1044

Closed

kdeme mentioned this pull request Dec 4, 2023

Unique content key prefixes cross portal content networks status-im/nimbus-eth1#1924

Closed

ogenev mentioned this pull request Dec 11, 2023

Update content key selector bytes for State network ethereum/trin#1066

Closed

pipermerriam deleted the sequential-selectors branch December 13, 2023 18:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Unique selector bytes for History, Beacon, and State networks #233

Unique selector bytes for History, Beacon, and State networks #233

mrferris commented Oct 2, 2023

pipermerriam commented Oct 2, 2023

pipermerriam Oct 2, 2023

pipermerriam Oct 2, 2023

kdeme left a comment

carver commented Oct 5, 2023 •

edited

Loading

ScottyPoi left a comment

kdeme commented Nov 17, 2023

pipermerriam commented Nov 18, 2023

pipermerriam commented Nov 28, 2023

Unique selector bytes for History, Beacon, and State networks #233

Unique selector bytes for History, Beacon, and State networks #233

Conversation

mrferris commented Oct 2, 2023

pipermerriam commented Oct 2, 2023

pipermerriam Oct 2, 2023

Choose a reason for hiding this comment

pipermerriam Oct 2, 2023

Choose a reason for hiding this comment

kdeme left a comment

Choose a reason for hiding this comment

carver commented Oct 5, 2023 • edited Loading

ScottyPoi left a comment

Choose a reason for hiding this comment

kdeme commented Nov 17, 2023

pipermerriam commented Nov 18, 2023

pipermerriam commented Nov 28, 2023

carver commented Oct 5, 2023 •

edited

Loading