cache ArchetypeId in QueryManyIter and QuerySortedIter #16564

Victoronz · 2024-11-30T08:54:42Z

Objective

When querying for a list of entities, QueryManyIter and QuerySortedIter call D::set_archetype and F::set_archetype with each item fetch (every next() call). Because the vast majority of queries do not have as many archetypes as entities, this is a lot of repeated work. For most QueryData, set_archetype is pretty cheap, but for EntityRef-style types it is implemented as a clone of Access.

Solution

Cache the previous archetype id in the iteration struct, and compare the current archetype id to decide whether to run archetype fetch setup logic.

Consequentially, ìter_many can now perform better than a series of gets. (This was already true, but to a much lesser degree)

Testing

Existing iteration tests.

Potential Future Work

Query::get() is even worse: For EntityRef types, each get() not only constructs an Access with D::init_fetch, but then also clones it twice! By giving Query both a fetch and filter field, D::init_fetch could be avoided in all forms of get() and iter(), and we could perform the same caching trick in all get()s!
However, because we do not have dedicated structs like with iteration, the latter runs into mutability issues: fetch/filter` need to be updated for "last archetype id" caching.

crates/bevy_ecs/src/query/iter.rs

hymm · 2024-12-03T22:57:56Z

crates/bevy_ecs/src/query/iter.rs

-                continue;
-            }
+            if *previous_archetype_id != location.archetype_id {
+                if !query_state


shouldn't this check be outside the new if statement?

We only cache valid archetype ids, so if an entity's archetype id matches the last, it must match the query.

However this depends on us initializing the cache with an archetype id that cannot be returned by Entities::get for any possible input entity.

I was under the impression that this held for ArchetypeId::INVALID, but I realize now that this id can be returned by pending entities in QueryManyIter.

Does ArchetypeId::EMPTY work for initialization instead?

Actually, it seems that the note on Entities::get about pending entities is incorrect? Which means no need to change anything here.
I hope nothing depends on that noted behavior...

That makes sense. It'd be nice if we could test this, but I'm not sure if it's even possible to create an entity in the invalid archetype. Someone should probably update the docs on Entities::get in another pr.

Hmm, you might be able too between a despawn and a flush.

To be clear, this is the implementation of Entities::get:

/// Returns the location of an [`Entity`]. /// Note: for pending entities, returns `Some(EntityLocation::INVALID)`. #[inline] pub fn get(&self, entity: Entity) -> Option<EntityLocation> { if let Some(meta) = self.meta.get(entity.index() as usize) { if meta.generation != entity.generation || meta.location.archetype_id == ArchetypeId::INVALID { return None; } Some(meta.location) } else { None } }

Any invalid ArchetypeId is filtered without exception, the note might have been referring to the validity of the other EntityLocation fields.

If we can call iter_many while an archetype (or other location fields) is invalid, we'll have to remove some debug_checked_unwraps

chescock · 2024-12-04T17:56:49Z

crates/bevy_ecs/src/query/iter.rs

-                .debug_checked_unwrap();
-            table = self.tables.get(location.table_id).debug_checked_unwrap();
-        }
+        let location = self.entities.get(entity).debug_checked_unwrap();


debug_checked_unwrap is unsafe, so this should be in an unsafe block with a safety comment.

It's allowed at the moment because it's inside an unsafe fn, but I think we eventually want to turn on the lint to check for these.

chescock · 2024-12-04T18:03:21Z

crates/bevy_ecs/src/query/iter.rs

-                table,
-            );
+        if self.previous_archetype_id != location.archetype_id {
+            self.previous_archetype_id = location.archetype_id;


Do we need to worry about unwind safety? If set_archetype panics and the caller does a catch_unwind and then calls this again, we'll call fetch without set_archetype having completed.

If we move the assignment after the call to set_archetype, then there's the risk that we'll call fetch for an entity in the previous archetype without a fresh set_archetype call, which could also be bad.

So maybe we have to set previous_archetype_id to INVALID here and then to location.archetype_id after the call to set_archetype?

Or is something in here already completely unwind-unsafe and we don't need to worry about making it worse?

I assume not? Going by unwind safety, QueryIterationCursor can skip over storage ids, which would make several iterator types behave incorrectly.

Yeah, but I don't think skipping entities can cause any unsoundness, since we don't impl TrustedLen anywhere.

We definitely have QueryData where it's unsound to call fetch without calling set_table or set_archetype first. So if you have some EvilQueryData that panics on one table and then works on the next, then a Query<(EvilQueryData, &C)> could cause UB when <&C>::fetch indexes into the wrong table column.

We might be able to blame the unsafe impl WorldQuery for EvilQueryData, but WorldQuery doesn't currently require that set_table and set_archetype not panic.

I'd be gently in favor of setting it to INVALID during the calls just to be safe, but I admit it's a crazy edge case that's unlikely to come up, so I'm not going to push too hard. :)

hmm, Query::iter explicitly states:
"This iterator is always guaranteed to return results from each matching entity once and only once." (No # Panic section)
Breaking this could cause UB in code that relies on this.
Besides, storage id skipping could allow EvilQueryData to falsify query emptiness, single and Populated checks, which are also present in get_param/validate_param. It could break wide swaths of bevy machinery!
Maybe we want to adjust the Safety section on WorldQuery?
I'd be in favor of making a note comment for now, and if we want unwind safety, adding it later.

under what conditions does set_archetype panic?

It is a trait function (with no Safety requirement regarding panicking), so there is no guarantee that it won't when implemented by some library/user. For in-engine impls set_archetype shouldn't panic.

cache ArchetypeId in QueryManyIter and QuerySortedIter

deac5be

allow clippy lint

3e505df

Victoronz force-pushed the previous-entity-location-caching branch from 0009905 to 3e505df Compare November 30, 2024 09:05

Victoronz added this to the 0.16 milestone Nov 30, 2024

BenjaminBrienen approved these changes Nov 30, 2024

View reviewed changes

crates/bevy_ecs/src/query/iter.rs Outdated Show resolved Hide resolved

switch clippy allow to expect

33098b4

hymm added the D-Unsafe Touches with unsafe code in some way label Dec 3, 2024

hymm reviewed Dec 3, 2024

View reviewed changes

chescock reviewed Dec 4, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

cache ArchetypeId in QueryManyIter and QuerySortedIter #16564

cache ArchetypeId in QueryManyIter and QuerySortedIter #16564

Victoronz commented Nov 30, 2024 •

edited

Loading

hymm Dec 3, 2024

Victoronz Dec 4, 2024 •

edited

Loading

Victoronz Dec 4, 2024

hymm Dec 4, 2024 •

edited

Loading

hymm Dec 4, 2024

Victoronz Dec 4, 2024

Victoronz Dec 4, 2024

chescock Dec 4, 2024

chescock Dec 4, 2024

Victoronz Dec 4, 2024 •

edited

Loading

chescock Dec 4, 2024

Victoronz Dec 4, 2024

hymm Dec 5, 2024

Victoronz Dec 6, 2024

cache ArchetypeId in QueryManyIter and QuerySortedIter #16564

Are you sure you want to change the base?

cache ArchetypeId in QueryManyIter and QuerySortedIter #16564

Conversation

Victoronz commented Nov 30, 2024 • edited Loading

Objective

Solution

Testing

Potential Future Work

Choose a reason for hiding this comment

Victoronz Dec 4, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

hymm Dec 4, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Victoronz Dec 4, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Victoronz commented Nov 30, 2024 •

edited

Loading

Victoronz Dec 4, 2024 •

edited

Loading

hymm Dec 4, 2024 •

edited

Loading

Victoronz Dec 4, 2024 •

edited

Loading