-
-
Notifications
You must be signed in to change notification settings - Fork 95
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Dependency parallelization #151
Comments
Just wanted to chime in, I've been mocking around a scheduler for building automatically parallel systems with dependency management like this. Hasn't gotten anywhere yet besides building the dependency graph so can't provide a lot of input. But there's some great existing reference material from bevy and its stageless design. https://hackmd.io/@alice-i-cecile/SJvmN1rAi Basic premise is you have SystemSets which can be ordered and configured the same as Systems, then any System can be put in a SystemSet and it will inherit all the configuration of the SystemSets it belongs to. This lets you easily group up systems and define rough dependencies without having to know the exact system ordering. E.g. Instead of having to say MyRenderTextures system has to happen after MyMoveSprites, you can have a LogicSet and a RenderSet, put the systems in there, and say LogicSet happens after RenderSet. |
Now that ZeroAllocJobScheduler is getting to a pretty good place, I have some thoughts on the dependency API. First off -- We need two separate options for parallel queries: one to schedule a query along with other queries, and another to actually run a single query in parallel with I'm not sure what that API would look like, exactly. Since we have 3 ways of creating queries, that would mean 9 different versions to implement, which isn't fun. It'd be nice to get something a bit more unifying, but I'm not sure how. But that aside: I think a good (eventual?) goal would be to implement automatic dependency tracking (I think this is similar to what @xentripetal is talking about, but I'm not sure since I've never used Bevy, so I don't really understand those pages). But Unity does have a fairly nice API for handling dependencies in a fully-automatic way. The idea is to track components accessed, and whether they're accessed in read mode or write mode. If we track the dependency graph, we can then automatically parallelize any code. A few caveats...
This does bring up the question of our own API, though. I see a few options:
I definitely prefer 1 or 2, and am loosely in favor of 2. That said, I worry that it adds bloat to Arch and makes its central API less beautifully simple (unless we can get a beautifully simple solution up and running). |
Sorry, I explained myself poorly there. My input was on how explicit ordering could be defined from a user perspective. The docs I linked don't discuss how bevy handles system dependencies but how it builds a user defined schedule. It handles dependencies similar to how you're discussing, look at read/writes and determine what archetypes each system will hit and prevents R/W W/W conflicts. Though it also enforces everything is scheduled and prevents any conflicts from not having an explicit user defined order so there's no random side effects. But that's likely outside the scope of this. The keypoint being I think there should be some way to refer to the dependencies without directly referencing the exact implementation of another system. There should be some sort of way to group multiple systems together and be able to reference that another system depends on that group without knowing all of the systems in it. Additionally, I think it would be helpful to be able to define after/before relationships and not just before. The Unity Dependencies API only allows defining before dependencies. Though both of these would require building an intermediate graph and allowing configuration on top of it, so I understand if it seems out of scope. Someone could just build a third party scheduler on top of the Arch solution that resolves all queries and its dependencies then translates it into Arch's JobHandle model.
My vote would also be on 2, it would allow that complex scheduling graph described above to be designed on Arch.Extended or some other third party lib while letting any standard Arch main thread queries be safe of race conditions without having to wrap them in some external dependency Complete() manager. |
That actually sounds pretty interesting! Arch.Extended/Arch.System features a group which could take care of this basically ^^
Well, this is really a difficult topic. Arch's main selling point is simplicity and bare-minimum. Adding dependency parallelization to arch would just bloat the code, make the API more complex, and probably make the API slower even if you don't want to do anything with multithreading (extra checks, sync points, and so on). So this way would undermine Arch "philosophy" and make it slower. Arch must not lose that, it must always be designed to be easily extended. Instead I think the combination of approach 1 and 3 is best. We could simply rebuild or extend the So we could also easily generate "synchronization points" and at the same time add methods to enforce them or add independently created jobs to tracking. An example of this would be... [Parallel, Query]
[...]
public void CollisionChecking(...){
...
}
// From the Base system for manual control
public void Update(...){
World.Synchronize(); // A method the user could use to force a sync point manually.
World.Query(...); // No problem here since we are synchronized.
var handle = CollsionCheckingQuery(); // We could also execute or complete them manually since the generated query returns a JobHandle?
// Since generated method, it gets automatically tracked of course
var handle = World.ParallelQuery(...); // That would be in there by default as well
World.Track(handle, types?); // make that manual parallel query track aswell since its not auto generated.
...
} The optional override of Update already works in the normal source generator. If the user does not add a custom implementation, this will be done automatically, of course. So with this I think the user has the best of both worlds and Arch itself would not be bloated. |
Yeah, that makes sense! The API is a little more complex than I would like, but I get what you mean. Just brainstorming.... What if Arch.Extended extended It might be a pain to maintain... but it would give us a lot of power to make a very nice API. What do you think? Edit: Wouldn't actually be a pain to maintain if we added the right hooks to Arch. For example, a protected nullable delegate Edit edit: Kept thinking about this idea so I wrote out my thoughts and stuck them in a draft API document thing: https://gist.github.com/LilithSilver/fac8493d09ef7e0519ff3cae20b267d2 Hopefully that clarifies the advantages/disadvantages and pain points, at least. |
Background
Sometimes, I want to run multiple queries or systems in parallel, not just a single query, and track dependencies between them.
A very mature version of this feature might look something like Unity's Job Dependencies. Arch.Extended could build on top of that with something like Unity's Dependency property which automatically tracks dependencies between parallel queries across systems and sets up the job dependencies accordingly.
But even without that magic dependency finding, any way of running multiple queries in parallel, even if handles would need to be tracked manually, would be appreciated! i.e. a simple example would be
var handle = world.ParallelQuery(...., DependencyJobHandle)
that returns aJobHandle
itself for further scheduling. These chains could then be tracked across systems for the intrepid user (or by Arch.Extended of course).Is something along these lines planned? Am I missing something that's already possible? I know parallelization isn't supported for sourcegen yet, but anything for regular queries?
Idea
The idea is quite simple and has already been described in the background. Querys should also have a possibility to run in parallel and in dependency. There could be different syntaxes for this.
Or for Arch.Extended
State
In the meantime, lillith had submitted some great PRs to bring the job scheduler up to speed. Among them Dependencys inserted and some improvements made.
https://github.com/genaray/ZeroAllocJobScheduler
The text was updated successfully, but these errors were encountered: