You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, the split operation results in two sub-scopes:
GPU 0 and all n cores
GPU 1 and all n cores
Note that the n cores are shared in both scopes.
Trying to split the sub-scopes over the tasks to get exclusive cores is not possible because we cannot apply a collective split operation over different sub-scopes.
This issue can be addressed by maintaining a list of exclusive resources associated with each device (e.g., cpuset). In this case, GPU 0 would have half of the cores in its resource list and GPU 1 would have the other half of the cores in its list. With such internal distribution of resources, the call to qv_scope_split_at above would result in the following two sub-scopes:
GPU 0 and n/2 cores
GPU 1 and the other n/2 cores
At this point there is no need for an additional split operation to get exclusive cores associated with a GPU scope.
The text was updated successfully, but these errors were encountered:
Assume the following scenario:
n
cores and two devices (GPUs)Currently, the split operation results in two sub-scopes:
Note that the n cores are shared in both scopes.
Trying to split the sub-scopes over the tasks to get exclusive cores is not possible because we cannot apply a collective split operation over different sub-scopes.
This issue can be addressed by maintaining a list of exclusive resources associated with each device (e.g., cpuset). In this case, GPU 0 would have half of the cores in its resource list and GPU 1 would have the other half of the cores in its list. With such internal distribution of resources, the call to
qv_scope_split_at
above would result in the following two sub-scopes:n/2
coresn/2
coresAt this point there is no need for an additional split operation to get exclusive cores associated with a GPU scope.
The text was updated successfully, but these errors were encountered: