Shared memory for the slice & cat operators is cool, but it introduces too much complexity in the code. The idea was that slice & cat would be used intensively inside other functions such as reduce(axis).
However, the added complexity seems too large compared to the advantages it brings (i.e. the iterator etc.). Plan: move the current main branch into a side branch and rework slice & cat, removing the range attribute from tensors. Maybe create a TensorView struct for everything related to the iterator.
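
A rough sketch of what the split could look like, written in Python only for illustration (the project's own language is not assumed here). Every name other than `TensorView` is hypothetical: `Tensor`, `strides`, `to_tensor` and `cat_axis0` are made up for this example, and the half-open `(start, stop)` range layout is an assumption. The point is that `Tensor` carries no range attribute, while the view owns the ranges and the iterator.

```python
from dataclasses import dataclass
from itertools import product
from typing import Iterator, List, Sequence, Tuple


@dataclass
class Tensor:
    """Owns its data; carries no range attribute (hypothetical layout)."""
    data: List[float]
    shape: Tuple[int, ...]

    def strides(self) -> Tuple[int, ...]:
        # Row-major strides, counted in elements.
        out, step = [], 1
        for dim in reversed(self.shape):
            out.append(step)
            step *= dim
        return tuple(reversed(out))

    def slice(self, ranges: Sequence[Tuple[int, int]]) -> "TensorView":
        # Returns a view that shares memory with this tensor (no copy).
        return TensorView(self, tuple(ranges))


@dataclass
class TensorView:
    """All range and iteration logic lives here, not on Tensor."""
    base: Tensor
    ranges: Tuple[Tuple[int, int], ...]  # half-open (start, stop) per axis

    @property
    def shape(self) -> Tuple[int, ...]:
        return tuple(stop - start for start, stop in self.ranges)

    def __iter__(self) -> Iterator[float]:
        # Walk the selected region in row-major order, reading from the base.
        strides = self.base.strides()
        for index in product(*(range(a, b) for a, b in self.ranges)):
            yield self.base.data[sum(i * s for i, s in zip(index, strides))]

    def to_tensor(self) -> Tensor:
        # Materialize the view into a tensor that owns a copy of the data.
        return Tensor(list(self), self.shape)


def cat_axis0(views: Sequence[TensorView]) -> Tensor:
    # Concatenate along axis 0 by copying; trailing dimensions must match.
    tail = views[0].shape[1:]
    assert all(v.shape[1:] == tail for v in views)
    data = [x for v in views for x in v]
    rows = sum(v.shape[0] for v in views)
    return Tensor(data, (rows, *tail))
```

In this sketch slice stays zero-copy and cat always copies, so only `TensorView` needs to know about ranges. Functions like reduce(axis) could then iterate a view without `Tensor` itself growing any extra state.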