-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
operator() is horrendously slow #75
Comments
Do you mean x100 slower than nested loops with explicit offset computation
or than DGEMM? A code snippet would be useful.
|
See above. I will measure the timing more accurately. |
the BTAS code I am talking about is the following, which does not seem to be written for efficiency. The timing follows (takes a lot as it needs to recompile the entire BAGEL)
|
Code in BAGEL - I am talking about the loop in the middle (again, timing follows)
|
It is most probably due to dynamic memory allocation in index construction.
|
Sorry maybe this could be partly my fault. I will figure it out (also I should default to std::array now). |
we definitely need an alias for constexpr number of dimensions ... perhaps something like this: /// Tensor with const number of dimensions |
View::begin() seems very slow, which was perhaps responsible for my problem. just for your info. |
I know this is not a bug but has to be reported. The MP2 assembly step becomes slower by a lot if I use BTAS' native operator().
(I previously wrote 50-100, but seems that was not quite right. It is slow, though.)
The text was updated successfully, but these errors were encountered: