Support block matrices in several BLAS1 routines #226

ndryden · 2017-03-24T20:26:55Z

This adds support for block matrices in the Hadamard, Dot, HilbertSchmidt, ColumnTwoNorms, and ColumnMaxNorms functions (see #224). I also added tests for Hadamard, Dot, ColumnTwoNorms, and ColumnMaxNorms.

Let me know if any additional changes are needed.

…idt to support block matrices, and add tests for Hadamard, Dot, ColumnTwoNorms, and ColumnMaxNorms.

…bertSchmidt.

poulson

Thank you for the very high-quality PR! I only have a few minor comments.

poulson · 2017-03-25T21:30:50Z

tests/blas_like/ColumnNorms.cpp

+    T expected = 0;
+    for (Int i = 0; i < A.LocalHeight(); ++i) {
+      T val = A.GetLocal(i, j);
+      expected += val * val;


Explicitly accumulating the squares and then square-rooting should be fine for random data but can lead to very low accuracy in extreme cases. There is the routine El::UpdateScaledSquare for accumulating the square of the two norm as the product of a scale and a scaled square that should generally be preferred (and which is used within the norm computation routines).

My main goal there was to provide a simple baseline that ensures there's no major issues (e.g. block vs element distributions causing problems) while not just reimplementing the method that's being tested. If you think it would be better to switch to UpdateScaledSquare in this case, I can do that.

poulson · 2017-03-25T21:32:12Z

tests/blas_like/ColumnNorms.cpp

+    }
+    expected = mpi::AllReduce(expected, g.ColComm());
+    expected = Sqrt(expected);
+    if (Abs(got - expected) > 1e-5) {


It would be more rigorous to use a bound of the form n * limits::Epsilon<El::Base<T>>() times a small constant (e.g., 10).

poulson · 2017-03-25T21:34:26Z

tests/blas_like/Dot.cpp

+    TestDot<float, ELEMENT>(m, n, g, print);
+    TestDot<float, BLOCK>(m, n, g, print);
+    TestDot<double, ELEMENT>(m, n, g, print);
+    TestDot<double, BLOCK>(m, n, g, print);


Why exclude complex tests and more precise datatypes (e.g., El::DoubleDouble, El::QuadDouble, El::Quad, and El::BigFloat)? I would be happy to add the appropriate flag guards for the higher-precision tests, but the complex arithmetic variants also working is crucial.

poulson · 2017-03-25T21:34:56Z

tests/blas_like/Dot.cpp

+    TestDot<float, BLOCK>(m, n, g, print);
+    TestDot<double, ELEMENT>(m, n, g, print);
+    TestDot<double, BLOCK>(m, n, g, print);
+  } catch (exception& e) {


Nit: Elemental has made the (perhaps dubious) choice of putting curly braces on their own line.

poulson · 2017-03-25T21:35:32Z

tests/blas_like/Hadamard.cpp

+    for (Int i = 0; i < A.LocalHeight(); ++i) {
+      T got = C.GetLocal(i, j);
+      T expected = A.GetLocal(i, j) * B.GetLocal(i, j);
+      if (Abs(got - expected) > 1e-6) {


Same comment here about using El::limits::Epsilon<El::Base<T>>() times a small constant.

poulson · 2017-03-25T21:36:13Z

tests/blas_like/Hadamard.cpp

+    TestHadamard<float, ELEMENT>(m, n, g, print);
+    TestHadamard<float, BLOCK>(m, n, g, print);
+    TestHadamard<double, ELEMENT>(m, n, g, print);
+    TestHadamard<double, BLOCK>(m, n, g, print);


Same comment here about testing the complex (and high-precision) cases.

poulson · 2017-03-25T21:37:44Z

include/El/blas_like/level1/Hadamard.hpp

@@ -60,6 +60,9 @@ void Hadamard
        LogicError("A, B, and C must share the same distribution");
    if( A.ColAlign() != B.ColAlign() || A.RowAlign() != B.RowAlign() )
        LogicError("A and B must be aligned");
+    if ( A.BlockHeight() != B.BlockHeight() ||


Thank you noticing this crucial portion of the extension!

…ision and in complex variants. Clean up the syntax and use limits::Epsilon for checking error.

ndryden · 2017-03-27T01:29:50Z

My latest commit addresses your comments (except see the discussion on UpdateScaledSquare). Let me know if there's any other changes required!

poulson · 2017-03-27T02:28:38Z

tests/blas_like/Dot.cpp

-  } catch (exception& e) {
+    TestDot<Complex<double>, ELEMENT>(m, n, g, print);
+    TestDot<Complex<double>, BLOCK>(m, n, g, print);
+#if defined(EL_HAVE_QD) && defined(EL_ENABLE_DOUBLEDOUBLE)


There is no need for the EL_ENABLE macros here (or in any test driver), as they are only used to signal to some of the include macros used for template instantiation in the library.

poulson · 2017-03-27T02:46:28Z

tests/blas_like/ColumnNorms.cpp

      T val = A.GetLocal(i, j);
      expected += val * val;
    }
    expected = mpi::AllReduce(expected, g.ColComm());
    expected = Sqrt(expected);
-    if (Abs(got - expected) > 1e-5) {
+    if (Abs(got - expected) > 10 * limits::Epsilon<El::Base<T>>())


The accumulated error should be a function of the number of entries (with said function being logarithmic if a tree-based scheme is used, but linear with the current single-process implementation): would you mind multiplying the right-hand side by the number of entries?

Also, it would be better to use a relative bound and to diving the left-hand side by the maximum of the norm and 1 (so that it still behaves well for zero norms).

Just want to clarify: The new check should be Abs(got - expected) / std::max(expected, 1) > m * n * 10 * limits::Epsilon<El::Base<T>>()?

poulson · 2017-03-27T19:40:55Z

yes, if that isn't too much hassle. otherwise it is easy to think of cases where the check would fail.

…umnTwoNorms test.

poulson

Thank you for incorporating the changes!

ndryden added 2 commits March 24, 2017 07:24

Update Hadamard, Dot, ColumnTwoNorms, ColumnMaxNorms, and HilbertSchm…

90b6408

…idt to support block matrices, and add tests for Hadamard, Dot, ColumnTwoNorms, and ColumnMaxNorms.

Add check for whether the same block size is used in Hadamard and Hil…

4afe36a

…bertSchmidt.

poulson requested changes Mar 25, 2017

View reviewed changes

Test Hadamard, Dot, ColumnTwoNorms, and ColumnMaxNorms at higher prec…

be1c42a

…ision and in complex variants. Clean up the syntax and use limits::Epsilon for checking error.

poulson requested changes Mar 27, 2017

View reviewed changes

Remove unneeded macro checks and use a more rigorous bound in the Col…

e9139f4

…umnTwoNorms test.

poulson approved these changes Mar 28, 2017

View reviewed changes

poulson merged commit 776b805 into elemental:master Mar 28, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Support block matrices in several BLAS1 routines #226

Support block matrices in several BLAS1 routines #226

ndryden commented Mar 24, 2017

poulson left a comment

poulson Mar 25, 2017

ndryden Mar 27, 2017

poulson Mar 25, 2017

poulson Mar 25, 2017

poulson Mar 25, 2017

poulson Mar 25, 2017

poulson Mar 25, 2017

poulson Mar 25, 2017

ndryden commented Mar 27, 2017

poulson Mar 27, 2017

poulson Mar 27, 2017

ndryden Mar 27, 2017

poulson commented Mar 27, 2017

poulson left a comment

Support block matrices in several BLAS1 routines #226

Support block matrices in several BLAS1 routines #226

Conversation

ndryden commented Mar 24, 2017

poulson left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

ndryden commented Mar 27, 2017

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

poulson commented Mar 27, 2017

poulson left a comment

Choose a reason for hiding this comment