Amdahl's Law #3

Mysticial · 2017-10-25T04:10:46Z

Amdahl's Law is apparent on very large machines.

Back in 2016 I had the opportunity to play with an 88-vcore Broadwell system, and the effect was obvious just from looking at Task Manager. It is caused by a number of unparallelized linear operations such as bignum addition and subtraction.

These operations have historically been memory-bound and were largely neglected as far as optimizations go. At this point, we're entering an era where a parallelized bignum multiply may actually be faster than an unparallelized bignum add.

Some work has been done in the v0.7.x releases to parallelize some of these linear operations, so this needs to be re-tested. Unfortunately I don't regularly have access to these types of machines to see what kind of progress has been made.

fffffgggg54 · 2017-12-23T22:30:58Z

What type of machines do you have access to, and what would you like access to?

Mysticial · 2017-12-24T00:44:10Z

The 88-vcore Broadwell-EP system I played with last year had 768 GB of RAM, which is enough to run 100 billion digits of Pi entirely in RAM. That is large enough for me to see, in Task Manager, long stretches of the computation using only one core (due to Amdahl's Law on the non-parallelized routines).

The largest machine I have access to right now is a 10-core/20-thread Core i9 7900X with 128 GB of RAM. While it's not large enough to see the non-parallelized parts in Task Manager with the naked eye (as single-core CPU usage), they are visible under a suitable profiler with millisecond granularity.

Since I am able to see them under a profiler, I've been able to track down a number of Amdahl's Law offenders in the code and fix (parallelize) them. These fixes will come out in v0.7.5, though the speedup on the 7900X is barely noticeable.

The real benchmark is to see how much things will have improved on a large system similar to the 88-core Broadwell I played with last year. A Knights Landing system with a lot of memory should also be a good benchmark for Amdahl's Law effects.
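To illustrate why the same serial routines are barely visible on 20 threads but obvious on 88, here is a minimal sketch of Amdahl's Law. The 1% serial fraction is an assumed, illustrative figure, not a measurement from y-cruncher, and the function names are hypothetical.

```python
def amdahl_speedup(serial_fraction: float, workers: int) -> float:
    """Ideal speedup over one worker when `serial_fraction` of the
    single-threaded runtime cannot be parallelized."""
    if not 0.0 <= serial_fraction <= 1.0:
        raise ValueError("serial_fraction must be in [0, 1]")
    if workers < 1:
        raise ValueError("workers must be >= 1")
    return 1.0 / (serial_fraction + (1.0 - serial_fraction) / workers)


def serial_wall_time_share(serial_fraction: float, workers: int) -> float:
    """Share of wall-clock time spent in the serial part (i.e. showing
    as single-core usage) when running on `workers` threads."""
    parallel_time = (1.0 - serial_fraction) / workers
    return serial_fraction / (serial_fraction + parallel_time)


# ASSUMPTION: 1% of single-threaded runtime is serial. Illustrative only.
ASSUMED_SERIAL_FRACTION = 0.01

for workers in (20, 88):
    speedup = amdahl_speedup(ASSUMED_SERIAL_FRACTION, workers)
    share = serial_wall_time_share(ASSUMED_SERIAL_FRACTION, workers)
    print(f"{workers:3d} threads: speedup {speedup:5.1f}x, "
          f"serial share of wall time {share:.0%}")
```

Under that assumed 1% figure, the serial part takes roughly 17% of wall time on 20 threads but about 47% on 88 threads, which is consistent with it only becoming visible to the naked eye on the larger machine.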

MikeS159 · 2018-02-06T13:27:59Z

Maybe you could get in contact with one of the guys from Linus Tech Tips (a tech YouTube channel, if you haven't heard of them). They use y-cruncher as part of their standard benchmarking for CPUs, and they are always playing around with high-core-count systems.

fffffgggg54 · 2018-02-10T00:54:46Z

@MikeS159 True. Do they still use it? I have not heard much mention of it.

Mysticial · 2018-02-10T01:28:41Z

I saw it in his Ryzen and Skylake X reviews. Not sure if he used it in his Coffee Lake review since I haven't really paid attention to that line.

Mysticial added the bug and performance labels on Oct 25, 2017
