Version 0.8.5.9539
Zen5 optimizations were originally intended for this release, but due to delays in getting the final hardware, they have been pushed to a future update.
New Features:
- The BBP digit extractor has now been made into a formal benchmark.
- The BBP program now supports command line options.
- The default BBP algorithm has been changed to Huvent's formula.
- BBP computations now produce validation files.
It remains to be seen if this benchmark will be added to HWBOT.
The BBP benchmark will be a CPU-only benchmark that is unaffected by memory bottlenecks. It will complement the existing Pi benchmarks, which have become increasingly memory-bound.
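
As a rough illustration of why digit extraction is compute-bound rather than memory-bound, here is a minimal sketch of hexadecimal digit extraction for Pi. It uses the classic Bailey-Borwein-Plouffe formula, not Huvent's formula, and it is not y-cruncher's implementation. The function names `_series` and `pi_hex_digits` are hypothetical, and the use of double-precision floats is an assumption that limits it to a handful of digits at small offsets.

```python
def _series(j, n):
    """Fractional part of sum over k of 16^(n-k) / (8k + j)."""
    s = 0.0
    # Head of the series: modular exponentiation keeps every term below 1,
    # so only a few scalars are ever held in memory.
    for k in range(n + 1):
        d = 8 * k + j
        s = (s + pow(16, n - k, d) / d) % 1.0
    # Tail of the series: terms shrink by a factor of 16 each step.
    k = n + 1
    while True:
        term = 16.0 ** (n - k) / (8 * k + j)
        if term < 1e-17:
            break
        s += term
        k += 1
    return s % 1.0


def pi_hex_digits(n, count=8):
    """Hex digits of Pi starting n places after the point (n = 0 is the first)."""
    # Classic BBP combination: 4*S1 - 2*S4 - S5 - S6, reduced modulo 1.
    x = (4 * _series(1, n) - 2 * _series(4, n)
         - _series(5, n) - _series(6, n)) % 1.0
    digits = []
    for _ in range(count):
        x *= 16
        d = int(x)
        digits.append("0123456789ABCDEF"[d])
        x -= d
    return "".join(digits)


print(pi_hex_digits(0))  # Pi = 3.243F6A88... in hexadecimal
```

The working set is a few floating-point values regardless of the digit offset, so the run time is dominated by arithmetic rather than by memory traffic.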
Stress Tester Changes:
- Added new tests SNT and SVT which are small in-cache versions of N63 and VT3.
- Added new tests FFTv4 and SFTv4. This is a new floating-point FFT implementation.
- FFT and SFT are the old floating-point FFT implementation. They remain for now but will be removed in the future.
Other Changes:
- Added a new floating-point FFT implementation. This improves performance by a few percent.
- Switched compilers from ICC to ICX. This hurts performance by a few percent, mostly offsetting the gain from the new FFT implementation.
- The BBP program is now slower because the ICX compiler is worse than the ICC compiler.
- y-cruncher will now accept JSON for loading config files.
- y-cruncher will now allow computations up to 500 trillion digits without developer authorization.
You may notice some visual changes where "Tuning Profiles" have been added to the program in various places. These are part of a future feature that did not make the cut for this release. So for now, they serve no purpose other than visual aesthetics.
Fixes:
- For the BBP program, the 12-BD2 binary now correctly uses the FMA3 codepath instead of the SSE4.1 codepath.