[RFC] PPC hardfpu for instructions that don't alter FPSCR.FI #80

vcoracolombo · 2022-03-24T19:42:25Z

This is a very simple patch written on top of #76 which tries to enable hardfpu for PPC targets. Specifically, this RFC makes the changes for multiply-add instructions to be able to use hardfpu.

if (!(fpscr_mask & FP_FI) && (env->fpscr & FP_FI)) {                  \
    float_exception_flags |= float_flag_inexact;                      \
}                                                                     \

The construct above sets the inexact flag when both of the following hold:

The instruction does not alter FPSCR.FI. This matters because hardfpu does not emulate this bit correctly, so the bit must be guaranteed not to change.
The FI bit is already set.

Having the inexact flag set in float_exception_flags is a requirement for the hardfpu code to work (see can_use_fpu() in fpu/softfloat.c).
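For readers unfamiliar with that gate, here is a minimal standalone sketch of the idea. It is not the code from fpu/softfloat.c: the type fp_status_model, the function model_can_use_fpu() and the flag values are hypothetical stand-ins, and the rounding-mode check reflects my understanding of the real function rather than anything stated in this PR.

```c
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-ins for softfloat's flags and rounding modes. */
enum { FLAG_INEXACT = 1u << 0, FLAG_INVALID = 1u << 1 };
enum { ROUND_NEAREST_EVEN = 0, ROUND_TO_ZERO = 1 };

/* Simplified model of the accumulated floating-point status. */
typedef struct {
    uint8_t exception_flags;
    int rounding_mode;
} fp_status_model;

/*
 * The host FPU fast path is only safe when the inexact flag is already
 * accumulated (so the host result cannot change it) and, as assumed here,
 * when rounding is round-to-nearest-even.
 */
static bool model_can_use_fpu(const fp_status_model *s)
{
    assert(s != NULL);
    return (s->exception_flags & FLAG_INEXACT) &&
           s->rounding_mode == ROUND_NEAREST_EVEN;
}
```

In this model, a status with no flags set takes the slow path; setting FLAG_INEXACT up front, as the construct above does, is what opens the fast path.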

The problem with this solution is that it would only be able to enable hardfpu for a small set of instructions (those which don't alter FI). I think the list of all instructions which would work with this idea is:

xscmp* and xvcmp*
xsmax* and xvmax*
xsmin* and xvmin*
xstdivdp and similar
xvadd*
xvmul*
xvdiv*
xvcv*
xv[n]madd* and xv[n]msub*
xvr*

These are only a subset of the vector and VSX instructions. For floating point instructions, we need to think of another solution. For now, this RFC covers the instructions that can benefit from the 'not alter FI' behavior.

I did some testing, and both the results and the FPSCR flags seem to be emulated correctly with these patches applied. However, I can't get rid of the feeling that I'm not understanding something and that it might not be that simple. Can anyone think of a situation where this won't work?

vcoracolombo · 2022-03-25T12:00:42Z

This is a very simple patch written on top of #76 which tries to enable hardfpu for PPC targets. Specifically, this RFC makes the changes for multiply-add instructions to be able to use hardfpu.
if (!(fpscr_mask & FP_FI) && (env->fpscr & FP_FI)) {                  \

After some more thought, I think if (!(fpscr_mask & FP_FI) && (env->fpscr & FP_XX)) { might be better. FI still doesn't need to be changed, and XX is already set and sticky, so it won't need to be changed either. This would allow this case to be hit more frequently.
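To make the difference between the two conditions concrete, here is a small standalone sketch. The helper names preseed_if_fi() and preseed_if_xx() are hypothetical, and the MODEL_FP_* bit positions are assumptions chosen for illustration, not values taken from this PR. As in the patch, fpscr_mask is read as the set of FPSCR bits the instruction may alter.

```c
#include <stdbool.h>
#include <stdint.h>

/* Assumed bit positions, for illustration only. */
#define MODEL_FP_FI (UINT64_C(1) << 17)
#define MODEL_FP_XX (UINT64_C(1) << 25)

/* Original condition: instruction leaves FI alone and FI is already set. */
static bool preseed_if_fi(uint64_t fpscr_mask, uint64_t fpscr)
{
    return !(fpscr_mask & MODEL_FP_FI) && (fpscr & MODEL_FP_FI);
}

/* Proposed condition: instruction leaves FI alone and sticky XX is set. */
static bool preseed_if_xx(uint64_t fpscr_mask, uint64_t fpscr)
{
    return !(fpscr_mask & MODEL_FP_FI) && (fpscr & MODEL_FP_XX);
}
```

With an FPSCR value that has XX set but FI clear, only preseed_if_xx() returns true, which is the extra case the FP_XX variant picks up. When the instruction's mask includes FI, both return false.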

Signed-off-by: Víctor Colombo <[email protected]>

vcoracolombo · 2022-04-07T16:52:33Z

Changed to be based on #81, and to use FP_XX as the condition instead of FP_FI.

@mferst ping

vcoracolombo requested review from luporl, alqotel, mferst and lrcoutinho March 24, 2022 19:42

vcoracolombo self-assigned this Mar 24, 2022

vcoracolombo added 6 commits March 31, 2022 14:26

target/ppc: Fix FPSCR.FI bit

ec21fa8

Signed-off-by: Víctor Colombo <[email protected]>

target/ppc: Remove FPSCR.FI changing in float_overflow_excp()

8d711f0

Signed-off-by: Víctor Colombo <[email protected]>

target/ppc: Add invalid imz, isi and snan to do_float_check_status()

5eb7170

Signed-off-by: Víctor Colombo <[email protected]>

target/ppc: Rely on do_float_check_status for VSX_MADD invalid excepts

7598a50

Signed-off-by: Víctor Colombo <[email protected]>

fpu: Activate hardfpu for PPC

8459a16

Signed-off-by: Víctor Colombo <[email protected]>

target/ppc: Make madd instructions work with hardfpu

f31cd81

Signed-off-by: Víctor Colombo <[email protected]>

vcoracolombo force-pushed the vccolombo-hardfloat branch from 3b0361c to f31cd81 Compare April 7, 2022 16:50
