Skip to content
  • sheaf's avatar
    a40f4ab2
    Fix FMA instruction on LLVM · a40f4ab2
    sheaf authored and Marge Bot's avatar Marge Bot committed
    We were emitting the wrong instructions for fused multiply-add
    operations on LLVM:
    
      - the instruction name is "llvm.fma.f32" or "llvm.fma.f64", not "fmadd"
      - LLVM does not support other instructions such as "fmsub"; instead
        we implement these by flipping signs of some arguments
      - the instruction is an LLVM intrinsic, which requires handling it
        like a normal function call instead of a machine instruction
    
    Fixes #24223
    a40f4ab2
    Fix FMA instruction on LLVM
    sheaf authored and Marge Bot's avatar Marge Bot committed
    We were emitting the wrong instructions for fused multiply-add
    operations on LLVM:
    
      - the instruction name is "llvm.fma.f32" or "llvm.fma.f64", not "fmadd"
      - LLVM does not support other instructions such as "fmsub"; instead
        we implement these by flipping signs of some arguments
      - the instruction is an LLVM intrinsic, which requires handling it
        like a normal function call instead of a machine instruction
    
    Fixes #24223
Loading