python (65.2k questions)
javascript (44.3k questions)
reactjs (22.7k questions)
java (20.8k questions)
c# (17.4k questions)
html (16.3k questions)
r (13.7k questions)
android (13k questions)
Fastest way to multiply and sum/add two arrays (dot product) - unaligned surprisingly faster than FMA
Hi I have the following code:
public unsafe class MultiplyAndAdd : IDisposable
{
float[] rawFirstData = new float[1024];
float[] rawSecondData = new float[1024];
static int alignment = 32...
Peter
Votes: 0
Answers: 1
Terminology: why "floating multiply-add" instead of "fused multiply-add"?
C11 (and newer):
7.12.13 Floating multiply-add
IEEE 754-2008:
fused multiply add,
fusedMultiplyAdd
Wikipedia:
fused multiply-add
Why C11 (and newer) uses "floating multiply-add" instea...
pmor
Votes: 0
Answers: 1