Functions | |
enum YepStatus | yepCore_SumSquares_V32f_S32f (const Yep32f *restrict v, Yep32f *restrict sumSquares, YepSize length) |
Computes the sum of squares of single precision (32-bit) floating-point array elements. More... | |
enum YepStatus | yepCore_SumSquares_V64f_S64f (const Yep64f *restrict v, Yep64f *restrict sumSquares, YepSize length) |
Computes the sum of squares of double precision (64-bit) floating-point array elements. More... | |
enum YepStatus yepCore_SumSquares_V32f_S32f | ( | const Yep32f *restrict | v, |
Yep32f *restrict | sumSquares, | ||
YepSize | length | ||
) |
Computes the sum of squares of single precision (32-bit) floating-point array elements.
[in] | v | Pointer to the array of elements which will be squared (without write-back) and summed up. |
[out] | sumSquares | Pointer to the variable where the sum of squares will be stored. |
[in] | length | Length of the array specified by v. If length is zero, the computed sum of squares will be 0. |
YepStatusOk | The computation finished successfully. |
YepStatusNullPointer | v or sumSquares argument is null. |
YepStatusMisalignedPointer | v or sumSquares argument is not naturally aligned. |
Architecture | Target microarchitecture | Required instruction extensions |
---|---|---|
x86-64 | Intel Nehalem | SSE, SSE3 |
x86-64 | Intel Sandy Bridge | AVX |
x86-64 | Intel Haswell | AVX, FMA3 |
x86-64 | AMD Bulldozer | AVX, FMA4 |
enum YepStatus yepCore_SumSquares_V64f_S64f | ( | const Yep64f *restrict | v, |
Yep64f *restrict | sumSquares, | ||
YepSize | length | ||
) |
Computes the sum of squares of double precision (64-bit) floating-point array elements.
[in] | v | Pointer to the array of elements which will be squared (without write-back) and summed up. |
[out] | sumSquares | Pointer to the variable where the sum of squares will be stored. |
[in] | length | Length of the array specified by v. If length is zero, the computed sum of squares will be 0. |
YepStatusOk | The computation finished successfully. |
YepStatusNullPointer | v or sumSquares argument is null. |
YepStatusMisalignedPointer | v or sumSquares argument is not naturally aligned. |
Architecture | Target microarchitecture | Required instruction extensions |
---|---|---|
x86-64 | Intel Nehalem | SSE, SSE2 |
x86-64 | Intel Sandy Bridge | AVX |
x86-64 | Intel Haswell | AVX, FMA3 |
x86-64 | AMD Bulldozer | AVX, FMA4 |