Data Types | |
interface | yepcore::yepCore_SumSquares_V32f_S32f |
Computes the sum of squares of single precision (32-bit) floating-point array elements. More... | |
interface | yepcore::yepCore_SumSquares_V64f_S64f |
Computes the sum of squares of double precision (64-bit) floating-point array elements. More... | |
interface yepcore::yepCore_SumSquares_V32f_S32f |
Computes the sum of squares of single precision (32-bit) floating-point array elements.
[in] | v | The array of elements which will be squared (without write-back) and summed up. |
[out] | sumSquares | The variable where the sum of squares will be stored. |
[in] | length | Length of the array specified by v. If length is zero, the computed sum of squares will be 0. |
0 | The computation finished successfully. |
2 | v or sumSquares argument is not naturally aligned. |
Architecture | Target microarchitecture | Required instruction extensions |
---|---|---|
x86-64 | Intel Nehalem | SSE, SSE3 |
x86-64 | Intel Sandy Bridge | AVX |
x86-64 | Intel Haswell | AVX, FMA3 |
x86-64 | AMD Bulldozer | AVX, FMA4 |
interface yepcore::yepCore_SumSquares_V64f_S64f |
Computes the sum of squares of double precision (64-bit) floating-point array elements.
[in] | v | The array of elements which will be squared (without write-back) and summed up. |
[out] | sumSquares | The variable where the sum of squares will be stored. |
[in] | length | Length of the array specified by v. If length is zero, the computed sum of squares will be 0. |
0 | The computation finished successfully. |
2 | v or sumSquares argument is not naturally aligned. |
Architecture | Target microarchitecture | Required instruction extensions |
---|---|---|
x86-64 | Intel Nehalem | SSE, SSE2 |
x86-64 | Intel Sandy Bridge | AVX |
x86-64 | Intel Haswell | AVX, FMA3 |
x86-64 | AMD Bulldozer | AVX, FMA4 |