Yeppp!
 All Classes Groups Pages
Data Types
Sum of squares (squared L2 norm)

Data Types

interface  yepcore::yepCore_SumSquares_V32f_S32f
 Computes the sum of squares of single precision (32-bit) floating-point array elements. More...
 
interface  yepcore::yepCore_SumSquares_V64f_S64f
 Computes the sum of squares of double precision (64-bit) floating-point array elements. More...
 

Detailed Description


Data Type Documentation

interface yepcore::yepCore_SumSquares_V32f_S32f

Computes the sum of squares of single precision (32-bit) floating-point array elements.

Parameters
[in]vThe array of elements which will be squared (without write-back) and summed up.
[out]sumSquaresThe variable where the sum of squares will be stored.
[in]lengthLength of the array specified by v. If length is zero, the computed sum of squares will be 0.
Return values
0The computation finished successfully.
2v or sumSquares argument is not naturally aligned.
Optimized implementations
ArchitectureTarget microarchitectureRequired instruction extensions
x86-64Intel NehalemSSE, SSE3
x86-64Intel Sandy BridgeAVX
x86-64Intel HaswellAVX, FMA3
x86-64AMD BulldozerAVX, FMA4
interface yepcore::yepCore_SumSquares_V64f_S64f

Computes the sum of squares of double precision (64-bit) floating-point array elements.

Parameters
[in]vThe array of elements which will be squared (without write-back) and summed up.
[out]sumSquaresThe variable where the sum of squares will be stored.
[in]lengthLength of the array specified by v. If length is zero, the computed sum of squares will be 0.
Return values
0The computation finished successfully.
2v or sumSquares argument is not naturally aligned.
Optimized implementations
ArchitectureTarget microarchitectureRequired instruction extensions
x86-64Intel NehalemSSE, SSE2
x86-64Intel Sandy BridgeAVX
x86-64Intel HaswellAVX, FMA3
x86-64AMD BulldozerAVX, FMA4