\newpage
\subsection{Vector Operations}
Performance can be improved well beyond the results of the previous section by using hardware vector registers on supported CPUs: a single instruction then operates on several values at once.

\content{Example} In Intel AVX2, $256$b vector registers like \verb|%ymm0|, \verb|%ymm1| each hold $8$ single-precision ($32$b) or $4$ double-precision ($64$b) FP values, and can be used to perform component-wise FP operations on all of them with one instruction.
\begin{minted}{gas}
vaddps %ymm0, %ymm1, %ymm1 # Comp.-wise 32b FP add (8 lanes)
vaddpd %ymm0, %ymm1, %ymm1 # Comp.-wise 64b FP add (4 lanes)
\end{minted}
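
To make ``component-wise'' concrete, the following is a minimal scalar sketch in C of what the packed add instructions \verb|vaddps| (single precision) and \verb|vaddpd| (double precision) compute on $256$b registers. The function names \verb|model_vaddps| and \verb|model_vaddpd| and the lane-count macros are hypothetical and chosen only for illustration. The code models the semantics with ordinary loops and does not itself issue vector instructions.
\begin{minted}{c}
#include <assert.h>
#include <stddef.h>

/* Lane counts of a 256b register (illustrative names, not a real API). */
#define LANES_PS 8 /* 256b / 32b: single-precision lanes */
#define LANES_PD 4 /* 256b / 64b: double-precision lanes */

/* Scalar model of vaddps: dst[i] = a[i] + b[i] for each of the 8 lanes. */
void model_vaddps(const float *a, const float *b, float *dst) {
    assert(a != NULL && b != NULL && dst != NULL);
    for (size_t i = 0; i < LANES_PS; i++) {
        dst[i] = a[i] + b[i]; /* lanes are independent of each other */
    }
}

/* Scalar model of vaddpd: dst[i] = a[i] + b[i] for each of the 4 lanes. */
void model_vaddpd(const double *a, const double *b, double *dst) {
    assert(a != NULL && b != NULL && dst != NULL);
    for (size_t i = 0; i < LANES_PD; i++) {
        dst[i] = a[i] + b[i];
    }
}
\end{minted}
Each loop body corresponds to one lane of the register. The hardware instruction performs all lanes with a single instruction, which is where the speedup comes from. Depending on compiler and flags, a loop of this shape may also be vectorised automatically, but that is not guaranteed.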