Ignore:
Timestamp:
02/12/2023 05:46:15 AM (17 months ago)
Author:
Xi Ruoyao <xry111@…>
Branches:
11.3, 12.0, 12.1, kea, ken/TL2024, ken/inkscape-core-mods, ken/tuningfonts, lazarus, lxqt, plabs/newcss, python3.11, qt5new, rahul/power-profiles-daemon, renodr/vulkan-addition, trunk, xry111/llvm18, xry111/xf86-video-removal
Children:
f848bc0
Parents:
90354d5
git-author:
Xi Ruoyao <xry111@…> (02/12/2023 05:26:44 AM)
git-committer:
Xi Ruoyao <xry111@…> (02/12/2023 05:46:15 AM)
Message:

fftw: Add --enable-avx2, and document --enable-{sse2,avx,avx2,avx512}

The first AVX2-capable CPUs are released in 2013 (ten years ago)
and now all recent Intel Core and AMD Ryzen models support it. So
it makes sense to enable it now. AVX512F is much uncommon now for
non-server CPU models (only Rocket Lake, Alder Lake with E-cores
disabled, and Zen 4), so leave it as <option>.

File:
1 edited

Legend:

Unmodified
Added
Removed
  • general/genlib/fftw.xml

    r90354d5 rcdbdcb1c  
    9898            --enable-threads \
    9999            --enable-sse2    \
    100             --enable-avx     &amp;&amp;
     100            --enable-avx     \
     101            --enable-avx2    &amp;&amp;
    101102make</userinput></screen>
    102103
     
    125126            --enable-sse2    \
    126127            --enable-avx     \
     128            --enable-avx2    \
    127129            --enable-float   &amp;&amp;
    128130make</userinput></screen>
     
    171173
    172174      <para>
     175        <parameter>--enable-{sse2,avx,avx2}</parameter>: These enables building
     176        the optimized routines using SSE2, AVX, and AVX2 instructions.  FFTW
     177        will check if these routines can be really used on the current CPU when
     178        the FFTW library is loaded, so a FFTW build with these routines enabled
     179        can still run on a CPU without SSE2, AVX, or AVX512.  These options
     180        are not compatible with <parameter>--enable-long-double</parameter>.
     181      </para>
     182
     183      <para>
    173184        <parameter>--enable-float</parameter>: This enables building the library that
    174185        uses single precision floating point arithmetic.  It is faster but less
     
    185196      </para>
    186197
     198      <para>
     199        <option>--enable-avx512</option>: This enables building the optimized
     200        routines using AVX512F instructions.  FFTW will check if these routines
     201        can be really used on the current CPU when the FFTW library is loaded,
     202        so a FFTW build with these routines enabled can still run on a CPU
     203        without AVX512F.  Use this option if the FFTW build will be used on
     204        a CPU with AVX512F.  This option is not compatible with
     205        <option>--enable-long-double</option>.
     206      </para>
    187207  </sect2>
    188208
Note: See TracChangeset for help on using the changeset viewer.