Created
January 13, 2023 13:20
-
-
Save ErwanAliasr1/f50cc41d6c08ba6da7ee88980960ea0f to your computer and use it in GitHub Desktop.
AMD Zen4 7950X avx-turbo
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
[erwan@7950 avx-turbo]$ ./avx-turbo | |
CPUID highest leaf : [10h] | |
Running as root : [NO ] | |
MSR reads supported : [NO ] | |
CPU pinning enabled : [YES] | |
CPU supports zeroupper: [YES] | |
CPU supports AVX2 : [YES] | |
CPU supports AVX-512F : [YES] | |
CPU supports AVX-512VL: [YES] | |
CPU supports AVX-512BW: [YES] | |
CPU supports AVX-512CD: [YES] | |
CPUID doesn't support leaf 0x15, falling back to manual TSC calibration. | |
tsc_freq = 4500.0 MHz (from calibration loop) | |
CPU brand string: AMD Ryzen 9 7950X 16-Core Processor | |
32 available CPUs: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31] | |
16 physical cores: [0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15] | |
Will test up to 16 CPUs | |
Cores | ID | Description | OVRLP3 | Mops | |
1 | pause_only | pause instruction | 1.000 | 8396 | |
1 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1387 | |
1 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1387 | |
1 | scalar_iadd | Scalar integer adds | 1.000 | 5747 | |
1 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5549 | |
1 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5549 | |
1 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5549 | |
1 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5549 | |
1 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5549 | |
1 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5549 | |
1 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22222 | |
1 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22222 | |
1 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32680 | |
1 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32573 | |
1 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32680 | |
1 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5747 | |
1 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5747 | |
1 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5750 | |
1 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5750 | |
1 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5747 | |
1 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5747 | |
1 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5549 | |
1 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5549 | |
1 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5549 | |
1 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11099 | |
1 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11099 | |
1 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5549 | |
1 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2775 | |
1 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2775 | |
1 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2775 | |
1 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11099 | |
1 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11099 | |
1 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5549 | |
1 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1850 | |
1 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1850 | |
1 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1850 | |
1 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5747 | |
1 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5750 | |
1 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5750 | |
1 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1387 | |
1 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1387 | |
1 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1388 | |
1 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11099 | |
1 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11099 | |
1 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5552 | |
1 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1110 | |
1 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5552 | |
1 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1110 | |
1 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5552 | |
Cores | ID | Description | OVRLP3 | Mops | |
2 | pause_only | pause instruction | 1.000 | 8410, 8410 | |
2 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1387, 1387 | |
2 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1387, 1387 | |
2 | scalar_iadd | Scalar integer adds | 1.000 | 5747, 5747 | |
2 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5549, 5549 | |
2 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5549, 5549 | |
2 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5549, 5549 | |
2 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5549, 5549 | |
2 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5549, 5549 | |
2 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5549, 5549 | |
2 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22222, 22222 | |
2 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22222, 22222 | |
2 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32680, 32680 | |
2 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32680, 32680 | |
2 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32680, 32680 | |
2 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5747, 5747 | |
2 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5750, 5501 | |
2 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5747, 5504 | |
2 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5750, 5747 | |
2 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5750, 5747 | |
2 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5747, 4602 | |
2 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549 | |
2 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549 | |
2 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549 | |
2 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11013, 11013 | |
2 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11013, 11013 | |
2 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549 | |
2 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775 | |
2 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775 | |
2 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775 | |
2 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099 | |
2 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099 | |
2 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5549, 5549 | |
2 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1850, 1850 | |
2 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1850, 1850 | |
2 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1850, 1850 | |
2 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5750, 5747 | |
2 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5747, 5501 | |
2 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5556, 5747 | |
2 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1387, 1387 | |
2 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1387, 1387 | |
2 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1387, 1387 | |
2 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11013 | |
2 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11013, 11013 | |
2 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5546, 5546 | |
2 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1110, 1110 | |
2 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5549, 5549 | |
2 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1110, 1110 | |
2 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5549, 5549 | |
Cores | ID | Description | OVRLP3 | Mops | |
3 | pause_only | pause instruction | 1.000 | 8340, 8340, 8340 | |
3 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1387, 1387, 1387 | |
3 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1387, 1387, 1388 | |
3 | scalar_iadd | Scalar integer adds | 1.000 | 5747, 5747, 5747 | |
3 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5549, 5549, 5549 | |
3 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5549, 5549, 5549 | |
3 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5549, 5549, 5549 | |
3 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549 | |
3 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549 | |
3 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549 | |
3 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 21978, 22026 | |
3 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22222, 22222, 22222 | |
3 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362 | |
3 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32680, 32680, 32680 | |
3 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 33784, 27100, 33784 | |
3 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5747, 5747, 5747 | |
3 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5747, 5747, 5747 | |
3 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5747, 5747, 5747 | |
3 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5747, 5747, 5747 | |
3 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5750, 5747, 5747 | |
3 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5750, 5750, 5747 | |
3 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549 | |
3 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549 | |
3 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549 | |
3 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11099, 11099, 11099 | |
3 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11099, 11099, 11099 | |
3 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549 | |
3 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775 | |
3 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775 | |
3 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775 | |
3 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099, 11099 | |
3 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099, 11099 | |
3 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5549, 5549, 5549 | |
3 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850 | |
3 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850 | |
3 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850 | |
3 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5747, 5750, 5747 | |
3 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5750, 5747, 5750 | |
3 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5747, 5747, 5747 | |
3 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1387, 1387, 1387 | |
3 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1387, 1387, 1387 | |
3 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1387, 1387, 1387 | |
3 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11099, 11099, 11099 | |
3 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11099, 11099, 11099 | |
3 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5549, 5552, 5552 | |
3 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1110, 1110, 1110 | |
3 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5546, 5549, 5549 | |
3 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1110, 1110, 1110 | |
3 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5546, 5546, 5546 | |
Cores | ID | Description | OVRLP3 | Mops | |
4 | pause_only | pause instruction | 1.000 | 8410, 8410, 8410, 8410 | |
4 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1387, 1387, 1387, 1387 | |
4 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1387, 1387, 1387, 1387 | |
4 | scalar_iadd | Scalar integer adds | 1.000 | 5747, 5750, 5747, 5747 | |
4 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5546, 5546, 5549, 5546 | |
4 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22222, 22222, 22222, 22222 | |
4 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22222, 22222, 22124, 22222 | |
4 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 33784, 27027, 33784, 33784 | |
4 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32468, 31153, 32362, 32362 | |
4 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 33784, 27100, 33784, 33784 | |
4 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5750, 5750, 5750, 5750 | |
4 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5747, 5750, 5747, 5747 | |
4 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5747, 5747, 5747, 5747 | |
4 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5747, 5747, 5747, 5747 | |
4 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5747, 5747, 5747, 5747 | |
4 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5750, 5750, 5747, 5750 | |
4 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11099, 11099, 11099, 11099 | |
4 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11099, 11099, 11099, 11099 | |
4 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775, 2775 | |
4 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775, 2775 | |
4 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2775, 2775, 2775, 2775 | |
4 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099, 11099, 11099 | |
4 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11099, 11099, 11099, 11099 | |
4 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850, 1850 | |
4 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850, 1850 | |
4 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1850, 1850, 1850, 1850 | |
4 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5747, 5747, 5747, 5750 | |
4 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5747, 5750, 5747, 5747 | |
4 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5747, 5747, 5747, 5747 | |
4 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1387, 1387, 1387, 1387 | |
4 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1387, 1387, 1387, 1387 | |
4 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1388, 1388, 1387, 1388 | |
4 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11099, 11099, 11099, 11099 | |
4 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11099, 11099, 11099, 11099 | |
4 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5549, 5549, 5549, 5549 | |
4 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1110, 1110, 1110, 1110 | |
4 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5549, 5549, 5549, 5546 | |
4 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1110, 1110, 1110, 1110 | |
4 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5549, 5549, 5549, 5549 | |
Cores | ID | Description | OVRLP3 | Mops | |
5 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333 | |
5 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375 | |
5 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375 | |
5 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5498, 5501, 5501, 5501 | |
5 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 22026, 22026, 22026 | |
5 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 21978, 21978, 21978, 22026 | |
5 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362 | |
5 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362 | |
5 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362 | |
5 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501 | |
5 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5501, 5501 | |
5 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5498, 5498, 5498, 5501, 5498 | |
5 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 10989, 11001 | |
5 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001 | |
5 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750 | |
5 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750 | |
5 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750 | |
5 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001 | |
5 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 10989, 11001 | |
5 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1833 | |
5 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1833, 1833, 1833 | |
5 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1833, 1833, 1834 | |
5 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375 | |
5 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375 | |
5 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375 | |
5 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001 | |
5 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001 | |
5 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100 | |
5 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
5 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100 | |
5 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501 | |
Cores | ID | Description | OVRLP3 | Mops | |
6 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333 | |
6 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375 | |
6 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375 | |
6 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501 | |
6 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 21978, 22026, 22026, 22026 | |
6 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 22026, 22026, 21978 | |
6 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362 | |
6 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362 | |
6 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362 | |
6 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501 | |
6 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501 | |
6 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501 | |
6 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501 | |
6 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001 | |
6 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001 | |
6 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750 | |
6 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750 | |
6 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750 | |
6 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001 | |
6 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001 | |
6 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1834, 1834 | |
6 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1834, 1834, 1833, 1834 | |
6 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1833, 1833, 1834, 1833 | |
6 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375 | |
6 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375 | |
6 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375 | |
6 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001 | |
6 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001 | |
6 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100 | |
6 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
6 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100 | |
6 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501 | |
Cores | ID | Description | OVRLP3 | Mops | |
7 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333 | |
7 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
7 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
7 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 21978, 21978, 22026, 22026, 22026, 21978 | |
7 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 21978, 21978, 22026, 21978, 22026, 21978 | |
7 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362 | |
7 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362 | |
7 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362 | |
7 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5498, 5501, 5498, 5501, 5501, 5501, 5501 | |
7 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498 | |
7 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5498 | |
7 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 10989, 10989, 11001, 11001, 11001 | |
7 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001 | |
7 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750 | |
7 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750 | |
7 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750 | |
7 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 10989, 11001, 11001, 11001, 11001, 11001, 11001 | |
7 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 11001 | |
7 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1833, 1833, 1833, 1834, 1833 | |
7 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1834, 1834, 1834, 1834, 1834 | |
7 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1833, 1834, 1833, 1834, 1833 | |
7 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5498 | |
7 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
7 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
7 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
7 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001 | |
7 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001 | |
7 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100 | |
7 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
7 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100 | |
7 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
Cores | ID | Description | OVRLP3 | Mops | |
8 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333 | |
8 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
8 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
8 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 21978, 21978, 21978, 21978, 22026, 22026, 22026 | |
8 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 21978, 22026, 21978, 21978, 22026, 21978, 21978, 22026 | |
8 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32258, 32362, 32362, 32362, 32362 | |
8 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362 | |
8 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362 | |
8 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5501 | |
8 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5498 | |
8 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5498 | |
8 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501 | |
8 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 10989, 11001 | |
8 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 11001, 11001 | |
8 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750 | |
8 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750 | |
8 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750 | |
8 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 10989, 11001, 11001 | |
8 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 10989, 11001, 11001, 11001 | |
8 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1833, 1834, 1833, 1833 | |
8 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1833, 1834, 1833, 1834, 1833 | |
8 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1834, 1833, 1834, 1833, 1834, 1833 | |
8 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501 | |
8 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5501 | |
8 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
8 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
8 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375 | |
8 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001 | |
8 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001 | |
8 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100 | |
8 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
8 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100 | |
8 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501 | |
Cores | ID | Description | OVRLP3 | Mops | |
9 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333 | |
9 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363 | |
9 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363 | |
9 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5498, 5498, 5501, 5501, 5450 | |
9 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476 | |
9 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476 | |
9 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476 | |
9 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476 | |
9 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476 | |
9 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476 | |
9 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 21978, 21978, 22026, 22026, 22026, 21882 | |
9 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 22026, 21978, 21978, 22026, 21978, 22026, 21882 | |
9 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32258 | |
9 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32154 | |
9 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32154 | |
9 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453 | |
9 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5498, 5453 | |
9 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5498, 5501, 5501, 5501, 5501, 5450 | |
9 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5498, 5498, 5498, 5501, 5501, 5498, 5450 | |
9 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450 | |
9 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450 | |
9 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5476 | |
9 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476 | |
9 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476 | |
9 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 10989, 11001, 10953 | |
9 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 10989, 11001, 11001, 11001, 10953 | |
9 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5498, 5501, 5476 | |
9 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737 | |
9 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737 | |
9 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737 | |
9 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 10989, 10989, 11001, 11001, 11001, 11001, 11001, 11001, 10953 | |
9 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10953 | |
9 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5476 | |
9 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1834, 1834, 1834, 1834, 1825 | |
9 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1834, 1833, 1834, 1833, 1834, 1834, 1825 | |
9 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1833, 1833, 1834, 1834, 1833, 1825 | |
9 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450 | |
9 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5498, 5501, 5450 | |
9 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450 | |
9 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369 | |
9 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369 | |
9 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369 | |
9 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10953 | |
9 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10953 | |
9 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5473 | |
9 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1090 | |
9 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453 | |
9 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1090 | |
9 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450 | |
Cores | ID | Description | OVRLP3 | Mops | |
10 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8292, 8292 | |
10 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363, 1363 | |
10 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363, 1363 | |
10 | scalar_iadd | Scalar integer adds | 1.000 | 5498, 5501, 5498, 5501, 5498, 5501, 5501, 5501, 5450, 5450 | |
10 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5473, 5473 | |
10 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453, 5453 | |
10 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 22026, 21978, 21978, 22026, 21978, 21786, 21786 | |
10 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 22026, 22026, 21978, 21978, 21978, 22026, 21786, 21786 | |
10 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32051, 32051 | |
10 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32051, 32051 | |
10 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32154, 32154 | |
10 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5498, 5501, 5498, 5501, 5501, 5501, 5450, 5450 | |
10 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5498, 5501, 5498, 5501, 5501, 5498, 5450, 5450 | |
10 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5450, 5450 | |
10 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5470, 5470 | |
10 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453, 5453 | |
10 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5473, 5473 | |
10 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10905, 10905 | |
10 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10941, 10941 | |
10 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737, 2737 | |
10 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2737, 2737 | |
10 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2725, 2725 | |
10 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 11001, 10989, 10905, 10905 | |
10 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10989, 10905, 10905 | |
10 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1834, 1834, 1833, 1834, 1825, 1825 | |
10 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1834, 1834, 1834, 1834, 1817, 1817 | |
10 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1833, 1833, 1834, 1834, 1834, 1834, 1817, 1817 | |
10 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369, 1369 | |
10 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1369, 1369 | |
10 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1363, 1363 | |
10 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10905, 10905 | |
10 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10893, 10905 | |
10 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1090, 1090 | |
10 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5450, 5450 | |
10 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1090, 1090 | |
10 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5453, 5450 | |
Cores | ID | Description | OVRLP3 | Mops | |
11 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8258, 8258, 8258 | |
11 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350 | |
11 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350 | |
11 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5402, 5402, 5400 | |
11 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 21978, 21978, 21978, 21978, 22026, 21692, 21692, 21692 | |
11 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 21978, 22026, 22026, 21978, 22026, 22026, 22026, 21692, 21692, 21692 | |
11 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949 | |
11 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949 | |
11 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31847 | |
11 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5423, 5423 | |
11 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5498, 5498, 5498, 5423, 5423, 5423 | |
11 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5405, 5405, 5402 | |
11 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5423, 5423 | |
11 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5423, 5423 | |
11 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846 | |
11 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10989, 10846, 10846, 10846 | |
11 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712 | |
11 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2713, 2712 | |
11 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712 | |
11 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 10989, 10989, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846 | |
11 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 10989, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846 | |
11 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1833, 1833, 1833, 1834, 1834, 1834, 1809, 1809, 1808 | |
11 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1834, 1833, 1833, 1833, 1834, 1834, 1809, 1809, 1809 | |
11 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1833, 1833, 1834, 1834, 1834, 1808, 1808, 1809 | |
11 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400 | |
11 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5426, 5426, 5426 | |
11 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5405, 5405, 5405 | |
11 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356 | |
11 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356 | |
11 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356 | |
11 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846 | |
11 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846 | |
11 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1080, 1080, 1080 | |
11 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
11 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1080, 1080, 1080 | |
11 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426 | |
Cores | ID | Description | OVRLP3 | Mops | |
12 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8217, 8224, 8217, 8224 | |
12 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350 | |
12 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350 | |
12 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 22026, 22026, 21978, 22026, 22026, 22026, 22026, 21978, 21692, 21692, 21692, 21692 | |
12 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 21978, 22026, 21978, 22026, 22026, 22026, 21692, 21692, 21692, 21692 | |
12 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949, 31949 | |
12 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949, 31949 | |
12 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31949, 31949, 31949, 31949 | |
12 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846 | |
12 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846 | |
12 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5426, 5426, 5426, 5426 | |
12 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712, 2712 | |
12 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712, 2712 | |
12 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2712, 2712, 2712, 2712 | |
12 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846 | |
12 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 10989, 10989, 11001, 11001, 10846, 10846, 10846, 10846 | |
12 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5402, 5402, 5402, 5402 | |
12 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1834, 1833, 1833, 1833, 1834, 1808, 1808, 1808, 1808 | |
12 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1834, 1834, 1834, 1834, 1808, 1808, 1808, 1808 | |
12 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1833, 1833, 1833, 1833, 1833, 1833, 1808, 1808, 1808, 1808 | |
12 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
12 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356, 1356 | |
12 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356, 1356 | |
12 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1356, 1356, 1356, 1356 | |
12 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846 | |
12 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10846, 10846, 10846, 10846 | |
12 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5423, 5423, 5423 | |
12 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1080, 1080, 1080, 1080 | |
12 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5423, 5426, 5426, 5423 | |
12 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1080, 1080, 1080, 1080 | |
12 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400 | |
Cores | ID | Description | OVRLP3 | Mops | |
13 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8217, 8217, 8217, 8217, 8217 | |
13 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344 | |
13 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344 | |
13 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376 | |
13 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5400, 5400, 5400, 5400, 5400 | |
13 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 21978, 21978, 21978, 21978, 21978, 21978, 21598, 21598, 21598, 21598, 21598 | |
13 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 21978, 21978, 22026, 21978, 22026, 22026, 22026, 22026, 21598, 21598, 21598, 21598, 21598 | |
13 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746 | |
13 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746 | |
13 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31847, 31746, 31847, 31847, 31847 | |
13 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5498, 5400, 5400, 5400, 5400, 5400 | |
13 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5498, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799 | |
13 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799 | |
13 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2701, 2701, 2701, 2701, 2701 | |
13 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2700, 2700, 2700, 2700, 2700 | |
13 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2700, 2700, 2700, 2700, 2700 | |
13 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 10989, 11001, 11001, 10799, 10799, 10799, 10799, 10799 | |
13 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799 | |
13 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5397, 5400, 5400 | |
13 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1833, 1834, 1834, 1833, 1834, 1834, 1800, 1800, 1800, 1800, 1800 | |
13 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1834, 1833, 1834, 1834, 1834, 1833, 1834, 1800, 1800, 1800, 1800, 1800 | |
13 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1833, 1834, 1833, 1834, 1834, 1834, 1800, 1800, 1800, 1800, 1800 | |
13 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5498, 5498, 5498, 5501, 5501, 5498, 5376, 5376, 5376, 5376, 5376 | |
13 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5498, 5501, 5501, 5498, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350 | |
13 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350 | |
13 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350 | |
13 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10811, 10799 | |
13 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10811, 10811, 10799, 10799, 10799 | |
13 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5402, 5400, 5400, 5400, 5400 | |
13 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1075, 1075, 1075, 1075, 1075 | |
13 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400 | |
13 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1075, 1075, 1075, 1075, 1075 | |
13 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5397, 5397, 5397, 5397, 5397 | |
Cores | ID | Description | OVRLP3 | Mops | |
14 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8183, 8183, 8183, 8183, 8183, 8183 | |
14 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344 | |
14 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344 | |
14 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 22026, 21978, 22026, 21978, 21978, 22026, 21598, 21598, 21598, 21598, 21598, 21598 | |
14 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 22026, 22026, 21978, 21978, 22026, 22026, 21598, 21598, 21598, 21598, 21598, 21598 | |
14 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746, 31746 | |
14 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746, 31746 | |
14 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31847, 31847, 31847, 31746, 31847 | |
14 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5498, 5501, 5498, 5501, 5498, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5498, 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5498, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799 | |
14 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799 | |
14 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400 | |
14 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2700, 2700, 2700, 2700, 2700, 2700 | |
14 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2700, 2700, 2700, 2700, 2700, 2700 | |
14 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2699, 2699, 2698, 2699, 2699, 2699 | |
14 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 10989, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799 | |
14 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799 | |
14 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1834, 1834, 1834, 1834, 1800, 1800, 1800, 1800, 1800, 1800 | |
14 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1833, 1834, 1834, 1833, 1834, 1834, 1800, 1800, 1800, 1800, 1800, 1800 | |
14 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1833, 1834, 1833, 1834, 1800, 1800, 1800, 1800, 1800, 1800 | |
14 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5498, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376 | |
14 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350, 1350 | |
14 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350, 1350 | |
14 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350, 1350 | |
14 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10799, 10799, 10799, 10799 | |
14 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10799, 10799, 10811, 10799, 10799, 10799 | |
14 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373 | |
14 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1075, 1075, 1075, 1075, 1075, 1075 | |
14 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373 | |
14 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1075, 1075, 1075, 1075, 1075, 1075 | |
14 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373 | |
Cores | ID | Description | OVRLP3 | Mops | |
15 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8183, 8183, 8183, 8183, 8183, 8183, 8183 | |
15 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344 | |
15 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1338, 1338, 1338, 1338, 1338, 1338, 1338 | |
15 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 22026, 22026, 22026, 21978, 21978, 21978, 21978, 21505, 21505, 21505, 21505, 21505, 21505, 21505 | |
15 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 21978, 21978, 21978, 21978, 22026, 22026, 21505, 21505, 21505, 21505, 21505, 21505, 21505 | |
15 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746, 31746, 31746 | |
15 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31746, 31746, 31746, 31746, 31746, 31746, 31746 | |
15 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31646, 31646, 31646, 31646, 31646, 31646, 31646 | |
15 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5498, 5501, 5498, 5498, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5501, 5501, 5498, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5400, 5400, 5400, 5400, 5400, 5400, 5400 | |
15 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5397, 5400, 5400, 5400, 5400, 5400, 5400 | |
15 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 10989, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10741 | |
15 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10787, 10787, 10787, 10787, 10799, 10787, 10787 | |
15 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2688, 2687, 2687, 2687, 2687, 2688, 2688 | |
15 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2688, 2688, 2688, 2688, 2688, 2688, 2688 | |
15 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2688, 2687, 2687, 2688, 2687, 2688, 2687 | |
15 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10741, 10753, 10753, 10753, 10753, 10753 | |
15 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 10989, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10741, 10753 | |
15 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1834, 1834, 1834, 1834, 1834, 1800, 1800, 1800, 1800, 1800, 1800, 1800 | |
15 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1834, 1834, 1833, 1833, 1833, 1834, 1800, 1800, 1800, 1800, 1800, 1800, 1800 | |
15 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1834, 1834, 1833, 1834, 1834, 1834, 1834, 1792, 1792, 1792, 1792, 1792, 1792, 1792 | |
15 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5498, 5501, 5498, 5501, 5501, 5498, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
15 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1350, 1350, 1350, 1350, 1350, 1350, 1350 | |
15 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344 | |
15 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344 | |
15 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753 | |
15 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10695, 10695, 10695, 10695, 10695, 10695, 10695 | |
15 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373 | |
15 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1070, 1070, 1070, 1070, 1070, 1070, 1070 | |
15 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373 | |
15 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1070, 1070, 1070, 1070, 1070, 1070, 1070 | |
15 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373 | |
Cores | ID | Description | OVRLP3 | Mops | |
16 | pause_only | pause instruction | 1.000 | 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8333, 8183, 8183, 8183, 8183, 8183, 8183, 8183, 8183 | |
16 | ucomis_clean | scalar ucomis (w/ vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1338, 1338, 1338, 1338, 1338, 1338, 1338, 1338 | |
16 | ucomis_dirty | scalar ucomis (no vzeroupper) | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1338, 1337, 1338, 1338, 1338, 1338, 1338, 1338 | |
16 | scalar_iadd | Scalar integer adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5498, 5501, 5501, 5350, 5350, 5350, 5350, 5350, 5350, 5350, 5350 | |
16 | avx128_iadd | 128-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx256_iadd | 256-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx512_iadd | 512-bit integer serial adds | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx128_iadd16 | 128-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx256_iadd16 | 256-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx512_iadd16 | 512-bit integer serial adds zmm16 | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx128_iadd_t | 128-bit integer parallel adds | 1.000 | 21978, 21978, 21978, 21978, 22026, 21978, 22026, 22026, 21505, 21505, 21505, 21505, 21505, 21505, 21505, 21505 | |
16 | avx256_iadd_t | 256-bit integer parallel adds | 1.000 | 22026, 22026, 21978, 21978, 22026, 21978, 22026, 22026, 21505, 21505, 21505, 21459, 21505, 21505, 21505, 21505 | |
16 | avx128_xor_zero | 128-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31646, 31646, 31646, 31646, 31646, 31646, 31646, 31646 | |
16 | avx256_xor_zero | 256-bit zeroing xor | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31646, 31646, 31646, 31646, 31646, 31646, 31646, 31646 | |
16 | avx512_xor_zero | 512-bit zeroing xord | 1.000 | 32362, 32362, 32362, 32362, 32362, 32362, 32362, 32362, 31646, 31646, 31646, 31646, 31646, 31646, 31646, 31646 | |
16 | avx128_mov_sparse | 128-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx256_mov_sparse | 256-bit reg-reg mov | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx512_mov_sparse | 512-bit reg-reg mov | 1.000 | 5498, 5501, 5501, 5501, 5498, 5501, 5501, 5498, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx128_merge_sparse | 128-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5498, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373, 5373 | |
16 | avx256_merge_sparse | 256-bit reg-reg merge mov | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5498, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx512_merge_sparse | 512-bit reg-reg merge mov | 1.000 | 5501, 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5373, 5376, 5376, 5376, 5376 | |
16 | avx128_vshift | 128-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx256_vshift | 256-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx512_vshift | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx128_vshift_t | 128-bit variable shift (vpsrlvd) | 1.000 | 11001, 10989, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753, 10753 | |
16 | avx256_vshift_t | 256-bit variable shift (vpsrlvd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 10989, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753, 10753 | |
16 | avx512_vshift_t | 512-bit variable shift (vpsrlvd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx128_vlzcnt | 128-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2688, 2687, 2687, 2687, 2687, 2687, 2687, 2687 | |
16 | avx256_vlzcnt | 256-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2687, 2687, 2687, 2687, 2687, 2687, 2687, 2687 | |
16 | avx512_vlzcnt | 512-bit lzcnt (vplzcntd) | 1.000 | 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2750, 2687, 2687, 2687, 2687, 2687, 2687, 2687, 2687 | |
16 | avx128_vlzcnt_t | 128-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 10989, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10741, 10753, 10753, 10753 | |
16 | avx256_vlzcnt_t | 256-bit lzcnt (vplzcntd) | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10741, 10753, 10753, 10753, 10753, 10753, 10753 | |
16 | avx512_vlzcnt_t | 512-bit lzcnt (vplzcntd) | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx128_imul | 128-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1833, 1834, 1834, 1834, 1833, 1833, 1792, 1792, 1792, 1792, 1792, 1792, 1792, 1792 | |
16 | avx256_imul | 256-bit integer muls (vpmuldq) | 1.000 | 1833, 1833, 1834, 1833, 1834, 1834, 1834, 1833, 1792, 1792, 1792, 1792, 1792, 1792, 1792, 1792 | |
16 | avx512_imul | 512-bit integer muls (vpmuldq) | 1.000 | 1834, 1833, 1833, 1834, 1834, 1834, 1834, 1834, 1792, 1792, 1792, 1792, 1792, 1792, 1792, 1792 | |
16 | avx128_fma_sparse | 128-bit 64-bit sparse FMAs | 1.000 | 5498, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5373, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx256_fma_sparse | 256-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx512_fma_sparse | 512-bit 64-bit sparse FMAs | 1.000 | 5501, 5501, 5498, 5501, 5501, 5498, 5501, 5501, 5376, 5376, 5376, 5376, 5376, 5376, 5376, 5376 | |
16 | avx128_fma | 128-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344, 1344 | |
16 | avx256_fma | 256-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344, 1344 | |
16 | avx512_fma | 512-bit serial DP FMAs | 1.000 | 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1375, 1344, 1344, 1344, 1344, 1344, 1344, 1344, 1344 | |
16 | avx128_fma_t | 128-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753, 10753 | |
16 | avx256_fma_t | 256-bit parallel DP FMAs | 1.000 | 11001, 11001, 11001, 11001, 11001, 11001, 11001, 11001, 10753, 10753, 10753, 10753, 10753, 10753, 10753, 10753 | |
16 | avx512_fma_t | 512-bit parallel DP FMAs | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5356, 5356, 5356, 5373, 5353, 5373, 5373 | |
16 | avx512_vpermw | 512-bit serial WORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1070, 1070, 1070, 1070, 1070, 1070, 1070, 1070 | |
16 | avx512_vpermw_t | 512-bit parallel WORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5350, 5350, 5350, 5350, 5350, 5350, 5350, 5350 | |
16 | avx512_vpermd | 512-bit serial DWORD permute | 1.000 | 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1100, 1070, 1070, 1070, 1070, 1070, 1070, 1070, 1070 | |
16 | avx512_vpermd_t | 512-bit parallel DWORD permute | 1.000 | 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5501, 5373, 5373, 5373, 5373, 5373, 5373, 5373, 5373 |
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment