Skip to content

Instantly share code, notes, and snippets.

@Benitoite
Last active August 28, 2024 05:10
Show Gist options
  • Save Benitoite/9982cd08a997f79d89c27b0c05ce2f83 to your computer and use it in GitHub Desktop.
Save Benitoite/9982cd08a997f79d89c27b0c05ce2f83 to your computer and use it in GitHub Desktop.
wintimer
git clone https://github.com/Benitoite/raw-test .\raw-test; $processor = Get-ComputerInfo -Property CsProcessors; $sockets = (Get-CimInstance Win32_Processor).SocketDesignation.Count ; $num = ($processor.CsProcessors | findstr NumberOfLogicalProcessors).Split(' ')[2]; $num *= $sockets ; $name = ($processor.CsProcessors | findstr Name).Split(':')[-1]; $mhz = ($processor.CsProcessors | findstr Max).Split(':')[-1]; $proc = (cat .\AboutThisBuild.txt | findstr Processor); echo "``````" "================================"; echo "Available threads = $num / CPU =$name / $mhz MHz / Target = $proc"; for ($threads = 2; $threads -le $num; $threads *= 2) { $env:OMP_NUM_THREADS=$threads; $t = 0; $n = 5; $x = 0; for ($i = 0; $i -lt $n; $i++) { $x+=(Measure-Command { .\rawtherapee-cli.exe -j -s -Y -c .\raw-test\typewriter.CR2 } | findstr Ticks).Split(': ')[-1] }; $t+=($x/$n); $x = 0; for ($i = 0; $i -lt $n; $i++) { $x+=(Measure-Command { .\rawtherapee-cli.exe -j -s -Y -c .\raw-test\naturalbridges.CR2 } | findstr Ticks).Split(': ')[-1] }; $t+=($x/$n); $x = 0; for ($i = 0; $i -lt $n; $i++) { $x+=(Measure-Command { .\rawtherapee-cli.exe -j -s -Y -c .\raw-test\beachcabin.ARW } | findstr Ticks).Split(': ')[-1] }; $t+=($x/$n); echo "$([math]::round([decimal]($t/10000),0)) total milliseconds elapsed (average of $n runs) using OMP_NUM_THREADS = $threads" }; echo "================================" "``````"
@Benitoite
Copy link
Author

Benitoite commented Jul 28, 2024

Instructions:

  1. Install the latest git for windows if you don't have it already: https://github.com/git-for-windows/git/releases/download/v2.45.2.windows.1/Git-2.45.2-64-bit.exe

  2. Download/unzip the 3 standalone builds to test:

Generic x86 (all 64-bit CPUs / Windows 7-8):

https://github.com/Benitoite/RawTherapee/releases/download/nightly-github-actions/RawTherapee_genericwin_win64_release.zip

SandyBridge-IvyBridge (circa 2011-2015 / Windows 8-10):

https://github.com/Benitoite/RawTherapee/releases/download/nightly-github-actions/RawTherapee_midwin_win64_release.zip

SkyLake-RaptorLake (circa 2015-2022 / Windows 10-11):

https://github.com/Benitoite/RawTherapee/releases/download/nightly-github-actions/RawTherapee_fastwin_win64_release.zip

  1. Run Powershell and cd into the RawTherapee program directory you would like to test. Pro Tip: type "cd " (cd then space bar) and drag the folder onto the powershell window, then press return.

  2. Run the one-liner script above: (simply copy and paste into powershell and press return.)

  3. Repeat Steps 1-2 for the next build.

  4. Post the results of your tests in this gist in the following manner (each report should be surrounded by three backticks):


```
================================
Available threads = 8  /  CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz  /  3408 MHz  /  Target = Processor: skylake-raptorlake
62421 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
41778 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
37596 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
```
```
================================
Available threads = 8  /  CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz  /  3408 MHz  /  Target = Processor: sandybridge-ivybridge
63748 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
42508 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
35636 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
```
```
================================
Available threads = 8  /  CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz  /  3408 MHz  /  Target = Processor: generic x86
64380 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
43011 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
35738 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
```



pretty version of script one-liner:

git clone https://github.com/Benitoite/raw-test .\raw-test
$processor = Get-ComputerInfo -Property CsProcessors
$num = ($processor.CsProcessors | findstr NumberOfLogicalProcessors).Split(' ')[2]
$sockets = (Get-CimInstance Win32_Processor).SocketDesignation.Count
$num *= $sockets
$name = ($processor.CsProcessors | findstr Name).Split(':')[-1]
$mhz = ($processor.CsProcessors | findstr Max).Split(':')[-1]
$proc = (cat .\AboutThisBuild.txt | findstr Processor)

echo "``````" "================================"
echo "Available threads = $num  /  CPU =$name  / $mhz MHz  /  Target = $proc"

for ($threads = 2; $threads -le $num; $threads *= 2)
  {
    $env:OMP_NUM_THREADS=$threads;
    $t = 0
    $n = 5

    $x = 0; for ($i = 0; $i -lt $n; $i++) 
      { $x+=(Measure-Command { .\rawtherapee-cli.exe -j -s -Y -c .\raw-test\typewriter.CR2 } | findstr Ticks).Split(': ')[-1] }
    $t+=($x/$n)

    $x = 0; for ($i = 0; $i -lt $n; $i++)
      { $x+=(Measure-Command { .\rawtherapee-cli.exe -j -s -Y -c .\raw-test\naturalbridges.CR2 } | findstr Ticks).Split(': ')[-1] }
    $t+=($x/$n)

    $x = 0; for ($i = 0; $i -lt $n; $i++)
      { $x+=(Measure-Command { .\rawtherapee-cli.exe -j -s -Y -c .\raw-test\beachcabin.ARW } | findstr Ticks).Split(': ')[-1] }
    $t+=($x/$n)

    echo "$([math]::round([decimal]($t/10000),0)) total milliseconds elapsed (average of $n runs) using OMP_NUM_THREADS = $threads"
  }

echo "================================" "``````"

@Benitoite
Copy link
Author

Benitoite commented Jul 28, 2024

================================
Available threads = 8 / CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz /  3408 MHz / Target = Processor: skylake-raptorlake
44807 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
30655 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
26047 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 8 / CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz /  3408 MHz / Target = Processor: sandybridge-ivybridge
45756 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
30946 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
26819 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 8 / CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz /  3408 MHz / Target = Processor: generic x86
45781 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
30916 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
26355 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================

================================
Available threads = 4  /  CPU = AMD EPYC 7763 64-Core Processor  /  2445 MHz  /  Target = Processor: skylake-raptorlake
40640 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
30768 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 4  /  CPU = AMD EPYC 7763 64-Core Processor  /  2445 MHz  /  Target = Processor: sandybridge-ivybridge
41630 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
31578 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 4  /  CPU = AMD EPYC 7763 64-Core Processor  /  2445 MHz  /  Target = Processor: generic x86
41959 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
32302 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================

@chaimav
Copy link

chaimav commented Jul 28, 2024

================================
Available threads = 24  /  CPU = 13th Gen Intel(R) Core(TM) i7-13700  /  2100 MHz  /  Target = Processor: generic x86
27082 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
18057 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
14663 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
14928 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 24  /  CPU = 13th Gen Intel(R) Core(TM) i7-13700  /  2100 MHz  /  Target = Processor: sandybridge-ivybridge
31369 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
18424 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
15234 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
15113 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 24  /  CPU = 13th Gen Intel(R) Core(TM) i7-13700  /  2100 MHz  /  Target = Processor: skylake-raptorlake
34491 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
27324 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
24788 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
24322 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================

@SilvioGrosso
Copy link

SilvioGrosso commented Jul 29, 2024


System Information

     Operating System: Windows 10 Home 64-bit (10.0, Build 19045) (19041.vb_release.191206-1406)
       System Manufacturer: Micro-Star International Co., Ltd.
         System Model: Prestige 15 A10SC
                Processor: Intel(R) Core(TM) i7-10710U CPU @ 1.10GHz (12 CPUs), ~1.6GHz
                   Memory: 16384MB RAM

Display devices

Card name: NVIDIA GeForce GTX 1650 with Max-Q Design

RESULTS:

================================
Available threads = 12 / CPU = Intel(R) Core(TM) i7-10710U CPU @ 1.10GHz /  1608 MHz / Target = Processor: generic x86
46538 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
34209 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
30573 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 12 / CPU = Intel(R) Core(TM) i7-10710U CPU @ 1.10GHz /  1608 MHz / Target = Processor: sandybridge-ivybridge
52214 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
34668 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
29545 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 12 / CPU = Intel(R) Core(TM) i7-10710U CPU @ 1.10GHz /  1608 MHz / Target = Processor: skylake-raptorlake
50492 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
32684 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
29928 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================

@SilvioGrosso
Copy link

SilvioGrosso commented Jul 29, 2024

System Information

     Operating System: Windows 10 Pro 64-bit (10.0, Build 19045) (19041.vb_release.191206-1406)
                 System Model: HP Z2 SFF G9 Workstation Desktop PC
                      Processor: 12th Gen Intel(R) Core(TM) i5-12600K (16 CPUs), ~3.7GHz
                          Memory: 16384MB RAM

Display Devices

       Card name: NVIDIA T400 4GB

RESULTS:

================================
Available threads = 16 / CPU = 12th Gen Intel(R) Core(TM) i5-12600K /  3686 MHz / Target = Processor: generic x86
44895 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
28757 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
20286 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
17384 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 16 / CPU = 12th Gen Intel(R) Core(TM) i5-12600K /  3686 MHz / Target = Processor: sandybridge-ivybridge
45948 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
28641 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
19884 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
16977 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 16 / CPU = 12th Gen Intel(R) Core(TM) i5-12600K /  3686 MHz / Target = Processor: skylake-raptorlake
45103 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
28374 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
19703 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
16992 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================

@tonyike
Copy link

tonyike commented Jul 29, 2024

================================
Available threads = 8  /  CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz  /  3401 MHz  /  Target = Processor: skylake-raptorlake
46842 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
34395 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
30936 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 8  /  CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz  /  3401 MHz  /  Target = Processor: sandybridge-ivybridge
48023 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
34563 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
31239 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================ 
Available threads = 8  /  CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz  /  3401 MHz  /  Target = Processor: generic x86 
47294 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2 
35790 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4 
30700 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8 
================================

@SilvioGrosso
Copy link

SilvioGrosso commented Jul 30, 2024


System Information

     Operating System: Windows 10 Pro 64-bit (10.0, Build 18362) (18362.19h1_release.190318-1202)
         System Model: HP ProDesk 600 G4 SFF
             Processor: Intel(R) Core(TM) i5-8500 CPU @ 3.00GHz (6 CPUs), ~3.0GHz
               Memory: 16384MB RAM

Display Devices

       Card name: Intel(R) UHD Graphics 630

RESULTS:

================================
Available threads = 6 / CPU = Intel(R) Core(TM) i5-8500 CPU @ 3.00GHz /  3000 MHz / Target = Processor: generic x86
49716 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
35320 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 6 / CPU = Intel(R) Core(TM) i5-8500 CPU @ 3.00GHz /  3000 MHz / Target = Processor: sandybridge-ivybridge
49459 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
34910 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 6 / CPU = Intel(R) Core(TM) i5-8500 CPU @ 3.00GHz /  3000 MHz / Target = Processor: skylake-raptorlake
49750 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
34505 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================

@SilvioGrosso
Copy link

SilvioGrosso commented Jul 30, 2024


System Information

   Operating System: Windows 11 Pro 64-bit (10.0, Build 22621) (22621.ni_release.220506-1250)
     System Manufacturer: LENOVO
           Processor: AMD Ryzen 7 PRO 4750G with Radeon Graphics (16 CPUs), ~3.6GHz
               Memory: 16384MB RAM

Display Devices

       Card name: AMD Radeon(TM) Graphics

RESULTS:

================================
Available threads = 16 / CPU = AMD Ryzen 7 PRO 4750G with Radeon Graphics /  3600 MHz / Target = Processor: generic x86
41535 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
29975 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
27861 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
30086 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 16 / CPU = AMD Ryzen 7 PRO 4750G with Radeon Graphics /  3600 MHz / Target = Processor: sandybridge-ivybridge
40998 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
29729 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
27739 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
30334 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 16 / CPU = AMD Ryzen 7 PRO 4750G with Radeon Graphics /  3600 MHz / Target = Processor: skylake-raptorlake
41017 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
29786 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
27744 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
30213 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================

@SilvioGrosso
Copy link

SilvioGrosso commented Jul 31, 2024


System Information

     Operating System: Windows 11 Home 64-bit (10.0, Build 22631) (22621.ni_release.220506-1250)
         System Manufacturer: Acer - Predator PH317-56
                 Processor: 12th Gen Intel(R) Core(TM) i7-12700H (20 CPUs), ~2.7GHz
                        Memory: 32768MB RAM

Display devices

       Card name: NVIDIA GeForce RTX 3070 Ti Laptop GPU

RESULTS:

================================
Available threads = 20 / CPU = 12th Gen Intel(R) Core(TM) i7-12700H /  2700 MHz / Target = Processor: generic x86
48426 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
28899 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
23573 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
25115 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 20 / CPU = 12th Gen Intel(R) Core(TM) i7-12700H /  2700 MHz / Target = Processor: sandybridge-ivybridge
54110 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
29962 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
24928 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
32158 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================

================================
Available threads = 20 / CPU = 12th Gen Intel(R) Core(TM) i7-12700H /  2700 MHz / Target = Processor: skylake-raptorlake
54053 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
29972 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
24789 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
30107 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================

@SilvioGrosso
Copy link

SilvioGrosso commented Jul 31, 2024

System Information

     Operating System: Windows 10 Home 64-bit (10.0, Build 19045) (19041.vb_release.191206-1406)
       System Manufacturer: LENOVO - 90NA001RIX
             Processor: Intel(R) Core(TM) i5-10400 CPU @ 2.90GHz (12 CPUs), ~2.9GHz
               Memory: 8192MB RAM

Display Devices

       Card name: Intel(R) UHD Graphics 630
================================
Available threads = 12 / CPU = Intel(R) Core(TM) i5-10400 CPU @ 2.90GHz /  2904 MHz / Target = Processor: generic x86
46444 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
32377 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
29014 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 12 / CPU = Intel(R) Core(TM) i5-10400 CPU @ 2.90GHz /  2904 MHz / Target = Processor: sandybridge-ivybridge
46463 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
32346 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
29087 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 12 / CPU = Intel(R) Core(TM) i5-10400 CPU @ 2.90GHz /  2904 MHz / Target = Processor: skylake-raptorlake
46081 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
32172 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
28971 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================

@SilvioGrosso
Copy link


System Information

Operating System: Windows 10 Home 64-bit (10.0, Build 19044) (19041.vb_release.191206-1406)
System Manufacturer: ASUSTeK COMPUTER - X302UJ
Processor: Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz (4 CPUs), ~2.6GHz
Memory: 8192MB RAM


Display devices

       Card name: NVIDIA GeForce 920M
================================
Available threads = 4 / CPU = Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz /  2592 MHz / Target = Processor: generic x86
64245 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
50253 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 4 / CPU = Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz /  2592 MHz / Target = Processor: sandybridge-ivybridge
63369 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
50047 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 4 / CPU = Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz /  2592 MHz / Target = Processor: skylake-raptorlake
62785 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
49675 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================

@SilvioGrosso
Copy link


System Information

     Operating System: Windows 10 Pro for Workstations 64-bit (10.0, Build 19044) (19041.vb_release.191206-1406)
         System Model: HP Z4 G4 Workstation
            Processor: Intel(R) Xeon(R) W-2223 CPU @ 3.60GHz (8 CPUs), ~3.6GHz
               Memory: 65536MB RAM

Display Devices

       Card name: NVIDIA T1000
================================
Available threads = 8 / CPU = Intel(R) Xeon(R) W-2223 CPU @ 3.60GHz /  3600 MHz / Target = Processor: generic x86
46518 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
28263 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
23211 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 8 / CPU = Intel(R) Xeon(R) W-2223 CPU @ 3.60GHz /  3600 MHz / Target = Processor: sandybridge-ivybridge
46263 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
28326 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
23179 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 8 / CPU = Intel(R) Xeon(R) W-2223 CPU @ 3.60GHz /  3600 MHz / Target = Processor: skylake-raptorlake
45711 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
28011 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
22954 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================

@Lawrence37
Copy link

================================
Available threads = 4 / CPU = Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz /  2601 MHz / Target = Processor: generic x86
67089 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
54341 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 4 / CPU = Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz /  2601 MHz / Target = Processor: sandybridge-ivybridge
69117 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
52926 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 4 / CPU = Intel(R) Core(TM) i7-6500U CPU @ 2.50GHz /  2601 MHz / Target = Processor: skylake-raptorlake
68949 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
53348 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================

@marter001
Copy link

marter001 commented Aug 5, 2024

================================
Available threads = 8  /  CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz  /  3408 MHz  /  Target = Processor: generic x86
44982 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
31102 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
26756 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 8  /  CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz  /  3408 MHz  /  Target = Processor: sandybridge-ivybridge
45878 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
31792 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
27732 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 8  /  CPU = Intel(R) Core(TM) i7-6700 CPU @ 3.40GHz  /  3408 MHz  /  Target = Processor: skylake-raptorlake
44999 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
31871 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
27553 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================

@Benitoite
Copy link
Author

Benitoite commented Aug 8, 2024

================================
Available threads = 12 / CPU = 13th Gen Intel(R) Core™ i7-1355U / 1700 MHz / Target = Processor: generic x86
32423 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
25558 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
21732 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 12 / CPU = 13th Gen Intel(R) Core™ i7-1355U / 1700 MHz / Target = Processor: sandybridge-ivybridge
33390 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
25824 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
22426 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 12 / CPU = 13th Gen Intel(R) Core™ i7-1355U / 1700 MHz / Target = Processor: skylake-raptorlake
30006 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
25740 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
23402 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================

@Benitoite
Copy link
Author

Benitoite commented Aug 18, 2024

================================
Available threads = 16 / CPU = AMD Ryzen 9 5900HX with Radeon Graphics  /  3301 MHz / Target = Processor: generic x86
36997 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
21236 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
16870 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
16433 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 16 / CPU = AMD Ryzen 9 5900HX with Radeon Graphics  /  3301 MHz / Target = Processor: sandybridge-ivybridge
35356 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
20612 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
16425 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
16188 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 16 / CPU = AMD Ryzen 9 5900HX with Radeon Graphics  /  3301 MHz / Target = Processor: skylake-raptorlake
34356 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
20331 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
16268 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
16081 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================

@Benitoite
Copy link
Author

Benitoite commented Aug 21, 2024

================================
Available threads = 8 / CPU = Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz /  4201 MHz / Target = Processor: generic x86
49539 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
34734 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
30618 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 8 / CPU = Intel(R) Core(TM) i7-3770K CPU @ 3.50GHz /  4201 MHz / Target = Processor: sandybridge-ivybridge
48427 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
34116 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
30256 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================

@Benitoite
Copy link
Author

================================
Available threads = 16 / CPU = Intel(R) Core(TM) i9-9900KS CPU @ 4.00GHz /  4008 MHz / Target = Processor: generic x86
34081 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
21858 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
21062 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
19592 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 16 / CPU = Intel(R) Core(TM) i9-9900KS CPU @ 4.00GHz /  4008 MHz / Target = Processor: sandybridge-ivybridge
34081 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
21959 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
19051 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
18913 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 16 / CPU = Intel(R) Core(TM) i9-9900KS CPU @ 4.00GHz /  4008 MHz / Target = Processor: skylake-raptorlake
33708 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
22018 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
20196 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
18876 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================

@Benitoite
Copy link
Author

Benitoite commented Aug 24, 2024

================================
Available threads = 28 / CPU = Intel(R) Xeon(R) CPU E5-2697 v3 @ 2.60GHz / 2601 MHz / Target = Processor: generic x86
58966 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
35114 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
23898 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
21951 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 28 / CPU = Intel (R) Xeon(R) CPU E5-2697 v3 @ 2.60GHz / 2601 MHz / Target = Processor: sandybridge-ivybridge
57590 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
34143 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
22609 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
21031 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================
================================
Available threads = 28 / CPU = Intel(R) Xeon(R) CPU E5-2697 V3 @ 2.60GHz / 2601 MHz / Target = Processor: skylake-raptorlake
59062 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
36041 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
25024 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
23098 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 16
================================

@Benitoite
Copy link
Author

================================
Available threads = 8 / CPU = Intel(R) Xeon(R) CPU E3-1275 V2 @ 3.50GHz /  3501 MHz / Target = Processor: generic x86
56608 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
40286 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
33342 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================
================================
Available threads = 8 / CPU = Intel(R) Xeon(R) CPU E3-1275 V2 @ 3.50GHz /  3501 MHz / Target = Processor: sandybridge-ivybridge
55199 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
36673 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
33284 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 8
================================

@Benitoite
Copy link
Author

================================
Available threads = 4 / CPU = Intel(R) Core(TM) i5-6260U CPU @ 1.80GHz /  1800 MHz / Target = Processor: generic x86
66715 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
53584 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 4 / CPU = Intel(R) Core(TM) i5-6260U CPU @ 1.80GHz /  1800 MHz / Target = Processor: sandybridge-ivybridge
65804 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
52279 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================
================================
Available threads = 4 / CPU = Intel(R) Core(TM) i5-6260U CPU @ 1.80GHz /  1800 MHz / Target = Processor: skylake-raptorlake
64647 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 2
52658 total milliseconds elapsed (average of 5 runs) using OMP_NUM_THREADS = 4
================================

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment